Information retrieval

Introduction edit

Information retrieval (IR) in computing and information science is the task of identifying and retrieving information system resources that are relevant to an information need. The information need can be specified in the form of a search query. In the case of document retrieval, queries can be based on full-text or other content-based indexing. Information retrieval is the science[1] of searching for information in a document, searching for documents themselves, and also searching for the metadata that describes data, and for databases of texts, images or sounds.

Automated information retrieval systems are used to reduce what has been called information overload. An IR system is a software system that provides access to books, journals and other documents; it also stores and manages those documents. Web search engines are the most visible IR applications.

Learning Tasks edit

  • (Information Systems) Explore the learning resource about Information Systems and identify the links between different information systems and information retrieval.
  • (Mathematical Foundations) Explain, how mathematics can be used to handle
    • search requests,
    • how to represent knowledge
  • (Web Crawler) What is web crawler and why is it necessary to rely on web crawlers to create e.g. an web index to handle search requests of users.

Learning Modules edit

Overview edit

An information retrieval process begins when a user enters a query into the system. Queries are formal statements of information needs, for example search strings in web search engines. In information retrieval, a query does not uniquely identify a single object in the collection. Instead, several objects may match the query, perhaps with different degrees of relevance.

An object is an entity that is represented by information in a content collection or database. User queries are matched against the database information. However, as opposed to classical SQL queries of a database, in information retrieval the results returned may or may not match the query, so results are typically ranked. This ranking of results is a key difference of information retrieval searching compared to database searching.[2]

Depending on the application the data objects may be, for example, text documents, images,[3] audio,[4] mind maps[5] or videos. Often the documents themselves are not kept or stored directly in the IR system, but are instead represented in the system by document surrogates or metadata.

Most IR systems compute a numeric score on how well each object in the database matches the query, and rank the objects according to this value. The top ranking objects are then shown to the user. The process may then be iterated if the user wishes to refine the query.[6]

Applications edit

Areas where information retrieval techniques are employed include (the entries are in alphabetical order within each category):

General applications edit

Domain-specific applications edit

Other retrieval methods edit

Methods/Techniques in which information retrieval techniques are employed include:

Major conferences edit

Awards in the field edit

See also edit

References edit

  1. Luk, R. W. P. (2022). "Why is information retrieval a scientific discipline?". Foundations of Science 27 (2): 427–453. doi:10.1007/s10699-020-09685-x. 
  2. Jansen, B. J. and Rieh, S. (2010) The Seventeen Theoretical Constructs of Information Searching and Information Retrieval Archived 2016-03-04 at the Wayback Machine. Journal of the American Society for Information Sciences and Technology. 61(8), 1517-1534.
  3. Goodrum, Abby A. (2000). "Image Information Retrieval: An Overview of Current Research". Informing Science 3 (2). 
  4. Foote, Jonathan (1999). "An overview of audio information retrieval". Multimedia Systems 7: 2–10. doi:10.1007/s005300050106. 
  5. Beel, Jöran; Gipp, Bela; Stiller, Jan-Olaf (2009). Information Retrieval On Mind Maps - What Could It Be Good For?. Proceedings of the 5th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom'09). Washington, DC: IEEE. Archived from the original on 2011-05-13. Retrieved 2012-03-13.
  6. Frakes, William B.; Baeza-Yates, Ricardo (1992). Information Retrieval Data Structures & Algorithms. Prentice-Hall, Inc.. ISBN 978-0-13-463837-9. https://www.scribd.com/doc/13742235/Information-Retrieval-Data-Structures-Algorithms-William-B-Frakes. 

Further reading edit

  • Yeo, ShinJoung. (2023) Behind the Search Box: Google and the Global Internet Industry (U of Illinois Press, 2023) ISBN 10:0252087127 online

External links edit

  Search Wikiquote for quotations related to: Information retrieval

Page Information edit

This page was based on the following wikipedia-source page: