Skip to main content

Models in Information Retrieval

  • Chapter
  • First Online:
Book cover Lectures on Information Retrieval (ESSIR 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1980))

Included in the following conference series:

Abstract

Retrieval models form the theoretical basis for computing the answer to a query. They differ not only in the syntax and expressiveness of the query language, but also in the representation of the documents. Following Rijsbergen’s approach of regarding IR as uncertain inference, we can distinguish models according to the expressiveness of the underlying logic and the way uncertainty is handled. Classical retrieval models are based on propositional logic. In the vector space model, documents and queries are represented as vectors in a vector space spanned by the index terms, and uncertainty is modelled by considering geometric similarity. Probabilistic models make assumptions about the distribution of terms in relevant and nonrelevant documents in order to estimate the probability of relevance of a document for a query. Language models compute the probability that the query is generated from a document. All these models can be interpreted within a framework that is based on a probabilistic concept space. For IR applications dealing not only with texts, but also with multimedia or factual data, propositional logic is not suffcient. Therefore, advanced IR models use restricted forms of predicate logic as basis. Terminological/ description logics are rooted in semantic networks and terminological languages like e.g. KL-ONE. Datalog uses function-free horn clauses. Probabilistic versions of both approaches are able to cope with the intrinsic uncertainty of IR.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. M. J. Bates. Where should the person stop and the information search interface start? Information Processing & Management, 26(5): 575–591, 1990.

    Article  MathSciNet  Google Scholar 

  2. S. Ceri, G. Gottlob, and L. Tanca. Logic Programming and Databases. Springer, Berlin et al., 1990.

    Google Scholar 

  3. W.S. Cooper. Some inconsistencies and misidentified modeling assumptions in probabilistic information retrieval. ACM Transactions on Information Systems, 13(1): 100–111, Jan 1995.

    Article  Google Scholar 

  4. Fabio Crestani, Mounia Lalmas, Cornelis J. van Rijsbergen, and Iain Campbell. “ Is this document relevant?... probably ”: a survey of probabilistic models in information retrieval. ACM Computer Surveys, 30(4): 528–552, 1998.

    Article  Google Scholar 

  5. N. Fuhr. Models for retrieval with probabilistic indexing. Information Processing & Management, 25(1): 55–72, 1989.

    Article  MathSciNet  Google Scholar 

  6. N. Fuhr. Probabilistic models in information retrieval. The Computer Journal, 35(3):243–255, 1992.

    Article  MATH  Google Scholar 

  7. N. Fuhr. Information retrieval methods for multimedia objects. To appear in: Proceedings Dagstuhl WS Content-Based Image and Video Retrieval, 2000.

    Google Scholar 

  8. N. Fuhr and C. Buckley. A probabilistic learning approach for document indexing. ACM Transactions on Information Systems, 9(3):223–248, 1991.

    Article  Google Scholar 

  9. Norbert Fuhr. Probabilistic Datalog: Implementing logical information retrieval for advanced applications. Journal of the American Society for Information Science, 51(2):95–110, 2000.

    Article  MathSciNet  Google Scholar 

  10. Djoerd Hiemstra. A linguistically motivated probabilistic model of information retrieval. In C. Nikolaou and C. Stephanidis, editors, Lecture Notes In Computer Science-Research and Advanced Technology for Digital Libraries-Proceedings of the second European Conference on Research and Advanced Technology for Digital Libraries: ECDL’98, pages 569–584. Springer Verlag, 1998.

    Google Scholar 

  11. M.E. Maron and J.L. Kuhns. On relevance, probabilistic indexing, and information retrieval. Journal of the ACM, 7:216–244, 1960.

    Article  Google Scholar 

  12. C. Meghini, F. Sebastiani, U. Straccia, and C. Thanos. A model of information retrieval based on a terminological logic. In Proceedings of the Sixteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 298–308, New York, 1993. ACM.

    Google Scholar 

  13. S. R. Newcomb, N. A. Kipp, and V. T. Newcomb. The “HyTime” hypermedia/time-based document structuring language. Communications of the ACM, 34(11):67–83, November 1991.

    Article  Google Scholar 

  14. Jianyun Nie. An information retrieval model based on modal logic. Information processing & management., 25(5):477–491, 1989.

    Article  Google Scholar 

  15. J. Pearl. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufman, San Mateo, California, 1988.

    Google Scholar 

  16. J.M. Ponte and W.B. Croft. A language modeling approach to information retrieval. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 275–281, New York, 1998. ACM.

    Google Scholar 

  17. S.E. Robertson. The probability ranking principle in IR. Journal of Documentation, 33: 294–304, 1977.

    Article  Google Scholar 

  18. S.E. Robertson and K. Sparck Jones. Relevance weighting of search terms. Journal of the American Society for Information Science, 27:129–146, 1976.

    Article  Google Scholar 

  19. G. Salton, editor. The SMART Retrieval System-Experiments in Automatic Document Processing. Prentice Hall, Englewood, Cliffs, New Jersey, 1971.

    Google Scholar 

  20. Jeffrey D. Ullman. Principles of Database and Knowledge-Base Systems, volume I. Computer Science Press, Rockville (Md.), 1988.

    Google Scholar 

  21. C. J. van Rijsbergen. A non-classical logic for information retrieval. The Computer Journal, 29(6): 481–485, 1986.

    Article  MATH  Google Scholar 

  22. S.K.M. Wong and Y.Y. Yao. On modeling information retrieval with probabilistic inference. ACM Transactions on Information Systems, 13(1):38–68, 1995.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Fuhr, N. (2000). Models in Information Retrieval. In: Agosti, M., Crestani, F., Pasi, G. (eds) Lectures on Information Retrieval. ESSIR 2000. Lecture Notes in Computer Science, vol 1980. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45368-7_2

Download citation

  • DOI: https://doi.org/10.1007/3-540-45368-7_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41933-4

  • Online ISBN: 978-3-540-45368-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics