Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model

Synonyms

BIR model; Probabilistic model; RSJ model

Definition

Information retrieval (IR) systems aim to retrieve relevant documents while not retrieving non-relevant ones. This aim is the foundation and justification of the binary independence retrieval (BIR) model, which ranks documents by the ratio of the probability of relevance to the probability of non-relevance.

For a set r of relevant documents, and a set \(\bar{r}\) of non-relevant documents, the BIR model defines the following term weight and retrieval status value (RSV) for a document-query pair “d, q”:

$$\mathrm{birw}(t, r, \bar{r}) := \frac{P(t|r) \cdot P(\bar{t}|\bar{r})}{P(t|\bar{r}) \cdot P(\bar{t}|r)} \tag{1}$$

$$\mathrm{RSV}_{\mathrm{BIR}}(d, q, r, \bar{r}) := \sum_{t \in d \cap q} \log \mathrm{birw}(t, r, \bar{r}) \tag{2}$$

Here, \(P(t|r)\) is the probability that term t occurs in the relevant documents, and \(P(t|\bar{r})\) is the respective probability for term t in...
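Equations (1) and (2) can be illustrated with a minimal Python sketch. Several choices here are assumptions made for illustration and are not taken from the entry: documents and queries are modelled as sets of terms, \(P(\bar{t}|\cdot)\) is taken as \(1 - P(t|\cdot)\), and the probabilities are estimated as the fraction of documents in each set that contain the term, with add-0.5 smoothing to avoid zero probabilities. The helper names `estimate` and `rsv_bir` are hypothetical.

```python
import math


def birw(p_t_rel: float, p_t_nonrel: float) -> float:
    """BIR term weight of Eq. (1), given P(t|r) and P(t|r-bar).

    Assumes a binary event space, so P(t-bar|x) = 1 - P(t|x).
    """
    return (p_t_rel * (1.0 - p_t_nonrel)) / (p_t_nonrel * (1.0 - p_t_rel))


def estimate(term: str, docs: list[set[str]], smooth: float = 0.5) -> float:
    """Smoothed fraction of documents in `docs` that contain `term`.

    The add-0.5 smoothing is an illustrative assumption; it keeps the
    estimate strictly between 0 and 1 so that birw stays finite.
    """
    n_with_term = sum(1 for d in docs if term in d)
    return (n_with_term + smooth) / (len(docs) + 2.0 * smooth)


def rsv_bir(doc: set[str], query: set[str],
            relevant: list[set[str]], non_relevant: list[set[str]]) -> float:
    """Retrieval status value of Eq. (2): sum of log birw over t in d and q."""
    return sum(
        math.log(birw(estimate(t, relevant), estimate(t, non_relevant)))
        for t in doc & query
    )


# Toy data (made up for illustration only).
relevant = [{"sailing", "boats"}, {"sailing", "east"}]
non_relevant = [{"boats", "east"}, {"east"}]
query = {"sailing", "boats"}
doc = {"sailing", "boats", "coast"}

score = rsv_bir(doc, query, relevant, non_relevant)
```

In this toy setting, "sailing" occurs in both relevant documents and in no non-relevant one, so it contributes a positive log weight, while "boats" is equally frequent in both sets and contributes zero. The term "coast" is ignored because it is not in the query.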



Copyright information

© 2009 Springer Science+Business Media, LLC

Cite this entry

Roelleke, T., Wang, J., Robertson, S. (2009). Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model. In: LIU, L., ÖZSU, M.T. (eds) Encyclopedia of Database Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-39940-9_919
