Model-Based Inference about IR Systems

Carterette, Ben

doi:10.1007/978-3-642-23318-0_11

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6931)

Abstract

Researchers and developers of IR systems generally want to make inferences about the effectiveness of their systems over a population of user needs, topics, or queries. The most common framework for this is statistical hypothesis testing, which involves computing the probability of measuring the observed effectiveness of two systems over a sample of topics under a null hypothesis that the difference in effectiveness is unremarkable. It is not commonly known that these tests involve models of effectiveness. In this work we first explicitly describe the modeling assumptions of the t-test, then develop a Bayesian modeling approach that makes modeling assumptions explicit and easy to change for specific challenges in IR evaluation.
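
To make concrete the point that a significance test carries a model, here is a minimal sketch of the standard paired t-test over per-topic scores. It uses the textbook formulation, in which the per-topic differences are treated as independent draws from a normal distribution with unknown mean and variance; it is not necessarily the formulation developed in the paper. The function name and the score lists are hypothetical, chosen only for illustration.

```python
import math
import statistics


def paired_t_statistic(scores_a, scores_b):
    """Return (t, degrees_of_freedom) for H0: mean per-topic difference is 0.

    Implied model (textbook assumption, not taken from the paper):
    d_i = mu + e_i, with e_i independent and normally distributed.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("both systems must be scored on the same topics")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    if n < 2:
        raise ValueError("need at least two topics")
    mean_diff = statistics.mean(diffs)   # estimate of mu
    sd_diff = statistics.stdev(diffs)    # sample standard deviation (n - 1)
    if sd_diff == 0:
        raise ValueError("all differences are identical; t is undefined")
    std_err = sd_diff / math.sqrt(n)     # standard error of the mean difference
    return mean_diff / std_err, n - 1


# Hypothetical per-topic effectiveness scores for two systems.
system_a = [0.42, 0.31, 0.55, 0.28, 0.61, 0.47]
system_b = [0.38, 0.29, 0.49, 0.30, 0.52, 0.45]
t_stat, dof = paired_t_statistic(system_a, system_b)
print(f"t = {t_stat:.3f} on {dof} degrees of freedom")
```

The p-value would come from comparing the statistic against a t distribution with the returned degrees of freedom. The normality and independence assumptions in the docstring are the kind of modeling choice the abstract says usually goes unstated.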

Author information

Authors and Affiliations

Dept. of Computer & Info Sciences, University of Delaware, Newark, DE, USA
Ben Carterette

Editor information

Editors and Affiliations

Fondazione Ugo Bordoni, Viale del Policlinico 147, 00161, Rome, Italy
Giambattista Amati
Faculty of Informatics, University of Lugano, 6900, Lugano, Switzerland
Fabio Crestani

Cite this paper

Carterette, B. (2011). Model-Based Inference about IR Systems. In: Amati, G., Crestani, F. (eds) Advances in Information Retrieval Theory. ICTIR 2011. Lecture Notes in Computer Science, vol 6931. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23318-0_11

DOI: https://doi.org/10.1007/978-3-642-23318-0_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23317-3
Online ISBN: 978-3-642-23318-0
eBook Packages: Computer Science, Computer Science (R0)