Skip to main content

Evaluating Query Performance Predictors Based on Brownian Distance Correlation

  • Conference paper
Pervasive Computing and the Networked World (ICPCA/SWS 2013)

Part of the book series: Lecture Notes in Computer Science ((LNCCN,volume 8351))

  • 3035 Accesses

Abstract

Modern information retrieval system suffers from radical variance performance even though its mean performance is well. In order to solve this problem, the Query Performance Predictor, predicting the performance of query without relevance assessment information , which is quite expensive or even infeasible in real system, was studied in the past decade. To evaluate the Query Performance Predictor, the correlation coefficient between predictor’s output and the Average Precision of query is calculated. Current works mainly employ correlation metrics including the Pearson correlation coefficient, Spearman’s Rho and Kendall’s tau. However, these correlation metrics have some limitation in evaluating the quality of predictor. To this end, we introduce a novel metric based on Brownian Distance Correlation (Dcor) in evaluating query performance predictor, which is able to measure the independence relationship apart from the linear correlation relationship between two variables. Therefore, the new method can report more reliable results especially when there are nonlinear or non-monotone relationships. We conduct a series of experiments on several standard TREC datasets and compare the results between Dcor and the classic metrics. In the experiments, the novel metric exhibits consistent evaluating results compared with the three classic coefficients. However, the results suggest a better stability when tuning the predictor’s parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Carmel, D., Yom-Tov, E.: Estimating the query difficulty for information retrieval. Synthesis Lectures on Information Concepts, Retrieval, and Services 2(1), 1–89 (2010)

    Article  Google Scholar 

  2. Cronen-Townsend, S., Zhou, Y., Croft, W.B.: Predicting query performance. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 299–306. ACM (2002)

    Google Scholar 

  3. Grabisch, M., Marichal, J.-L., Mesiar, R., Pap, E.: Aggregation Functions (Encyclopedia of Mathematics and its Applications), 1st edn. Cambridge University Press, New York (2009)

    Book  Google Scholar 

  4. Hauff, C., Azzopardi, L., Hiemstra, D.: The combination and evaluation of query performance prediction methods. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 301–312. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  5. He, B., Ounis, I.: Inferring query performance using pre-retrieval predictors. In: Apostolico, A., Melucci, M. (eds.) SPIRE 2004. LNCS, vol. 3246, pp. 43–54. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  6. Kwok, K.L.: A new method of weighting query terms for ad-hoc retrieval. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 187–195. ACM (1996)

    Google Scholar 

  7. Raju, T.N.K.: William sealy gosset and william a. silverman: two students of science. Pediatrics 116(3), 732–735 (2005)

    Article  Google Scholar 

  8. Rodgers, J.L., Nicewander, W.A.: Thirteen ways to look at the correlation coefficient. The American Statistician 42(1), 59–66 (1988)

    Article  Google Scholar 

  9. Scholer, F., Garcia, S.: A case for improved evaluation of query difficulty prediction. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 640–641. ACM (2009)

    Google Scholar 

  10. Shtok, A., Kurland, O., Carmel, D.: Predicting query performance by query-drift estimation. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 305–312. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  11. Szekely, G.J., Rizzo, M.L.: Brownian distance covariance. The Annals of Applied Statistics 3(4), 1236–1265 (2009)

    Article  MATH  MathSciNet  Google Scholar 

  12. Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to ad hoc information retrieval. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 334–342. ACM (2001)

    Google Scholar 

  13. Zhou, Y.: Retrieval performance prediction and document quality. PhD thesis, University of Massachusetts Amherst (2007)

    Google Scholar 

  14. Zhou, Y., Croft, W.B.: Query performance prediction in web search environments. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 543–550. ACM (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Wang, X., Luo, T., Huang, Y., Wang, W. (2014). Evaluating Query Performance Predictors Based on Brownian Distance Correlation. In: Zu, Q., Vargas-Vera, M., Hu, B. (eds) Pervasive Computing and the Networked World. ICPCA/SWS 2013. Lecture Notes in Computer Science, vol 8351. Springer, Cham. https://doi.org/10.1007/978-3-319-09265-2_65

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09265-2_65

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09264-5

  • Online ISBN: 978-3-319-09265-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics