Abstract
Modern information retrieval system suffers from radical variance performance even though its mean performance is well. In order to solve this problem, the Query Performance Predictor, predicting the performance of query without relevance assessment information , which is quite expensive or even infeasible in real system, was studied in the past decade. To evaluate the Query Performance Predictor, the correlation coefficient between predictor’s output and the Average Precision of query is calculated. Current works mainly employ correlation metrics including the Pearson correlation coefficient, Spearman’s Rho and Kendall’s tau. However, these correlation metrics have some limitation in evaluating the quality of predictor. To this end, we introduce a novel metric based on Brownian Distance Correlation (Dcor) in evaluating query performance predictor, which is able to measure the independence relationship apart from the linear correlation relationship between two variables. Therefore, the new method can report more reliable results especially when there are nonlinear or non-monotone relationships. We conduct a series of experiments on several standard TREC datasets and compare the results between Dcor and the classic metrics. In the experiments, the novel metric exhibits consistent evaluating results compared with the three classic coefficients. However, the results suggest a better stability when tuning the predictor’s parameters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Carmel, D., Yom-Tov, E.: Estimating the query difficulty for information retrieval. Synthesis Lectures on Information Concepts, Retrieval, and Services 2(1), 1–89 (2010)
Cronen-Townsend, S., Zhou, Y., Croft, W.B.: Predicting query performance. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 299–306. ACM (2002)
Grabisch, M., Marichal, J.-L., Mesiar, R., Pap, E.: Aggregation Functions (Encyclopedia of Mathematics and its Applications), 1st edn. Cambridge University Press, New York (2009)
Hauff, C., Azzopardi, L., Hiemstra, D.: The combination and evaluation of query performance prediction methods. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 301–312. Springer, Heidelberg (2009)
He, B., Ounis, I.: Inferring query performance using pre-retrieval predictors. In: Apostolico, A., Melucci, M. (eds.) SPIRE 2004. LNCS, vol. 3246, pp. 43–54. Springer, Heidelberg (2004)
Kwok, K.L.: A new method of weighting query terms for ad-hoc retrieval. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 187–195. ACM (1996)
Raju, T.N.K.: William sealy gosset and william a. silverman: two students of science. Pediatrics 116(3), 732–735 (2005)
Rodgers, J.L., Nicewander, W.A.: Thirteen ways to look at the correlation coefficient. The American Statistician 42(1), 59–66 (1988)
Scholer, F., Garcia, S.: A case for improved evaluation of query difficulty prediction. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 640–641. ACM (2009)
Shtok, A., Kurland, O., Carmel, D.: Predicting query performance by query-drift estimation. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 305–312. Springer, Heidelberg (2009)
Szekely, G.J., Rizzo, M.L.: Brownian distance covariance. The Annals of Applied Statistics 3(4), 1236–1265 (2009)
Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to ad hoc information retrieval. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 334–342. ACM (2001)
Zhou, Y.: Retrieval performance prediction and document quality. PhD thesis, University of Massachusetts Amherst (2007)
Zhou, Y., Croft, W.B.: Query performance prediction in web search environments. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 543–550. ACM (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Wang, X., Luo, T., Huang, Y., Wang, W. (2014). Evaluating Query Performance Predictors Based on Brownian Distance Correlation. In: Zu, Q., Vargas-Vera, M., Hu, B. (eds) Pervasive Computing and the Networked World. ICPCA/SWS 2013. Lecture Notes in Computer Science, vol 8351. Springer, Cham. https://doi.org/10.1007/978-3-319-09265-2_65
Download citation
DOI: https://doi.org/10.1007/978-3-319-09265-2_65
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09264-5
Online ISBN: 978-3-319-09265-2
eBook Packages: Computer ScienceComputer Science (R0)