Abstract
In subset ranking, the goal is to learn a ranking function that approximates a gold-standard partial ordering of a set of objects (in our case, relevance labels of a set of documents retrieved for the same query). In this paper we introduce a learning-to-rank approach to subset ranking based on multi-class classification. Our technique can be summarized in three major steps. First, a multi-class classification model (AdaBoost.MH) is trained to predict the relevance label of each object. Second, the trained model is calibrated using various calibration techniques to obtain diverse class probability estimates. Finally, the Bayes-scoring function, which optimizes the popular information retrieval performance measure NDCG, is approximated by mixing these estimates into a final scoring function. An important novelty of our approach is that many different methods are applied to estimate the same probability distribution, and all of these hypotheses are combined into an improved model. It is well known that mixing different conditional distributions according to a prior is usually more efficient than selecting a single "optimal" distribution. Accordingly, because it uses all the calibration techniques, our approach does not require identifying the best-suited calibration method and is therefore less prone to overfitting. In an experimental study, our method outperformed many standard ranking algorithms on the LETOR benchmark datasets, most of them based on significantly more complex learning-to-rank algorithms than ours.
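The three steps lend themselves to a compact illustration. The following is a minimal sketch, not the paper's implementation: scikit-learn's AdaBoostClassifier stands in for AdaBoost.MH (which scikit-learn does not provide), the sigmoid (Platt) and isotonic calibrators stand in for the paper's diverse calibration techniques, the mixture is a simple uniform average, and the gain g(k) = 2^k - 1 is the standard NDCG gain, so each document is scored by its expected gain, the usual form of the Bayes scorer for DCG-type metrics.

```python
# Minimal sketch of the three-step pipeline described in the abstract.
# Assumptions (stand-ins, not the paper's exact components): AdaBoostClassifier
# replaces AdaBoost.MH, sklearn's sigmoid/isotonic calibrators replace the
# paper's calibration techniques, mixing is a uniform average, and the
# gain g(k) = 2^k - 1 is the standard NDCG gain.
import numpy as np
from sklearn.ensemble import AdaBoostClassifier
from sklearn.calibration import CalibratedClassifierCV

def train_scorer(X, y):
    """X: document feature matrix; y: integer relevance labels 0..K-1."""
    base = AdaBoostClassifier(n_estimators=200)          # step 1: multi-class model
    calibrated = [
        CalibratedClassifierCV(base, method=m, cv=3).fit(X, y)
        for m in ("sigmoid", "isotonic")                 # step 2: diverse calibrations
    ]
    classes = calibrated[0].classes_
    gains = 2.0 ** classes - 1.0                         # NDCG-style gain per label

    def score(X_new):
        # step 3: mix the calibrated class-probability estimates and score
        # each document by its expected gain (Bayes scorer for DCG metrics)
        p = np.mean([c.predict_proba(X_new) for c in calibrated], axis=0)
        return p @ gains

    return score
```

Documents for a query are then ranked by sorting them in decreasing order of this score; averaging the calibrated distributions before scoring is what makes the final ranker robust to any single miscalibrated estimate.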