Multiple Ranker Method in Document Retrieval

Li, Dong; Xie, Maoqiang; Wang, Yang; Huang, Yalou; Ni, Weijian

doi:10.1007/978-3-540-85930-7_52

Dong Li¹,
Maoqiang Xie²,
Yang Wang¹,
Yalou Huang² &
…
Weijian Ni¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 15))

Included in the following conference series:

International Conference on Intelligent Computing

1606 Accesses

Abstract

In this paper, we propose a multiple-ranker approach to make learning to rank methods more effective for document retrieval application. In traditional learning to rank methods, a ranker is learned from a set of queries together with their corresponding document rankings labeled by experts, and it is then used to predict the document rankings for new queries. But in practice, user queries vary in large diversity, which makes the single ranker learned from a close set of data not representative. The single ranker cannot be guaranteed with the best ranking result for every single query, and this becomes the bottleneck of traditional learning to rank approaches. To address this problem, we propose a multi-ranker approach. We train multiple diverse rankers which can cover diverse categories of queries, instead of an isolated one, and take an ensemble of these rankers for final prediction. We verify the proposed multipleranker approach over real-world datasets. The experimental results indicate that the proposed approach can outperform existing ‘learning to rank’ methods significantly.

This work is supported by National Science Foundation of China under the grant 60673009, Tianjin Science and Technology Research Foundation under the grant 05YFGZGX24000 and Microsoft Research Asia Foundation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Herbrich, R., Graepel, T., Obermayer, K.: Large Margin Rank Boundaries for Ordinal Regression. Advances in Large Margin Classifiers, pp. 115–132 (2000)
Google Scholar
Crammer, K., Singer, Y.: PRanking with Ranking. Proceedings of NIPS 2001, Vancouver, British Columbia, Canada (2001)
Google Scholar
Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., Hullender, G.: Learning to Rank Using Gradient Descent. In: Proceedings of ICML 2005, Bonn, Germany (2005)
Google Scholar
Cao, Z., Qin, T., Liu, T.Y., Tsai, M.F., Li, H.: Learning to Rank: from Pairwise Approach to Listwise Approach. In: Proceedings of ICML 2007, Oregon, USA (2007)
Google Scholar
Freund, Y., Iyer, R.D., Schapire, R.E., Singer, Y.: An Efficient Boosting Algorithm for Combining Preferences. Journal of Machine Learning Research 4, 933–969 (2003)
Article MathSciNet Google Scholar
Xu, J., Li, H.: AdaRank: a Boosting Algorithm for Information Retrieval. In: Proceedings of SIGIR 2007, Amsterdam, The Netherlands (2007)
Google Scholar
Kullback, S.: Information Theory and Statistics, New York, Dover (1968)
Google Scholar
Hersh, W.R., Buckley, C., Leone, T.J., Hickam, D.H.: OHSUMED: An Interactive Retrieval Evaluation and New Large Test Collection for Research. In: Proceedings of SIGIR 1994, Dublin, Ireland (1994)
Google Scholar
Breiman, L.: Bagging predictors. Machine Learning 24, 123–140 (1996)
MATH MathSciNet Google Scholar
MacQueen, J.B.: Some Methods for Classification and Analysis of Multivariate Observations. In: Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability. University of California Press, Berkeley (1967)
Google Scholar
Johnson, S.C.: Hierarchical Clustering Schemes. Psychometrika 2, 241–254 (1967)
Article Google Scholar
Liu, T.Y., Qin, T., Xu, J., Xiong, W.Y., Li, H.: LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval. In: Proceedings of LR4IR 2007, in conjunction with SIGIR 2007, Amsterdam, Netherlands (2007)
Google Scholar
Craswell, N., Hawking, D.:Overview of the TREC-2004 Web Track. In: TREC (2004)
Google Scholar
Baeza-Yates, R.A., Ribeiro-Neto, B.: Modern Information Retrieval, Addison-Wesley Longman Publishing Co., Inc., Boston, MA (1999)
Google Scholar
Jarvelin, K., Kekalainen, J.: Cumulated Gain-based Evaluation of IR Techniques. ACM Transactions on Information Systems 20(4), 422–446 (2002)
Article Google Scholar

Download references

Author information

Authors and Affiliations

College of Information Technology Science, Nankai University, Tianjin, China
Dong Li, Yang Wang & Weijian Ni
College of Software, Nankai University, Tianjin, China
Maoqiang Xie & Yalou Huang

Authors

Dong Li
View author publications
You can also search for this author in PubMed Google Scholar
Maoqiang Xie
View author publications
You can also search for this author in PubMed Google Scholar
Yang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yalou Huang
View author publications
You can also search for this author in PubMed Google Scholar
Weijian Ni
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

De-Shuang Huang Donald C. Wunsch II Daniel S. Levine Kang-Hyun Jo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, D., Xie, M., Wang, Y., Huang, Y., Ni, W. (2008). Multiple Ranker Method in Document Retrieval. In: Huang, DS., Wunsch, D.C., Levine, D.S., Jo, KH. (eds) Advanced Intelligent Computing Theories and Applications. With Aspects of Contemporary Intelligent Computing Techniques. ICIC 2008. Communications in Computer and Information Science, vol 15. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85930-7_52

Download citation

DOI: https://doi.org/10.1007/978-3-540-85930-7_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85929-1
Online ISBN: 978-3-540-85930-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics