Skip to main content

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 15))

Included in the following conference series:

  • 1606 Accesses

Abstract

In this paper, we propose a multiple-ranker approach to make learning to rank methods more effective for document retrieval application. In traditional learning to rank methods, a ranker is learned from a set of queries together with their corresponding document rankings labeled by experts, and it is then used to predict the document rankings for new queries. But in practice, user queries vary in large diversity, which makes the single ranker learned from a close set of data not representative. The single ranker cannot be guaranteed with the best ranking result for every single query, and this becomes the bottleneck of traditional learning to rank approaches. To address this problem, we propose a multi-ranker approach. We train multiple diverse rankers which can cover diverse categories of queries, instead of an isolated one, and take an ensemble of these rankers for final prediction. We verify the proposed multipleranker approach over real-world datasets. The experimental results indicate that the proposed approach can outperform existing ‘learning to rank’ methods significantly.

This work is supported by National Science Foundation of China under the grant 60673009, Tianjin Science and Technology Research Foundation under the grant 05YFGZGX24000 and Microsoft Research Asia Foundation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Herbrich, R., Graepel, T., Obermayer, K.: Large Margin Rank Boundaries for Ordinal Regression. Advances in Large Margin Classifiers, pp. 115–132 (2000)

    Google Scholar 

  2. Crammer, K., Singer, Y.: PRanking with Ranking. Proceedings of NIPS 2001, Vancouver, British Columbia, Canada (2001)

    Google Scholar 

  3. Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., Hullender, G.: Learning to Rank Using Gradient Descent. In: Proceedings of ICML 2005, Bonn, Germany (2005)

    Google Scholar 

  4. Cao, Z., Qin, T., Liu, T.Y., Tsai, M.F., Li, H.: Learning to Rank: from Pairwise Approach to Listwise Approach. In: Proceedings of ICML 2007, Oregon, USA (2007)

    Google Scholar 

  5. Freund, Y., Iyer, R.D., Schapire, R.E., Singer, Y.: An Efficient Boosting Algorithm for Combining Preferences. Journal of Machine Learning Research 4, 933–969 (2003)

    Article  MathSciNet  Google Scholar 

  6. Xu, J., Li, H.: AdaRank: a Boosting Algorithm for Information Retrieval. In: Proceedings of SIGIR 2007, Amsterdam, The Netherlands (2007)

    Google Scholar 

  7. Kullback, S.: Information Theory and Statistics, New York, Dover (1968)

    Google Scholar 

  8. Hersh, W.R., Buckley, C., Leone, T.J., Hickam, D.H.: OHSUMED: An Interactive Retrieval Evaluation and New Large Test Collection for Research. In: Proceedings of SIGIR 1994, Dublin, Ireland (1994)

    Google Scholar 

  9. Breiman, L.: Bagging predictors. Machine Learning 24, 123–140 (1996)

    MATH  MathSciNet  Google Scholar 

  10. MacQueen, J.B.: Some Methods for Classification and Analysis of Multivariate Observations. In: Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability. University of California Press, Berkeley (1967)

    Google Scholar 

  11. Johnson, S.C.: Hierarchical Clustering Schemes. Psychometrika 2, 241–254 (1967)

    Article  Google Scholar 

  12. Liu, T.Y., Qin, T., Xu, J., Xiong, W.Y., Li, H.: LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval. In: Proceedings of LR4IR 2007, in conjunction with SIGIR 2007, Amsterdam, Netherlands (2007)

    Google Scholar 

  13. Craswell, N., Hawking, D.:Overview of the TREC-2004 Web Track. In: TREC (2004)

    Google Scholar 

  14. Baeza-Yates, R.A., Ribeiro-Neto, B.: Modern Information Retrieval, Addison-Wesley Longman Publishing Co., Inc., Boston, MA (1999)

    Google Scholar 

  15. Jarvelin, K., Kekalainen, J.: Cumulated Gain-based Evaluation of IR Techniques. ACM Transactions on Information Systems 20(4), 422–446 (2002)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

De-Shuang Huang Donald C. Wunsch II Daniel S. Levine Kang-Hyun Jo

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, D., Xie, M., Wang, Y., Huang, Y., Ni, W. (2008). Multiple Ranker Method in Document Retrieval. In: Huang, DS., Wunsch, D.C., Levine, D.S., Jo, KH. (eds) Advanced Intelligent Computing Theories and Applications. With Aspects of Contemporary Intelligent Computing Techniques. ICIC 2008. Communications in Computer and Information Science, vol 15. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85930-7_52

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85930-7_52

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85929-1

  • Online ISBN: 978-3-540-85930-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics