Semi-Supervised Graph-Ranking for Text Retrieval

  • Conference paper
Information Retrieval Technology (AIRS 2008)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4993)

Abstract

Much work has been done on supervised ranking for information retrieval, where the goal is to rank all retrieved documents in a known repository using many labeled query-document pairs. Unfortunately, labeled pairs are scarce because human labeling is often expensive, difficult, and time-consuming. To address this issue, we employ a graph to represent pairwise relationships among the labeled and unlabeled documents, so that ranking scores can be propagated from each document to its neighbors. Our main contribution in this paper is a semi-supervised ranking method based on graph ranking and different weighting schemes. Experimental results on the 20-newsgroups dataset show that our method, called SSG-Rank, significantly outperforms both supervised ranking (Ranking SVM and PRank) and unsupervised graph ranking.
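The abstract does not spell out how scores are propagated, so the following is only a minimal sketch of generic graph-based score propagation in the style of manifold ranking (Zhou et al.), not the SSG-Rank method itself. The function name `propagate_scores`, the parameters `alpha` and `n_iter`, and the toy four-document chain graph are all assumptions made for illustration; the paper's weighting schemes are not reproduced here.

```python
import numpy as np

def propagate_scores(W, y, alpha=0.8, n_iter=200):
    """Spread initial scores y over a graph with symmetric affinity matrix W.

    Iterates f <- alpha * S @ f + (1 - alpha) * y, where
    S = D^{-1/2} W D^{-1/2} is the symmetrically normalized affinity matrix.
    (Hypothetical helper; a generic scheme, not the paper's SSG-Rank.)
    """
    W = np.asarray(W, dtype=float).copy()
    y = np.asarray(y, dtype=float)
    np.fill_diagonal(W, 0.0)                      # no self-loops
    deg = W.sum(axis=1)
    safe_deg = np.where(deg > 0, deg, 1.0)        # avoid division by zero
    d_inv_sqrt = np.where(deg > 0, 1.0 / np.sqrt(safe_deg), 0.0)
    S = d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    f = y.copy()
    for _ in range(n_iter):
        f = alpha * (S @ f) + (1.0 - alpha) * y   # neighbors pull scores together
    return f

# Toy example (assumed data): documents 0-1-2-3 form a chain,
# and only document 0 carries a relevance label.
W = np.array([
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
])
y = np.array([1.0, 0.0, 0.0, 0.0])
scores = propagate_scores(W, y)

# Rank the unlabeled documents by their propagated score, highest first.
unlabeled = np.array([1, 2, 3])
ranking = unlabeled[np.argsort(-scores[unlabeled])]
print(scores)
print(ranking)
```

In this sketch, unlabeled documents closer to the labeled one in the graph end up with higher scores, which is the intuition behind using unlabeled data for ranking. Because `alpha` is below 1, the iteration converges to the closed form `(1 - alpha) * (I - alpha * S)^{-1} y`.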

Editor information

Hang Li Ting Liu Wei-Ying Ma Tetsuya Sakai Kam-Fai Wong Guodong Zhou

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xie, M., Liu, J., Zheng, N., Li, D., Huang, Y., Wang, Y. (2008). Semi-Supervised Graph-Ranking for Text Retrieval. In: Li, H., Liu, T., Ma, WY., Sakai, T., Wong, KF., Zhou, G. (eds) Information Retrieval Technology. AIRS 2008. Lecture Notes in Computer Science, vol 4993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68636-1_25

  • DOI: https://doi.org/10.1007/978-3-540-68636-1_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68633-0

  • Online ISBN: 978-3-540-68636-1

  • eBook Packages: Computer Science, Computer Science (R0)
