Semi-Supervised Graph-Ranking for Text Retrieval

Xie, Maoqiang; Liu, Jinli; Zheng, Nan; Li, Dong; Huang, Yalou; Wang, Yang

doi:10.1007/978-3-540-68636-1_25

Maoqiang Xie¹,
Jinli Liu¹,
Nan Zheng¹,
Dong Li²,
Yalou Huang¹ &
…
Yang Wang¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4993))

Included in the following conference series:

Asia Information Retrieval Symposium

1436 Accesses
4 Citations

Abstract

Much work has been done on supervised ranking for information retrieval, where the goal is to rank all searched documents in a known repository with many labeled query-document pairs. Unfortunately, the labeled pairs are lack because human labeling is often expensive, difficult and time consuming. To address this issue, we employ graph to represent pairwise relationships among the labeled and unlabeled documents, in order that the ranking score can be propagated to their neighbors. Our main contribution in this paper is to propose a semi-supervised ranking method based on graph-ranking and different weighting schemas. Experimental results show that our method called SSG-Rank on 20-newsgroups dataset outperforms supervised ranking (Ranking SVM and PRank) and unsupervised graph ranking significantly.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agarwal, S.: Ranking on Graph Data. In: The proceedings of International Conference of Machine Learning 2006, pp. 25–32 (2006)
Google Scholar
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of Annual Conference on Computational Learning Theory, pp. 92–100 (1998)
Google Scholar
Brin, S., Page, L.: The Anatomy of a Large Scale Hypertextual Web Search Engine. In: Proceedings of 7th International World Wide Web Conference, pp. 107–117 (1998)
Google Scholar
Cao, Y., Xu, J., Liu, T.Y., Li, H., Huang, Y.L., Hon, H.W., Adapting Ranking, S.V.M.: to Document Retrieval. In: Proceedings of ACM SIGIR, vol. 29, pp. 186–193 (2006)
Google Scholar
Crammer, K., Singer, Y.: PRanking with ranking. Advances in Neural Information Processing Systems, Canada (2002)
Google Scholar
Herbrich, R., Graepel, T., Obermayer, K.: Large Margin Rank Boundaries for Ordinal Regression, Advances in Large Margin Classifiers, pp. 115–132. MIT Press, Cambridge (2000)
Google Scholar
Joachims, T.: Transductive inference for text classification using support vector machine. In: Proceedings of 16th International Conference of Machine Learning, pp. 200–209 (1999)
Google Scholar
Kleinberg, J.: Authoritative sources in a hyperlinked environment. In: Proceedings of the 9th ACM-SIAM Symposium on Discrete Algorithms, New Orleans, pp. 668–677 (1997)
Google Scholar
Liu, T., Xu, J., Qin, T., Xiong, W., Li, H.: LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval. In: SIGIR 2007 Workshop on Learning to Rank for Information Retrieval (2007)
Google Scholar
Robertson, S., Hull, D.: The TREC-9 filtering track final report. In: TREC, pp. 25–40 (2000)
Google Scholar
Wan, X., Yang, J., Xiao, J.: Document Similarity Search Based on Manifold- Ranking of TextTiles. In: The 3rd Asia Information Retrieval Symposium, Singapore, pp. 14–25 (2006)
Google Scholar
Wang, F., Zhang, C.: Label Propagation Through Linear Neighborhoods. In: Proceedings of 23rd International Conference of Machine Learning, pp. 985–992 (2006)
Google Scholar
Xu, J., Cao, Y., Li, H., Huang, Y.: Cost-Sensitive Learning of SVM for Ranking. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 833–840. Springer, Heidelberg (2006)
Chapter Google Scholar
Xu, J., Li, H.: AdaRank: A Boosting Algorithm for Information Retrieval. In: The proceedings of SIGIR 2007, pp. 391–398 (2007)
Google Scholar
Zhou, D.Y., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with Local and Global Consistency. In: Advances in Neural Information Processing Systems 16, pp. 321–328 (2004)
Google Scholar
Zhou, D.Y., Weston, J., Gretton, A., et al.: Ranking on Data Manifolds. In: Advances in Neural Information Processing System 16 (2003)
Google Scholar
Zhou, Z.H., Li, M.: Semi-supervised regression with co-training. In: Proceedings of International Joint Conference on Artificial Intelligence 2005 (2005)
Google Scholar
Zhu, X.J.: Semi-Supervised Learning Literature Survey, Computer Sciences Technical Report 1530, University of Wisconsin-Madison (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Software, Nankai University, Tianjin, China
Maoqiang Xie, Jinli Liu, Nan Zheng, Yalou Huang & Yang Wang
College of Information Technology and Science, Nankai University, Tianjin, China
Dong Li

Authors

Maoqiang Xie
View author publications
You can also search for this author in PubMed Google Scholar
Jinli Liu
View author publications
You can also search for this author in PubMed Google Scholar
Nan Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Dong Li
View author publications
You can also search for this author in PubMed Google Scholar
Yalou Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yang Wang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Hang Li Ting Liu Wei-Ying Ma Tetsuya Sakai Kam-Fai Wong Guodong Zhou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xie, M., Liu, J., Zheng, N., Li, D., Huang, Y., Wang, Y. (2008). Semi-Supervised Graph-Ranking for Text Retrieval. In: Li, H., Liu, T., Ma, WY., Sakai, T., Wong, KF., Zhou, G. (eds) Information Retrieval Technology. AIRS 2008. Lecture Notes in Computer Science, vol 4993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68636-1_25

Download citation

DOI: https://doi.org/10.1007/978-3-540-68636-1_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68633-0
Online ISBN: 978-3-540-68636-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics