Cost-Sensitive Supported Vector Learning to Rank Imbalanced Data Set

Chang, Xiao; Zheng, Qinghua; Lin, Peng

doi:10.1007/978-3-642-04020-7_33

Xiao Chang^24,25,
Qinghua Zheng^24,25 &
Peng Lin^24,25

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5755))

Included in the following conference series:

International Conference on Intelligent Computing

1532 Accesses

Abstract

In recent years, the algorithms of learning to rank have been proposed by researchers. Most of these algorithms are pairwise approach. In many real world applications, instances of ranks are imbalanced. After the instances of ranks are composed to pairs, the pairs of ranks are imbalanced too. In this paper, a cost-sensitive risk minimum model of pairwise learning to rank imbalance data sets is proposed. Following this model, the algorithm of cost-sensitive supported vector learning to rank is investigated. In experiment, the convention Ranking SVM is used as baseline. The document retrieval data set is used in experiment. The experimental results show that the performance of cost-sensitive supported vector learning to rank is better than Ranking SVM on the document retrieval data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Freund, Y., Iyer, R., Schapire, R.E., Singer, Y.: An Efficient Boosting Algorithm for Combining Preferences. In: 15th International Conference on Machine Learning, pp. 170–178 (1998)
Google Scholar
Herbrich, R., Graepel, T., Obermayer, K.: Support Vector Learning for Ordinal Regression. In: Nineth Ann. Conf. Artificial Neural Networks (ICANN 1999), pp. 97–102 (1999)
Google Scholar
Crammer, K., Singer, Y.: Pranking with Ranking. In: Fourteenth Ann. Conf. Neural Information Processing Systems, NIPS 2001 (2001)
Google Scholar
Shashua, A., Levin, A.: Ranking with Large Margin Principle: Two Approaches. In: 16th Ann. Conf. Neural Information Processing Systems (NIPS 2003), pp. 961–968 (2003)
Google Scholar
Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., Hullender, G.: Learning to Rank Using Gradient Descent. In: The 22nd International Conference on Machine Learning, pp. 89–96 (2005)
Google Scholar
Chu, W., Ghahramani, Z.: Preference Learning with Gaussian Processes. In: 22nd International Conference on Machine Learning, pp. 137–144 (2005)
Google Scholar
Chu, W., Ghahramani, Z.: Gaussian Processes for Ordinal Regression. Journal of Machine Learning Research 6, 23 (2005)
MathSciNet Google Scholar
Lin, H.-T., Li, L.: Large-margin thresholded ensembles for ordinal regression: Theory and practice. In: Balcázar, J.L., Long, P.M., Stephan, F. (eds.) ALT 2006. LNCS, vol. 4264, pp. 319–333. Springer, Heidelberg (2006)
Chapter Google Scholar
Tsai, M.-F., Liu, T.-Y., Qin, T., Chen, H.-H., Ma, W.-Y.: FRank: A Ranking Method with Fidelity Loss. In: The 30th Annual International ACM SIGIR Conference (2007)
Google Scholar
Pahikkala, T., Tsivtsivadze, E., Airola, A., Boberg, J., Salakoski, T.: Learning to rank with pairwise regularized least-squares. In: The 30th International Conference on Research and Development in Information Retrieval -Workshop on Learning to Rank for Information Retrieval, pp. 27–33 (2007)
Google Scholar
Cao, Y., Xu, J., Liu, T.Y., Li, H., Huang, Y., Hon, H.-W.: Adapting Ranking SVM to Document Retrieval. In: 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 186–193 (2006)
Google Scholar
Joachims, T.: Optimizing Search Engines Using Clickthrough Data. In: The ACM Conference on Knowledge Discovery and Data Mining, pp. 133–142 (2002)
Google Scholar
Hersh, W., Buckley, C., Leone, T.J., Hickam, D.: OHSUMED: An Interactive Retrieval Evaluation and New Large Test Collection for Research. In: Seventeenth Ann. ACM-SIGIR Conf. Research and Development in Information Retrieval (SIGIR 1994), pp. 192–201|358 (1994)
Google Scholar
Liu, T.Y., Xu, J., Qin, T., Xiong, W., Li, H.: Letor: Benchmark Dataset for Research on Learning to Rank for Information Retrieval. In: SIGIR 2007 Workshop on Learning to Rank for Information Retrieval (2007)
Google Scholar
Kekalainen, J.: Binary and Graded Relevance in IR Evaluations - Comparison of the Effects on Ranking of IR Systems. Information Processing & Management 41, 1019–1033 (2005)
Article Google Scholar
Raskutti, B., Kowalczyk, A.: Extreme re-balancing for SVMs: a Case Study. ACM SIGKDD Explorations Newsletter 6, 60–69 (2004)
Article Google Scholar
Tao, Q., Wu, G.W., Wang, F.Y., Wang, J.: Posterior Probability Support Vector Machines for Unbalanced Data. IEEE Transactions on Neural Networks 16, 1561–1573 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dept. Computer Science and Engineering, Xi’an Jiaotong University,
Xiao Chang, Qinghua Zheng & Peng Lin
Shaanxi Key Lab. of Satellite and Computer Network, No.28 Xianning West Road, Xi’an, Shaanxi, 710049, P.R. China
Xiao Chang, Qinghua Zheng & Peng Lin

Authors

Xiao Chang
View author publications
You can also search for this author in PubMed Google Scholar
Qinghua Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Peng Lin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Intelligent Machines, Chinese Academy of Sciences, China
De-Shuang Huang
Graduate School of Electrical Engineering, University of Ulsan, Korea, San 29, Mugeo-Dong, Nam-Ku, 680 - 749, Ulsan, Korea
Kang-Hyun Jo
School of Electrical Engineering, University of Ulsan, Ulsan, South Korea
Hong-Hee Lee
School of Electrical Engineering, University of Ulsan, South Korea
Hee-Jun Kang
e.B.I.S. s.r.l. (electronic Business in Security), Spin-Off of Polytechnic of Bari, Str. Prov. per Casamassima Km., 3, 70010, Valenzano, (BA), Italy
Vitoantonio Bevilacqua

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chang, X., Zheng, Q., Lin, P. (2009). Cost-Sensitive Supported Vector Learning to Rank Imbalanced Data Set. In: Huang, DS., Jo, KH., Lee, HH., Kang, HJ., Bevilacqua, V. (eds) Emerging Intelligent Computing Technology and Applications. With Aspects of Artificial Intelligence. ICIC 2009. Lecture Notes in Computer Science(), vol 5755. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04020-7_33

Download citation

DOI: https://doi.org/10.1007/978-3-642-04020-7_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04019-1
Online ISBN: 978-3-642-04020-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics