research-article

Learning to rank with a novel kernel perceptron method

Authors:
Xue-wen Chen

University of Kansas, Lawrence, KS, USA

University of Kansas, Lawrence, KS, USA
View Profile

,
Haixun Wang

Microsoft Research Asia, Beijing, China

Microsoft Research Asia, Beijing, China
View Profile

,
Xiaotong Lin

University of Kansas, Lawrence, KS, USA

University of Kansas, Lawrence, KS, USA
View Profile

CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementNovember 2009Pages 505–512https://doi.org/10.1145/1645953.1646018

Published:02 November 2009Publication History

CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management

Pages 505–512

ABSTRACT

While conventional ranking algorithms, such as the PageRank, rely on the web structure to decide the relevancy of a web page, learning to rank seeks a function capable of ordering a set of instances using a supervised learning approach. Learning to rank has gained increasing popularity in information retrieval and machine learning communities. In this paper, we propose a novel nonlinear perceptron method for rank learning. The proposed method is an online algorithm and simple to implement. It introduces a kernel function to map the original feature space into a nonlinear space and employs a perceptron method to minimize the ranking error by avoiding converging to a solution near the decision boundary and alleviating the effect of outliers in the training dataset. Furthermore, unlike existing approaches such as RankSVM and RankBoost, the proposed method is scalable to large datasets for online learning. Experimental results on benchmark corpora show that our approach is more efficient and achieves higher or comparable accuracies in instance ranking than state of the art methods such as FRank, RankSVM and RankBoost.

References

Page, L., Brin, S., Motwani, R., and Winograd, T. 1998. The pagerank citation ranking: Bring order to the web. Technical report, Stanford University.Google Scholar
Burges, C. 2005. Ranking as Learning Structured Outputs. Proceedings of the NIPS 2005 Workshop on Learning to Rank, 7--11.Google Scholar
Brinker, K. and Hullermeier, E. 2005. Calibrated Label-Ranking. Proceedings of the NIPS 2005 Workshop on Learning to Rank, 1--6.Google Scholar
Grangier, D. and Bengio, S. 2005. Exploiting Hyperlinks to Learn a Retrieval Model. Proceedings of the NIPS 2005 Workshop on Learning to Rank, 12--17.Google Scholar
Panikkaia, T., Tsivtsiadze, E., Airola, A., Boberg, J., and Salakoski, T. 2007. Learning to Rank with Pairwise Regularized Least--Squares. In Proceedings of SIGIR 2007 Workshop on Learning to Rank for Information Retrieval.Google Scholar
Cao, G., Nie, J., Si, L., and Bai, J. 2007. Learning to Rank Documents for Ad-Retrieval with Regularized Models. In Proceedings of SIGIR 2007 Workshop on Learning to Rank for Information Retrieval.Google Scholar
Yeh, J., Lin, J., Ke, H., and Yang, W. 2007. Learning to Rank for Information Retrieval using genetic Programming. In Proceedings of SIGIR 2007 Workshop on Learning to Rank for Information Retrieval.Google Scholar
Veloso, A., Almeida, H., Goncalves, M., and Meira Jr., W. 2008. Learning to Rank at Query-time Using Association Rules. Proceedings of the 31th Annual Internaitonal ACM SIGIR Conference on Research and Development in Information Retrieval, 267--274. Google ScholarDigital Library
Cao, Z., Qin, T., Liu, T., Tsai, M., and Li, H., 2007. Learning to Rank: from Pairwise Approach to Listwise Approach. Proceedings of the 24th International Conference on Machine Learning, 129--136. Google ScholarDigital Library
Thorsten Joachims, 2002. Optimizing search engines using clickthrough data. In Proceedings of the Eighth SIGKDD, 133--142. Google ScholarDigital Library
Yoav Freund, Raj Iyer, Robert E Schapire, and Yoram Singer, 2003. An efficient boosting algorithm for combining preferences. In J. Mach. Learn. Res., 4:933--969. Google ScholarDigital Library
Raul Rojas, 1996. Neural Networks: A Systematic Introduction. Springer. Google ScholarDigital Library
Koby Crammer, Yoram Singer, 2001. Pranking for ranking. In Advances in Neural Information Processing Systems 14.Google Scholar
Chris Burges, Tal Shaked, Erin Renshaw, Matt Deeds, Nicole Hamilton, and Greg Hullender, 2005. Learning to rank using gradient descent. In ICML, pages 89--96. Google ScholarDigital Library
T. Graepel, R. Herbrich, and R.C. Williamson, 2001. From Margin to Sparsity. In Advances in Neural Information Processing Systems, 210--216.Google Scholar
Scholkopf, B. and Smola, A. 2002. Learning with Kernels. MIT Press, Cambridge, MA.Google Scholar
Cristianini, N. and Shawe-Taylor, J., 2000. An Introduction to support Vector machines. Cambridge University Press, Cambridge, UK. Google ScholarDigital Library
Liu, T., Qin, T., Xu, J., Xiong, W., and Li, H. 2007. LETOR: Benchmark dataset for research on learning to rank for information retrieval. LR3IR 2007, in conjunction with SIGIR 2007. Google ScholarDigital Library
Baeza-Yates, R. and Ribeiro-Neto, B. 1999. Modern information retrieval. Addison Wesley, 1999. Google ScholarDigital Library
Jarvelin, K. and Kekalainen, J. 2000. IR evaluation methods for retrieving highly relevant documents. Proceedings of SIGIR, 41--48. Google ScholarDigital Library
Jarvelin, K. and Kekalainen, 2002. Cumulated gain-based evaluation of IR techniques. ACM Trans. on Information Systems, 20(4), 422--446. Google ScholarDigital Library

Index Terms

Learning to rank with a novel kernel perceptron method
1. Information systems
  1. Information retrieval
    1. Evaluation of retrieval results
      1. Relevance assessment

Recommendations

Quality-biased ranking for queries with commercial intent
WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide Web

Modern search engines are good enough to answer popular commercial queries with mainly highly relevant documents. However, our experiments show that users behavior on such relevant commercial sites may differ from one to another web-site with the same ...
Read More
Incremental learning to rank with partially-labeled data
WSCD '09: Proceedings of the 2009 workshop on Web Search Click Data

In this paper we present a semi-supervised learning method for a problem of learning to rank where we exploit Markov random walks and graph regularization in order to incorporate not only "labeled" web pages but also plenty of "un-labeled" web pages (...
Read More
Learning to rank code examples for code search engines

Source code examples are used by developers to implement unfamiliar tasks by learning from existing solutions. To better support developers in finding existing solutions, code search engines are designed to locate and rank code examples relevant to user'...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
November 2009
2162 pages
ISBN:9781605585123
DOI:10.1145/1645953
General Chairs:
David Cheung
University of Hong Kong, Hong Kong
,
Il-Yeol Song
Drexel University, USA
,
Program Chairs:
Wesley Chu
UCLA, USA
,
Xiaohua Hu
Drexel University, USA
,
Jimmy Lin
University of Maryland, USA
Copyright © 2009 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 2 November 2009
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
learning to rank
perceptron
web search
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,861of8,427submissions,22%
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 451
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Learning to rank with a novel kernel perceptron method

CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management

ABSTRACT

References

Cited By

Index Terms

Recommendations

Quality-biased ranking for queries with commercial intent

Incremental learning to rank with partially-labeled data

Learning to rank code examples for code search engines

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Learning to rank with a novel kernel perceptron method

CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management

ABSTRACT

References

Cited By

Index Terms

Recommendations

Quality-biased ranking for queries with commercial intent

Incremental learning to rank with partially-labeled data

Learning to rank code examples for code search engines

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media