Abstract
Many different ranking algorithms based on content and context have been used in web search engines to find pages based on a user query. Furthermore, to achieve better performance some new solutions combine different algorithms. In this paper we use simulated click-through data to learn how to combine many content and context features of web pages. This method is simple and practical to use with actual click-through data in a live search engine. The proposed approach is evaluated using the LETOR benchmark and we found it is competitive to Ranking SVM based on user judgments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agichtein, E., Brill, E., Dumais, S.: Improving Web Search Ranking by Incorporating User Behavior Information. In: ACM SIGIR (2006)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Reading (1999)
Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., Hullender, G.: Learning to rank using gradient descent. In: ICML (2005)
Cao, Y.B., Xu, J., Liu, T.Y., Li, H., Huang, Y.L., Hon, H.W.: Adaptive ranking SVM to document retrieval. In: ACM SIGIR (2006)
Freund, Y., Iyer, R., Schapire, R.E., Singer, Y.: An efficient boosting algorithm for combining preferences. Journal of Machine Learning Research 4, 933–969 (2003)
Herbrich, R., Graepel, T., Obermayer, K.: Large margin rank boundaries for ordinal regression. Advances in Large Margin Classifiers, pp. 115–132. MIT Press, Cambridge (2000)
Jarvelin, K., Kekalainen, J.: Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems 20(4), 422–446 (2002)
Joachims, T.: Optimizing Search Engines Using Click through Data. In: ACM SIGKDD, pp. 132–142 (2002)
Liu, T.-Y., Xu, J., Qin, T., Xiong, W., Li, H.: LETOR: Benchmark dataset for research on learning to rank for information retrieval. In: SIGIR Workshop on Learning to Rank (2007)
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the Web, Technical report, Stanford University (1998)
Radlinski, F., Joachims, T.: Evaluating the robustness of learning from implicit feedback. In: ICML Workshop on Learning in Web Search (2005)
Renda, M.E., Straccia, U.: Web metasearch: rank vs. score based rank aggregation methods. In: Proceedings of the ACM symposium on Applied computing (2003)
Robertson, S.E.: Overview of the okapi projects. Journal of Documentation 53(1), 3–7 (1997)
Shakery, A., Zhai, C.X.: Relevance Propagation for Topic Distillation UIUC TREC 2003 Web Track Experiments. In: Proceedings of TREC (2003)
Xue, G.R., Zeng, H.J., Chen, Z., Yu, Y., Ma, W.Y., Xi, W.S., Fan, W.G.: Optimizing Web Search Using Web Click-through Data. In: ACM CIKM, pp. 118–126 (2004)
Yeh, J., Lin, J., Ke, H., Yang, W.: Learning to Rank for Information Retrieval Using Genetic Programming. In: SIGIR Workshop on Learning to Rank (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bidoki, A.M.Z., Thom, J.A. (2009). Combination of Documents Features Based on Simulated Click-through Data. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds) Advances in Information Retrieval. ECIR 2009. Lecture Notes in Computer Science, vol 5478. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00958-7_48
Download citation
DOI: https://doi.org/10.1007/978-3-642-00958-7_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00957-0
Online ISBN: 978-3-642-00958-7
eBook Packages: Computer ScienceComputer Science (R0)