On Application of Learning to Rank for E-Commerce Search

Authors: Shubhra Kanti Karmaker Santu, Parikshit Sondhi, ChengXiang Zhai

SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 475 - 484

https://doi.org/10.1145/3077136.3080838

Published: 07 August 2017

Abstract

E-Commerce (E-Com) search is an important emerging application of information retrieval. Learning to Rank (LETOR) is a general and effective strategy for optimizing search engines, and is thus also a key technology for E-Com search. While the use of LETOR for web search has been well studied, its use for E-Com search has not yet been well explored. In this paper, we discuss the practical challenges in applying learning to rank methods to E-Com search, including the challenges in feature representation, obtaining reliable relevance judgments, and optimally exploiting multiple user feedback signals such as click rates, add-to-cart ratios, order rates, and revenue. We study these challenges using experiments on industry data sets and report several findings that can provide guidance on how to optimally apply LETOR to E-Com search. First, popularity-based features defined solely on product items are very useful, and LETOR methods were able to effectively optimize their combination with relevance-based features. Second, query attribute sparsity raises challenges for LETOR, and selecting features to reduce or avoid sparsity is beneficial. Third, while crowdsourcing is often useful for obtaining relevance judgments for web search, it does not work as well for E-Com search because of the difficulty of eliciting sufficiently fine-grained relevance judgments. Finally, among the multiple feedback signals, the order rate is found to be the most robust training objective, followed by click rate, while add-to-cart ratio seems least robust, suggesting that an effective practical strategy may be to initially use click rates for training and gradually shift to using order rates as they become available.
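The closing recommendation of the abstract (train on click rates first, then shift to order rates as they become available) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than the authors' method: the `Feedback` schema, the definition of each rate as a count divided by impressions, and the `min_orders` threshold are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Feedback:
    """Aggregated log counts for one (query, product) pair (hypothetical schema)."""
    impressions: int
    clicks: int
    cart_adds: int  # kept for completeness; not used as a label in this sketch
    orders: int


def rate(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, or 0.0 when there is no exposure."""
    return numerator / denominator if denominator > 0 else 0.0


def training_label(fb: Feedback, min_orders: int = 5) -> tuple[str, float]:
    """Pick the training signal: order rate once enough orders exist, else click rate.

    `min_orders` is an illustrative threshold, not a value from the paper.
    """
    if fb.orders >= min_orders:
        return "order_rate", rate(fb.orders, fb.impressions)
    return "click_rate", rate(fb.clicks, fb.impressions)


# A newly listed product has too few orders, so its label falls back to click rate.
new_item = Feedback(impressions=200, clicks=30, cart_adds=4, orders=1)
# A product with enough order history is labeled by order rate instead.
mature_item = Feedback(impressions=1000, clicks=120, cart_adds=40, orders=20)

print(training_label(new_item))     # ('click_rate', 0.15)
print(training_label(mature_item))  # ('order_rate', 0.02)
```

A hard threshold is only one way to realize a "gradual shift"; a weighted blend of the two rates that leans toward order rate as order counts grow would be another option under the same assumptions.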

Published In

SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

August 2017, 1476 pages

ISBN: 9781450350228

DOI: 10.1145/3077136

General Chairs: Noriko Kando (National Institute of Informatics), Tetsuya Sakai (Waseda University), Hideo Joho (University of Tsukuba)

Program Chairs: Hang Li (Huawei Noah's Ark Lab), Arjen P. de Vries (Radboud University), Ryen W. White (Microsoft Cortana)

Copyright © 2017 ACM.

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Funding Sources

WalmartLabs

Conference

SIGIR '17

Sponsor:

SIGIR

SIGIR '17: The 40th International ACM SIGIR conference on research and development in Information Retrieval

August 7 - 11, 2017

Tokyo, Shinjuku, Japan

Acceptance Rates

SIGIR '17 Paper Acceptance Rate 78 of 362 submissions, 22%;

Overall Acceptance Rate 792 of 3,983 submissions, 20%
