skip to main content
10.1145/3077136.3080838acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

On Application of Learning to Rank for E-Commerce Search

Published: 07 August 2017 Publication History

Abstract

E-Commerce (E-Com) search is an emerging important new application of information retrieval. Learning to Rank (LETOR) is a general effective strategy for optimizing search engines, and is thus also a key technology for E-Com search. While the use of LETOR for web search has been well studied, its use for E-Com search has not yet been well explored. In this paper, we discuss the practical challenges in applying learning to rank methods to E-Com search, including the challenges in feature representation, obtaining reliable relevance judgments, and optimally exploiting multiple user feedback signals such as click rates, add-to-cart ratios, order rates, and revenue. We study these new challenges using experiments on industry data sets and report several interesting findings that can provide guidance on how to optimally apply LETOR to E-Com search: First, popularity-based features defined solely on product items are very useful and LETOR methods were able to effectively optimize their combination with relevance-based features. Second, query attribute sparsity raises challenges for LETOR, and selecting features to reduce/avoid sparsity is beneficial. Third, while crowdsourcing is often useful for obtaining relevance judgments for Web search, it does not work as well for E-Com search due to difficulty in eliciting sufficiently fine grained relevance judgments. Finally, among the multiple feedback signals, the order rate is found to be the most robust training objective, followed by click rate, while add-to-cart ratio seems least robust, suggesting that an effective practical strategy may be to initially use click rates for training and gradually shift to using order rates as they become available.

References

[1]
Omar Alonso and Stefano Mizzaro. 2009. Relevance criteria for e-commerce: a crowdsourcing-based experimental analysis. In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval. ACM, 760--761.
[2]
Omar Alonso, Daniel E. Rose, and Benjamin Stewart. 2008. Crowdsourcing for relevance evaluation. In ACM SigIR Forum, Vol. 42. ACM, 9--15.
[3]
Leif Azzopardi and Guido Zuccon. 2016. Advances in Formal Models of Search and Search Behaviour. In Proceedings of the 2016 ACM on International Conference on the Theory of Information Retrieval. ACM, 1--4.
[4]
Leo Breiman. 2001. Random forests. Machine learning 45, 1 (2001), 5--32.
[5]
Chris Burges, Tal Shaked, Erin Renshaw, Ari Lazier, Matt Deeds, Nicole Hamilton, and Greg Hullender. 2005. Learning to rank using gradient descent. In Proceedings of the 22nd international conference on Machine learning. ACM, 89--96.
[6]
Christopher J. C. Burges. 2010. From ranknet to lambdarank to lambdamart: An overview. Learning 11 (2010), 23--581.
[7]
Olivier Chapelle and Yi Chang. 2011. Yahoo! Learning to Rank Challenge Overview. In Yahoo! Learning to Rank Challenge. 1--24.
[8]
Olivier Chapelle, Yi Chang, and Tie-Yan Liu. 2011. Future directions in learning to rank. In Yahoo! Learning to Rank Challenge. 91--100.
[9]
Xiangru Chen, Haofen Wang, Xinruo Sun, Junfeng Pan, and Yong Yu. 2011. Diversifying Product Search Results. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '11). ACM, New York, NY, USA, 1093--1094.
[10]
Huizhong Duan, ChengXiang Zhai, Jinxing Cheng, and Abhishek Gattani. 2013. Supporting Keyword Search in Product Database: A Probabilistic Approach. Proc. VLDB Endow. 6, 14 (Sept. 2013), 1786--1797.
[11]
Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. 2008. LIBLINEAR: A library for large linear classification. The Journal of Machine Learning Research 9 (2008), 1871--1874.
[12]
Yoav Freund, Raj Iyer, Robert E Schapire, and Yoram Singer. 2003. An efficient boosting algorithm for combining preferences. The Journal of machine learning research 4 (2003), 933--969.
[13]
Wolfgang Hardle. 2012. Smoothing techniques: with implementation in S. Springer Science & Business Media.
[14]
Kalervo Järvelin and Jaana Kekäläinen. 2002. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems (TOIS) 20, 4 (2002), 422--446.
[15]
John Le, Andy Edmonds, Vaughn Hester, and Lukas Biewald. 2010. Ensuring quality in crowdsourced search relevance evaluation: The effects of training question distribution. In SIGIR 2010 workshop on crowdsourcing for search evaluation. 21--26.
[16]
Su-In Lee, Honglak Lee, Pieter Abbeel, and Andrew Y. Ng. 2006. Efficient l1 regularized logistic regression. In Proceedings of the National Conference on Artificial Intelligence, Vol. 21. 401.
[17]
Beibei Li, Anindya Ghose, and Panagiotis G. Ipeirotis. 2011. Towards a Theory Model for Product Search. In Proceedings of the 20th International Conference on World Wide Web (WWW '11). ACM, New York, NY, USA, 327--336.
[18]
Hang Li. 2014. Learning to rank for information retrieval and natural language processing. Synthesis Lectures on Human Language Technologies 7, 3 (2014), 1--121.
[19]
Chih-Jen Lin, Ruby C. Weng, and S. Sathiya Keerthi. 2008. Trust region newton method for logistic regression. The Journal of Machine Learning Research 9 (2008), 627--650.
[20]
Tie-Yan Liu. 2009. Learning to rank for information retrieval. Foundations and Trends in Information Retrieval 3, 3 (2009), 225--331.
[21]
Bo Long, Jiang Bian, Anlei Dong, and Yi Chang. 2012. Enhancing Product Search by Best-selling Prediction in e-Commerce. In CIKM '12 (CIKM '12). ACM, New York, NY, USA, 2479--2482.
[22]
Craig Macdonald, Rodrygo L. T. Santos, and Iadh Ounis. 2012. On the usefulness of query features for learning to rank. In Proceedings of the 21st ACM international conference on Information and knowledge management. ACM, 2559--2562.
[23]
Craig Macdonald, Rodrygo L. T. Santos, and Iadh Ounis. 2013. The whens and hows of learning to rank for web search. Information Retrieval 16, 5 (2013), 584--628.
[24]
José R Pérez-Agüera, Javier Arroyo, Jane Greenberg, Joaquin Perez Iglesias, and Victor Fresno. 2010. Using BM25F for semantic search. In Proceedings of the 3rd international semantic search workshop. ACM, 2.
[25]
Tao Qin and Tie-Yan Liu. 2013. Introducing LETOR 4.0 Datasets. CoRR abs/1306.2597 (2013). http://arxiv.org/abs/1306.2597
[26]
Stephen E. Robertson, Steve Walker, Susan Jones, Micheline M. Hancock-Beaulieu, Mike Gatford, and others. 1995. Okapi at TREC-3. NIST SPECIAL PUBLICATION SP 109 (1995), 109.
[27]
Alex Smola and Vladimir Vapnik. 1997. Support vector regression machines. Advances in neural information processing systems 9 (1997), 155--161.
[28]
Johan A. K. Suykens and Joos Vandewalle. 1999. Least squares support vector machine classifiers. Neural processing letters 9, 3 (1999), 293--300.
[29]
Liang Tang, Bo Long, Bee-Chung Chen, and Deepak Agarwal. 2016. An Empirical Study on Recommendation with Multiple Types of Feedback. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 283--292.
[30]
Niek Tax, Sander Bockting, and Djoerd Hiemstra. 2015. A cross-benchmark comparison of 87 learning to rank methods. Information processing & management 51, 6 (2015), 757--772.
[31]
Christophe Van Gysel, Maarten de Rijke, and Evangelos Kanoulas. 2016. Learning Latent Vector Spaces for Product Search. In Proceedings of the 25th ACM CIKM '16. ACM, New York, NY, USA, 165--174.
[32]
Damir Vandic, Flavius Frasincar, and Uzay Kaymak. 2013. Facet Selection Algorithms for Web Product Search. In Proceedings of the 22Nd ACM International Conference on Information & Knowledge Management (CIKM '13). ACM, New York, NY, USA, 2327--2332.
[33]
Frank Wilcoxon. 1945. Individual comparisons by ranking methods. Biometrics bulletin 1, 6 (1945), 80--83.
[34]
Jun Xu and Hang Li. 2007. Adarank: a boosting algorithm for information retrieval. In ACM SIGIR. ACM, 391--398.
[35]
Emine Yilmaz and Stephen Robertson. 2010. On the Choice of Effectiveness Measures for Learning to Rank. Inf. Retr. 13, 3 (June 2010), 271--290.
[36]
Jun Yu, Sunil Mohan, Duangmanee (Pew) Putthividhya, and Weng-Keen Wong. 2014. Latent Dirichlet Allocation Based Diversified Retrieval for e-Commerce Search. In WSDM '14. New York, NY, USA, 463--472.

Cited By

View all
  • (2024)Query-Aware Explainable Product Search With Reinforcement Knowledge Graph ReasoningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.329733136:3(1260-1273)Online publication date: Mar-2024
  • (2024)TechnoSearch: Improving e-Commerce Searches Using Product Category and Brand Based Ranking2024 International Conference on Electrical, Communication and Computer Engineering (ICECCE)10.1109/ICECCE63537.2024.10823605(1-6)Online publication date: 30-Oct-2024
  • (2024)Unified Embedding Based Personalized Retrieval in Etsy Search2024 IEEE International Conference on Future Machine Learning and Data Science (FMLDS)10.1109/FMLDS63805.2024.00055(258-264)Online publication date: 20-Nov-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
August 2017
1476 pages
ISBN:9781450350228
DOI:10.1145/3077136
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. e-commerce search
  2. information retrieval
  3. learning to rank

Qualifiers

  • Research-article

Funding Sources

  • WalmartLabs

Conference

SIGIR '17
Sponsor:

Acceptance Rates

SIGIR '17 Paper Acceptance Rate 78 of 362 submissions, 22%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)76
  • Downloads (Last 6 weeks)20
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Query-Aware Explainable Product Search With Reinforcement Knowledge Graph ReasoningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.329733136:3(1260-1273)Online publication date: Mar-2024
  • (2024)TechnoSearch: Improving e-Commerce Searches Using Product Category and Brand Based Ranking2024 International Conference on Electrical, Communication and Computer Engineering (ICECCE)10.1109/ICECCE63537.2024.10823605(1-6)Online publication date: 30-Oct-2024
  • (2024)Unified Embedding Based Personalized Retrieval in Etsy Search2024 IEEE International Conference on Future Machine Learning and Data Science (FMLDS)10.1109/FMLDS63805.2024.00055(258-264)Online publication date: 20-Nov-2024
  • (2024)BERT-Based Movie Keyword Search Leveraging User-Generated Movie Rankings and Reviews2024 IEEE International Conference on Big Data and Smart Computing (BigComp)10.1109/BigComp60711.2024.00046(246-256)Online publication date: 18-Feb-2024
  • (2024)Narrowing the Technological Gap by Promoting Small Commerce Through VR and AI for a Lifelike E-Commerce Experience: Needs and SolutionsEnterprise Information Systems10.1007/978-3-031-64755-0_5(92-112)Online publication date: 26-Jul-2024
  • (2024)Learning-to-Rank with Nested FeedbackAdvances in Information Retrieval10.1007/978-3-031-56063-7_22(306-315)Online publication date: 24-Mar-2024
  • (2023)Smart Fashion Recommendation System using FashionNetICST Transactions on Scalable Information Systems10.4108/eetsis.4278Online publication date: 30-Oct-2023
  • (2023)On Gradient Boosted Decision Trees and Neural Rankers: A Case-Study on Short-Video Recommendations at ShareChatProceedings of the 15th Annual Meeting of the Forum for Information Retrieval Evaluation10.1145/3632754.3632940(136-141)Online publication date: 15-Dec-2023
  • (2023)Learning to Rank for Search Results Re-ranking in Learning Experience PlatformsProceedings of the 16th Annual ACM India Compute Conference10.1145/3627217.3627224(25-30)Online publication date: 9-Dec-2023
  • (2023)Post-hoc Selection of Pareto-Optimal Solutions in Search and RecommendationProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615010(2013-2023)Online publication date: 21-Oct-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media