skip to main content
10.1145/3397271.3401065acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Query Reformulation in E-Commerce Search

Published:25 July 2020Publication History

ABSTRACT

The importance of e-commerce platforms has driven forward a growing body of research work on e-commerce search. We present the first large-scale and in-depth study of query reformulations performed by users of e-commerce search; the study is based on the query logs of eBay's search engine. We analyze various factors including the distribution of different types of reformulations, changes of search result pages retrieved for the reformulations, and clicks and purchases performed upon the retrieved results. We then turn to address a novel challenge in the e-commerce search realm: predicting whether a user will reformulate her query before presenting her the search results. Using a suite of prediction features, most of which are novel to this study, we attain high prediction quality. Some of the features operate prior to retrieval time, whereas others rely on the retrieved results. While the latter are substantially more effective than the former, we show that the integration of these two types of features is of merit. We also show that high prediction quality can be obtained without considering information from the past about the user or the query she posted. Nevertheless, using these types of information can further improve prediction quality.

Skip Supplemental Material Section

Supplemental Material

3397271.3401065.mp4

mp4

17.5 MB

References

  1. Martin Arlitt. 2000. Characterizing web user sessions. SIGMETRICS Perform. Eval. Rev., Vol. 28, 2 (2000), 50--63.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Ahmed Hassan Awadallah, Ranjitha Gurunath Kulkarni, Umut Ozertem, and Rosie Jones. 2015. Characterizing and predicting voice query reformulation. In Proc. of CIKM. 543--552.Google ScholarGoogle Scholar
  3. Ahmed Hassan Awadallah, Xiaolin Shi, Nick Craswell, and Bill Ramsey. 2013. Beyond clicks: query reformulation as a predictor of search satisfaction. In Proc. of CIKM. 2019--2028.Google ScholarGoogle Scholar
  4. Ahmed Hassan Awadallah, Ryen W. White, Susan T. Dumais, and Yi-Min Wang. 2014. Struggling or exploring?: disambiguating long search sessions. In Proc. of WSDM. 53--62.Google ScholarGoogle Scholar
  5. Yoshua Bengio, Ian Goodfellow, and Aaron Courville. 2017. Deep learning.Google ScholarGoogle Scholar
  6. Paolo Boldi, Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis, and Sebastiano Vigna. 2008. The query-flow graph: model and applications. In Proc. of CIKM. 609--618.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Leo Breiman. 2001. Random forests. Machine learning, Vol. 45, 1 (2001), 5--32.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Eliot Brenner, Jun Zhao, Aliasgar Kutiyanawala, and Zheng Yan. 2018. End-to-end neural ranking for eCommerce product search: An application of task models and textual embeddings. In The SIGIR 2018 Workshop On eCommerce.Google ScholarGoogle Scholar
  9. David Carmel and Elad Yom-Tov. 2010. Estimating the query difficulty for information retrieval. Morgan & Claypool Publishers.Google ScholarGoogle Scholar
  10. Chih-Chung Chang and Chih-Jen Lin. 2011. LIBSVM: A library for support vector machines. ACM TIST, Vol. 2, 3 (2011), 27.Google ScholarGoogle Scholar
  11. Tianqi Chen and Carlos Guestrin. 2016. Xgboost: A scalable tree boosting system. In Proc. of KDD. 785--794.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Machine Learning, Vol. 20, 3 (1995), 273--297.Google ScholarGoogle ScholarCross RefCross Ref
  13. Steve Cronen-Townsend, Yun Zhou, and W. Bruce Croft. 2002. Predicting query performance. In Proc. of SIGIR. 299--306.Google ScholarGoogle Scholar
  14. Ronan Cummins, Joemon M. Jose, and Colm O'Riordan. 2011. Improved query performance prediction using standard deviation. In Proc. of SIGIR. 1089--1090.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Rotem Dror, Gili Baumer, Segev Shlomov, and Roi Reichart. 2018. The hitchhiker's guide to testing statistical significance in natural language processing. In Proc. of ACL. 1383--1392.Google ScholarGoogle ScholarCross RefCross Ref
  16. Joseph L Fleiss. 1971. Measuring nominal scale agreement among many raters. Psychological bulletin, Vol. 76, 5 (1971), 378--382.Google ScholarGoogle Scholar
  17. Jerome H Friedman. 2001. Greedy function approximation: A gradient boosting machine. Annals of statistics (2001), 1189--1232.Google ScholarGoogle Scholar
  18. Katharina C Furtner, Thomas Mandl, and Christa Womser-Hacker. 2015. Effects of Auto-Suggest on the Usability of Search in eCommerce.. In ISI. 178--190.Google ScholarGoogle Scholar
  19. Sreenivas Gollapudi, Samuel Ieong, and Anitha Kannan. 2012. Structured query reformulations in commerce search. In Proc. of CIKM. 1890--1894.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Ido Guy and Kira Radinsky. 2017. Structuring the unstructured: From startup to making sense of eBay's huge eCommerce inventory. In Proc. of SIGIR. 1351.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Christophe Van Gysel, Maarten de Rijke, and Evangelos Kanoulas. 2018. Mix'n match: Integrating text matching and product substitutability within product search. In Proc. of CIKM. 1373--1382.Google ScholarGoogle Scholar
  22. Claudia Hauff, Leif Azzopardi, and Djoerd Hiemstra. 2009. The combination and evaluation of query performance prediction methods. In Proc. of ECIR. 301--312.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Yujing Hu, Qing Da, Anxiang Zeng, Yang Yu, and Yinghui Xu. 2018. Reinforcement learning to rank in e-Commerce search engine: Formalization, analysis, and application. In Proc. of SIGKDD. 368--377.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Jeff Huang and Efthimis N. Efthimiadis. 2009. Analyzing and evaluating query reformulation strategies in web search logs. In Proc. of CIKM. 77--86.Google ScholarGoogle Scholar
  25. Bernard J. Jansen, Danielle L. Booth, and Amanda Spink. 2009. Patterns of query reformulation during Web searching. JASIST, Vol. 60, 7 (2009), 1358--1371.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Jiepu Jiang, Ahmed Hassan Awadallah, Xiaolin Shi, and Ryen W. White. 2015. Understanding and predicting graded search satisfaction. In Proc. of WSDM. 57--66.Google ScholarGoogle Scholar
  27. Jyun-Yu Jiang, Yen-Yu Ke, Pao-Yu Chien, and Pu-Jen Cheng. 2014. Learning user reformulation behavior for query auto-completion. In Proc. of SIGIR. 445--454.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Rosie Jones and Kristina Lisa Klinkner. 2008. Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs. In Proc. of CIKM. 699--708.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Li-Jen Kao and Yo-Ping Huang. 2017. Predicting purchase intention according to fan page users' sentiment. In Proc. of SMC. 831--835.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Elad Kravi, Ido Guy, Avihai Mejer, David Carmel, Yoelle Maarek, Dan Pelleg, and Gilad Tsur. 2016. One Query, Many Clicks: Analysis of Queries with Multiple Clicks by the Same User. In Proc. of CIKM. 1423--1432.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Rohan Kumar, Mohit Kumar, Neil Shah, and Christos Faloutsos. 2018. Did we get it right? predicting query performance in e-Commerce search. In The SIGIR 2018 Workshop On eCommerce.Google ScholarGoogle Scholar
  32. Or Levi, Ido Guy, Fiana Raiber, and Oren Kurland. 2018. Selective Cluster Presentation on the Search Results Page. ACM TOIS, Vol. 36, 3, Article 28 (2018), 42 pages.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Saurav Manchanda, Mohit Sharma, and George Karypis. 2019. Intent term selection and refinement in e-commerce queries. CoRR, Vol. abs/1908.08564 (2019).Google ScholarGoogle Scholar
  34. Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Proc. of NIPS. 3111--3119.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Eric W Noreen. 1989. Computer-intensive methods for testing hypotheses. Wiley New York.Google ScholarGoogle Scholar
  36. Daan Odijk, Ryen W. White, Ahmed Hassan Awadallah, and Susan T. Dumais. 2015. Struggling and success in web search. In Proc. of CIKM. 1551--1560.Google ScholarGoogle Scholar
  37. Jay M. Ponte and W. Bruce Croft. 1998. A language modeling approach to information retrieval. In Proc. of SIGIR. 275--281.Google ScholarGoogle Scholar
  38. Fiana Raiber and Oren Kurland. 2014. Query-performance prediction: setting the expectations straight. In Proc. of SIGIR. 13--22.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Hadas Raviv, Oren Kurland, and David Carmel. 2014. Query performance prediction for entity retrieval. In Proc. of SIGIR. 1099--1102.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Stephen E. Robertson, Steve Walker, Susan Jones, Micheline Hancock-Beaulieu, and Mike Gatford. 1994. Okapi at TREC-3. In Proc. of TREC-3.Google ScholarGoogle Scholar
  41. Haggai Roitman, Shai Erera, Oren Sar Shalom, and Bar Weiner. 2017. Enhanced mean retrieval score estimation for query performance prediction. In Proc. of ICTIR. 35--42.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Shubhra Kanti Karmaker Santu, Parikshit Sondhi, and ChengXiang Zhai. 2017. On application of learning to rank for e-Commerce search. In Proc. of SIGIR. 475--484.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Nikos Sarkas, Stelios Paparizos, and Panayiotis Tsaparas. 2010. Structured annotations of web queries. In Proc. of SIGMOD. 771--782.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Humphrey Sheil, Omer Rana, and Ronan Reilly. 2018. Predicting purchasing intent: Automatic feature learning using recurrent neural networks. In The SIGIR 2018 Workshop On eCommerce.Google ScholarGoogle Scholar
  45. Anna Shtok, Oren Kurland, and David Carmel. 2016. Query performance prediction using reference lists. ACM Trans. Inf. Syst., Vol. 34, 4 (2016), 19:1--19:34.Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Anna Shtok, Oren Kurland, David Carmel, Fiana Raiber, and Gad Markovits. 2012. Predicting query performance by query-drift estimation. ACM Transactions on Information Systems, Vol. 30, 2 (2012), 11.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Gyanit Singh, Nish Parikh, and Neel Sundaresn. 2011. User behavior in zero-recall ecommerce queries. In Proc. of SIGIR. 75--84.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Parikshit Sondhi, Mohit Sharma, Pranam Kolari, and ChengXiang Zhai. 2018. A taxonomy of queries for e-Commerce search. In Proc. of SIGIR. 1245--1248.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Ning Su, Jiyin He, Yiqun Liu, Min Zhang, and Shaoping Ma. 2018. User Intent, behaviour, and perceived satisfaction in product search. In Proc. of WSDM. 547--555.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Zehong Tan, Canran Xu, Mengjie Jiang, Hua Yang, and Xiaoyuan Wu. 2017. Query rewrite for null and low search results in eCommerce. In The SIGIR 2017 Workshop On eCommerce.Google ScholarGoogle Scholar
  51. Yongquan Tao and Shengli Wu. 2014. Query performance prediction by considering score magnitude and variance together. In Proc. of CIKM. 1891--1894.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Arthur Toth, Louis Tan, Giuseppe Di Fabbrizio, and Ankur Datta. 2017. Predicting shopping behavior with mixture of RNNs. In The SIGIR 2017 Workshop On eCommerce.Google ScholarGoogle Scholar
  53. Hen Tzaban, Ido Guy, Asnat Greenstein-Messica, Arnon Dagan, Lior Rokach, and Bracha Shapira. 2020. Product Bundle Identification using Semi-Supervised Learning. In Proc. of SIGIR.Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Teng Xiao, Jiaxin Ren, Zaiqiao Meng, Huan Sun, and Shangsong Liang. 2019. Dynamic bayesian metric learning for personalized product search. In Proc. of CIKM. 1693--1702.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Yingwei Xin, Ethan Hart, Vibhuti Mahajan, and Jean-David Ruvini. 2018. Learning better internal structure of words for sequence labeling. In Proc. of EMNLP. 2584--2593.Google ScholarGoogle ScholarCross RefCross Ref
  56. Yuan Zhang, Dong Wang, and Yan Zhang. 2019. Neural IR meets graph embedding: A ranking model for product search. In Proc. of WWW. 2390--2400.Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Ying Zhao, Falk Scholer, and Yohannes Tsegay. 2008. Effective pre-retrieval query performance prediction using similarity and variability evidence. In Proc. of ECIR. 52--64.Google ScholarGoogle ScholarCross RefCross Ref
  58. Yun Zhou and W. Bruce Croft. 2007. Query performance prediction in web search environments. In Proc. of SIGI. 543--550.Google ScholarGoogle Scholar

Index Terms

  1. Query Reformulation in E-Commerce Search

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
        July 2020
        2548 pages
        ISBN:9781450380164
        DOI:10.1145/3397271

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 25 July 2020

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate792of3,983submissions,20%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader