skip to main content
10.1145/3383313.3412235acmconferencesArticle/Chapter ViewAbstractPublication PagesrecsysConference Proceedingsconference-collections
research-article

From the lab to production: A case study of session-based recommendations in the home-improvement domain

Published:22 September 2020Publication History

ABSTRACT

E-commerce applications rely heavily on session-based recommendation algorithms to improve the shopping experience of their customers. Recent progress in session-based recommendation algorithms shows great promise. However, translating that promise to real-world outcomes is a challenging task for several reasons, but mostly due to the large number and varying characteristics of the available models. In this paper, we discuss the approach and lessons learned from the process of identifying and deploying a successful session-based recommendation algorithm for a leading e-commerce application in the home-improvement domain. To this end, we initially evaluate fourteen session-based recommendation algorithms in an offline setting using eight different popular evaluation metrics on three datasets. The results indicate that offline evaluation does not provide enough insight to make an informed decision since there is no clear winning method on all metrics. Additionally, we observe that standard offline evaluation metrics fall short for this application. Specifically, they reward an algorithm only when it predicts the exact same item that the user clicked next or eventually purchased. In a practical scenario, however, there are near-identical products which, although they are assigned different identifiers, they should be considered as equally-good recommendations. To overcome these limitations, we perform an additional round of evaluation, where human experts provide both objective and subjective feedback for the recommendations of five algorithms that performed the best in the offline evaluation. We find that the experts’ opinion is oftentimes different from the offline evaluation results. Analysis of the feedback confirms that the performance of all models is significantly higher when we evaluate near-identical product recommendations as relevant. Finally, we run an A/B test with one of the models that performed the best in the human evaluation phase. The treatment model increased conversion rate by 15.6% and revenue per visit by 18.5% when compared with a leading third-party solution.

References

  1. R. Agrawal, T. Imieliński, and A. Swami. 1993. Mining Association Rules Between Sets of Items in Large Databases. In SIGMOD ’93.Google ScholarGoogle Scholar
  2. J. Beel and S. Langer. 2015. A Comparison of Offline Evaluations, Online Evaluations, and User Studies in the Context of Research-Paper Recommender Systems. In Research and Advanced Technology for Digital Libraries.Google ScholarGoogle Scholar
  3. G. Bonnin and D. Jannach. 2014. Automated generation of music playlists: Survey and experiments. CSUR 47, 2 (2014).Google ScholarGoogle Scholar
  4. K. Cho, B. van Merriënboer, D. Bahdanau, and Y. Bengio. 2014. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches. In SSST ’14.Google ScholarGoogle Scholar
  5. P. Cremonesi, F. Garzotto, and R. Turrin. 2012. Investigating the Persuasion Potential of Recommender Systems from a Quality Perspective: An Empirical Study. TiiS 2, 2 (2012).Google ScholarGoogle Scholar
  6. M. Dacrema, P. Cremonesi, and D. Jannach. 2019. Are We Really Making Much Progress? A Worrying Analysis of Recent Neural Recommendation Approaches. In RecSys ’19.Google ScholarGoogle Scholar
  7. F. Garcin, B. Faltings, O. Donatsch, A. Alazzawi, C. Bruttin, and A. Huber. 2014. Offline and online evaluation of news recommender systems at swissinfo.ch. In RecSys ’14.Google ScholarGoogle Scholar
  8. D. Garg, P. Gupta, P. Malhotra, L. Vig, and G. Shroff. 2019. Sequence and time aware neighborhood for session-based recommendations: Stan. In SIGIR ’19.Google ScholarGoogle Scholar
  9. C. Gomez-Uribe and N. Hunt. 2016. The Netflix Recommender System: Algorithms, Business Value, and Innovation. TMIS 6, 4 (2016).Google ScholarGoogle Scholar
  10. A. Gunawardana and G. Shani. 2015. Evaluating Recommender Systems. Recommender Systems Handbook.Google ScholarGoogle Scholar
  11. N. Hariri, B. Mobasher, and R. Burke. 2012. Context-aware music recommendation based on latenttopic sequential patterns. In RecSys ’12.Google ScholarGoogle Scholar
  12. B. Hidasi and A. Karatzoglou. 2018. Recurrent Neural Networks with Top-k Gains for Session-based Recommendations. In CIKM ’18.Google ScholarGoogle Scholar
  13. B. Hidasi, A. Karatzoglou, L. Baltrunas, and D. Tikk. 2016. Session-based recommendations with recurrent neural networks. In ICLR ’16.Google ScholarGoogle Scholar
  14. D. Jannach and M. Ludewig. 2017. When Recurrent Neural Networks Meet the Neighborhood for Session-Based Recommendation. In RecSys ’17.Google ScholarGoogle Scholar
  15. I. Kamehkhosh and D. Jannach. 2017. User Perception of Next-Track Music Recommendations. In UMAP ’17.Google ScholarGoogle Scholar
  16. I. Kamehkhosh, D. Jannach, and M. Ludewig. 2017. A Comparison of Frequent Pattern Techniques and a Deep Learning Method for Session-Based Recommendation. In TempRec ’17.Google ScholarGoogle Scholar
  17. L. Lerche, D. Jannach, and M. Ludewig. 2016. On the value of reminders within e-commerce recommendations. In UMAP ’16.Google ScholarGoogle Scholar
  18. J. Li, P. Ren, Z. Chen, Z. Ren, T. Lian, and J. Ma. 2017. Neural Attentive Session-Based Recommendation. In CIKM ’17.Google ScholarGoogle Scholar
  19. Q. Liu, Y. Zeng, R. Mokhosi, and H. Zhang. 2018. STAMP: short-term attention/memory priority model for session-based recommendation. In KDD ’18.Google ScholarGoogle Scholar
  20. M. Ludewig and D. Jannach. 2018. Evaluation of session-based recommendation algorithms. UMUAI 28, 4 (2018).Google ScholarGoogle Scholar
  21. M. Ludewig and D. Jannach. 2019. User-centric evaluation of session-based recommendations for an automated radio station. In RecSys ’19.Google ScholarGoogle Scholar
  22. M. Ludewig, N. Mauro, S. Latifi, and D. Jannach. 2019. Empirical Analysis of Session-Based Recommendation Algorithms. CoRR abs/1910.12781(2019).Google ScholarGoogle Scholar
  23. M. Ludewig, N. Mauro, S. Latifi, and D. Jannach. 2019. Performance Comparison of Neural and Non-Neural Approaches to Session-Based Recommendation. In RecSys ’19.Google ScholarGoogle Scholar
  24. S. McNee, J. Riedl, and J. Konstan. 2006. Being accurate is not enough: How accuracy metrics have hurt recommender systems. In CHI ’06.Google ScholarGoogle Scholar
  25. F. Mi and B. Faltings. 2018. Context tree for adaptive session-based recommendation. arXiv preprint arXiv:1806.03733(2018).Google ScholarGoogle Scholar
  26. M. Rossetti, F. Stella, and M. Zanker. 2016. Contrasting Offline and Online Results when Evaluating Recommendation Algorithms. In RecSys ’16.Google ScholarGoogle Scholar
  27. K. Wagstaff. 2012. Machine Learning that Matters. In ICML’ 12.Google ScholarGoogle Scholar
  28. M. Wang, P. Ren, L. Mei, Z. Chen, J. Ma, and M. de Rijke. 2019. A Collaborative Session-Based Recommendation Approach with Parallel Memory Modules. In SIGIR’19.Google ScholarGoogle Scholar
  29. S. Wang, L. Cao, and Y. Wang. 2019. A Survey on Session-based Recommender Systems. CoRR abs/1902.04864(2019).Google ScholarGoogle Scholar
  30. S. Wu, Y. Tang, Y. Zhu, L. Wang, X. Xie, and T. Tan. 2019. Session-Based Recommendation with Graph Neural Networks. In AAAI ’19.Google ScholarGoogle Scholar
  31. F. Yuan, A. Karatzoglou, I. Arapakis, J. Jose, and X. He. 2019. A simple convolutional generative network for next item recommendation. In WSDM ’19.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    RecSys '20: Proceedings of the 14th ACM Conference on Recommender Systems
    September 2020
    796 pages
    ISBN:9781450375832
    DOI:10.1145/3383313

    Copyright © 2020 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 22 September 2020

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate254of1,295submissions,20%

    Upcoming Conference

    RecSys '24
    18th ACM Conference on Recommender Systems
    October 14 - 18, 2024
    Bari , Italy

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format