skip to main content
10.1145/3437963.3441799acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings

Authors Info & Claims
Published:08 March 2021Publication History

ABSTRACT

Recommendation datasets are prone to selection biases due to self-selection behavior of users and item selection process of systems. This makes explicitly combating selection biases an essential problem in training recommender systems. Most previous studies assume no unbiased data available for training. We relax this assumption and assume that a small subset of training data is unbiased. Then, we propose a novel objective that utilizes the unbiased data to adaptively assign propensity weights to biased training ratings. This objective, combined with unbiased performance estimators, alleviates the effects of selection biases on the training of recommender systems. To optimize the objective, we propose an efficient algorithm that minimizes the variance of propensity estimates for better generalized recommender systems. Extensive experiments on two real-world datasets confirm the advantages of our approach in significantly reducing both the error of rating prediction and the variance of propensity estimation.

References

  1. Balázs Csanád Csáji et al. 2001. Approximation with artificial neural networks. Faculty of Sciences, Etvs Lornd University, Hungary (2001).Google ScholarGoogle Scholar
  2. John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization. Journal of Machine Learning Research (JMLR) (2011).Google ScholarGoogle Scholar
  3. Luca Franceschi, Michele Donini, Paolo Frasconi, and Massimiliano Pontil. 2017. Forward and reverse gradient-based hyperparameter optimization. In Proceedings of the 34th International Conference on Machine Learning (ICML).Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Luca Franceschi, Paolo Frasconi, Saverio Salzo, Riccardo Grazzi, and Massimiliano Pontil. 2018. Bilevel programming for hyperparameter optimization and meta-learning. In Proceedings of the 35th International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  5. Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. José Miguel Hernández-Lobato, Neil Houlsby, and Zoubin Ghahramani. 2014. Probabilistic matrix factorization with non-random missing data. In Proceedings of the 31st International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  7. Sha Hu, Zhicheng Dou, Xiaojie Wang, Tetsuya Sakai, and Ji-Rong Wen. 2015. Search result diversification based on hierarchical intents. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM).Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Xinting Huang, Jianzhong Qi, Yu Sun, and Rui Zhang. 2020 a. Mala: Cross-domain dialogue generation with action learning. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI).Google ScholarGoogle ScholarCross RefCross Ref
  9. Xinting Huang, Jianzhong Qi, Yu Sun, and Rui Zhang. 2020 b. Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL).Google ScholarGoogle ScholarCross RefCross Ref
  10. Simon Jenni and Paolo Favaro. 2018. Deep Bilevel Learning. In Proceedings of the 15th European Conference on Computer Vision (ECCV).Google ScholarGoogle ScholarCross RefCross Ref
  11. Thorsten Joachims, Adith Swaminathan, and Tobias Schnabel. 2017. Unbiased learning-to-rank with biased feedback. In Proceedings of the 10th ACM International Conference on Web Search and Data Mining (WSDM).Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer (2009).Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Guang Ling, Haiqin Yang, Michael R Lyu, and Irwin King. 2012. Response aware model-based collaborative filtering. In Proceedings of the 28th Conference on Uncertainty in Artificial Intelligence (UAI).Google ScholarGoogle Scholar
  14. Donghua Liu, Jing Li, Bo Du, Jun Chang, and Rong Gao. 2019. DAML: Dual Attention Mutual Learning between Ratings and Reviews for Item Recommendation. In Proceedings of the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD).Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Jiongnan Liu, Zhicheng Dou, Xiaojie Wang, Shuqi Lu, and Ji-Rong Wen. 2020. DVGAN: A Minimax Game for Search Result Diversification Combining Explicit and Implicit Features. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Shuqi Lu, Zhicheng Dou, Chenyan Xiong, Xiaojie Wang, and Ji-Rong Wen. 2020. Knowledge Enhanced Personalized Search. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Dougal Maclaurin, David Duvenaud, and Ryan P Adams. 2015. Gradient-based hyperparameter optimization through reversible learning. In Proceedings of the 32nd International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  18. Benjamin M Marlin and Richard S Zemel. 2009. Collaborative prediction and ranking with non-random missing data. In Proceedings of the 3rd ACM Conference on Recommender Systems (RecSys).Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Benjamin M Marlin, Richard S Zemel, Sam Roweis, and Malcolm Slaney. 2007. Collaborative filtering and the missing at random assumption. In Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (UAI).Google ScholarGoogle Scholar
  20. Fabian Pedregosa. 2016. Hyperparameter optimization with approximate gradient. In Proceedings of the 33rd International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  21. Mengye Ren, Wenyuan Zeng, Bin Yang, and Raquel Urtasun. 2018. Learning to Reweight Examples for Robust Deep Learning. In Proceedings of the 35th International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  22. Masahiro Sato, Sho Takemori, Janmajay Singh, and Tomoko Ohkuma. 2020. Unbiased Learning for the Causal Effect of Recommendation. In Fourteenth ACM Conference on Recommender Systems (RecSys).Google ScholarGoogle Scholar
  23. Tobias Schnabel and Paul N Bennett. 2020. Debiasing Item-to-Item Recommendations With Small Annotated Datasets. In Fourteenth ACM Conference on Recommender Systems (RecSys).Google ScholarGoogle Scholar
  24. Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. 2016. Recommendations as treatments: debiasing learning and evaluation. In Proceedings of the 33rd International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  25. Amirreza Shaban, Ching-An Cheng, Nathan Hatch, and Byron Boots. 2019. Truncated back-propagation for bilevel optimization. In Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS).Google ScholarGoogle Scholar
  26. Ying Shan, T Ryan Hoens, Jian Jiao, Haijing Wang, Dong Yu, and JC Mao. 2016. Deep crossing: Web-scale modeling without manually crafted combinatorial features. In Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD).Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Ilya Shenbin, Anton Alekseev, Elena Tutubalina, Valentin Malykh, and Sergey I Nikolenko. 2020. RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback. In Proceedings of the 13th International Conference on Web Search and Data Mining (WSDM).Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Harald Steck. 2010. Training and testing of recommender systems on data missing not at random. In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD).Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Harald Steck. 2013. Evaluation of recommendations: rating-prediction and ranking. In Proceedings of the 7th ACM Conference on Recommender Systems (RecSys).Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Yixin Su, Rui Zhang, Sarah Erfani, and Zhenghua Xu. 2021. Detecting Beneficial Feature Interactions for Recommender Systems. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI).Google ScholarGoogle ScholarCross RefCross Ref
  31. Adith Swaminathan and Thorsten Joachims. 2015a. Counterfactual risk minimization: learning from logged bandit feedback. In Proceedings of the 32nd International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  32. Adith Swaminathan and Thorsten Joachims. 2015b. The self-normalized estimator for counterfactual learning. In Proceedings of the 28th Conference on Neural Information Processing Systems (NeurIPS).Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Menghan Wang, Mingming Gong, Xiaolin Zheng, and Kun Zhang. 2018a. Modeling dynamic missingness of implicit feedback for recommendation. In Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS).Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Xiaojie Wang, Zhicheng Dou, Tetsuya Sakai, and Ji-Rong Wen. 2016. Evaluating search result diversity using intent hierarchies. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR).Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Xiaojie Wang, Jianzhong Qi, Kotagiri Ramamohanarao, Yu Sun, Bo Li, and Rui Zhang. 2018b. A joint optimization approach for personalized recommendation diversification. In Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD).Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Xiaojie Wang, Ji-Rong Wen, Zhicheng Dou, Tetsuya Sakai, and Rui Zhang. 2017. Search result diversity evaluation based on intent hierarchies. IEEE Transactions on Knowledge and Data Engineering (TKDE) (2017).Google ScholarGoogle Scholar
  37. Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2018c. Kdgan: Knowledge distillation with generative adversarial networks. In Advances in Neural Information Processing Systems (NeurIPS).Google ScholarGoogle Scholar
  38. Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019 a. Adversarial distillation for learning with privileged provisions. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (2019).Google ScholarGoogle Scholar
  39. Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019 b. Doubly robust joint learning for recommendation on data missing not at random. In Proceedings of the 36th International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  40. Longqi Yang, Eugene Bagdasaryan, Joshua Gruenstein, Cheng-Kang Hsieh, and Deborah Estrin. 2018. Openrec: A modular framework for extensible and adaptable recommendation algorithms. In Proceedings of the 11th ACM International Conference on Web Search and Data Mining (WSDM).Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      WSDM '21: Proceedings of the 14th ACM International Conference on Web Search and Data Mining
      March 2021
      1192 pages
      ISBN:9781450382977
      DOI:10.1145/3437963

      Copyright © 2021 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 March 2021

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate498of2,863submissions,17%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader