research-article

Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings

Authors:
Xiaojie Wang

Amazon.com, Inc., Melbourne, VIC, Australia

Amazon.com, Inc., Melbourne, VIC, Australia
View Profile

,
Rui Zhang

The University of Melbourne, Melbourne, VIC, Australia

The University of Melbourne, Melbourne, VIC, Australia
View Profile

,
Yu Sun

Twitter Inc., San Francisco, CA, USA

Twitter Inc., San Francisco, CA, USA
View Profile

,
Jianzhong Qi

The University of Melbourne, Melbourne, VIC, Australia

The University of Melbourne, Melbourne, VIC, Australia
View Profile

WSDM '21: Proceedings of the 14th ACM International Conference on Web Search and Data MiningMarch 2021Pages 427–435https://doi.org/10.1145/3437963.3441799

Published:08 March 2021Publication History

WSDM '21: Proceedings of the 14th ACM International Conference on Web Search and Data Mining

Pages 427–435

ABSTRACT

Recommendation datasets are prone to selection biases due to self-selection behavior of users and item selection process of systems. This makes explicitly combating selection biases an essential problem in training recommender systems. Most previous studies assume no unbiased data available for training. We relax this assumption and assume that a small subset of training data is unbiased. Then, we propose a novel objective that utilizes the unbiased data to adaptively assign propensity weights to biased training ratings. This objective, combined with unbiased performance estimators, alleviates the effects of selection biases on the training of recommender systems. To optimize the objective, we propose an efficient algorithm that minimizes the variance of propensity estimates for better generalized recommender systems. Extensive experiments on two real-world datasets confirm the advantages of our approach in significantly reducing both the error of rating prediction and the variance of propensity estimation.

References

Balázs Csanád Csáji et al. 2001. Approximation with artificial neural networks. Faculty of Sciences, Etvs Lornd University, Hungary (2001).Google Scholar
John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization. Journal of Machine Learning Research (JMLR) (2011).Google Scholar
Luca Franceschi, Michele Donini, Paolo Frasconi, and Massimiliano Pontil. 2017. Forward and reverse gradient-based hyperparameter optimization. In Proceedings of the 34th International Conference on Machine Learning (ICML).Google ScholarDigital Library
Luca Franceschi, Paolo Frasconi, Saverio Salzo, Riccardo Grazzi, and Massimiliano Pontil. 2018. Bilevel programming for hyperparameter optimization and meta-learning. In Proceedings of the 35th International Conference on Machine Learning (ICML).Google Scholar
Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).Google ScholarDigital Library
José Miguel Hernández-Lobato, Neil Houlsby, and Zoubin Ghahramani. 2014. Probabilistic matrix factorization with non-random missing data. In Proceedings of the 31st International Conference on Machine Learning (ICML).Google Scholar
Sha Hu, Zhicheng Dou, Xiaojie Wang, Tetsuya Sakai, and Ji-Rong Wen. 2015. Search result diversification based on hierarchical intents. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM).Google ScholarDigital Library
Xinting Huang, Jianzhong Qi, Yu Sun, and Rui Zhang. 2020 a. Mala: Cross-domain dialogue generation with action learning. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI).Google ScholarCross Ref
Xinting Huang, Jianzhong Qi, Yu Sun, and Rui Zhang. 2020 b. Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL).Google ScholarCross Ref
Simon Jenni and Paolo Favaro. 2018. Deep Bilevel Learning. In Proceedings of the 15th European Conference on Computer Vision (ECCV).Google ScholarCross Ref
Thorsten Joachims, Adith Swaminathan, and Tobias Schnabel. 2017. Unbiased learning-to-rank with biased feedback. In Proceedings of the 10th ACM International Conference on Web Search and Data Mining (WSDM).Google ScholarDigital Library
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer (2009).Google ScholarDigital Library
Guang Ling, Haiqin Yang, Michael R Lyu, and Irwin King. 2012. Response aware model-based collaborative filtering. In Proceedings of the 28th Conference on Uncertainty in Artificial Intelligence (UAI).Google Scholar
Donghua Liu, Jing Li, Bo Du, Jun Chang, and Rong Gao. 2019. DAML: Dual Attention Mutual Learning between Ratings and Reviews for Item Recommendation. In Proceedings of the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD).Google ScholarDigital Library
Jiongnan Liu, Zhicheng Dou, Xiaojie Wang, Shuqi Lu, and Ji-Rong Wen. 2020. DVGAN: A Minimax Game for Search Result Diversification Combining Explicit and Implicit Features. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).Google ScholarDigital Library
Shuqi Lu, Zhicheng Dou, Chenyan Xiong, Xiaojie Wang, and Ji-Rong Wen. 2020. Knowledge Enhanced Personalized Search. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).Google ScholarDigital Library
Dougal Maclaurin, David Duvenaud, and Ryan P Adams. 2015. Gradient-based hyperparameter optimization through reversible learning. In Proceedings of the 32nd International Conference on Machine Learning (ICML).Google Scholar
Benjamin M Marlin and Richard S Zemel. 2009. Collaborative prediction and ranking with non-random missing data. In Proceedings of the 3rd ACM Conference on Recommender Systems (RecSys).Google ScholarDigital Library
Benjamin M Marlin, Richard S Zemel, Sam Roweis, and Malcolm Slaney. 2007. Collaborative filtering and the missing at random assumption. In Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (UAI).Google Scholar
Fabian Pedregosa. 2016. Hyperparameter optimization with approximate gradient. In Proceedings of the 33rd International Conference on Machine Learning (ICML).Google Scholar
Mengye Ren, Wenyuan Zeng, Bin Yang, and Raquel Urtasun. 2018. Learning to Reweight Examples for Robust Deep Learning. In Proceedings of the 35th International Conference on Machine Learning (ICML).Google Scholar
Masahiro Sato, Sho Takemori, Janmajay Singh, and Tomoko Ohkuma. 2020. Unbiased Learning for the Causal Effect of Recommendation. In Fourteenth ACM Conference on Recommender Systems (RecSys).Google Scholar
Tobias Schnabel and Paul N Bennett. 2020. Debiasing Item-to-Item Recommendations With Small Annotated Datasets. In Fourteenth ACM Conference on Recommender Systems (RecSys).Google Scholar
Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. 2016. Recommendations as treatments: debiasing learning and evaluation. In Proceedings of the 33rd International Conference on Machine Learning (ICML).Google Scholar
Amirreza Shaban, Ching-An Cheng, Nathan Hatch, and Byron Boots. 2019. Truncated back-propagation for bilevel optimization. In Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS).Google Scholar
Ying Shan, T Ryan Hoens, Jian Jiao, Haijing Wang, Dong Yu, and JC Mao. 2016. Deep crossing: Web-scale modeling without manually crafted combinatorial features. In Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD).Google ScholarDigital Library
Ilya Shenbin, Anton Alekseev, Elena Tutubalina, Valentin Malykh, and Sergey I Nikolenko. 2020. RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback. In Proceedings of the 13th International Conference on Web Search and Data Mining (WSDM).Google ScholarDigital Library
Harald Steck. 2010. Training and testing of recommender systems on data missing not at random. In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD).Google ScholarDigital Library
Harald Steck. 2013. Evaluation of recommendations: rating-prediction and ranking. In Proceedings of the 7th ACM Conference on Recommender Systems (RecSys).Google ScholarDigital Library
Yixin Su, Rui Zhang, Sarah Erfani, and Zhenghua Xu. 2021. Detecting Beneficial Feature Interactions for Recommender Systems. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI).Google ScholarCross Ref
Adith Swaminathan and Thorsten Joachims. 2015a. Counterfactual risk minimization: learning from logged bandit feedback. In Proceedings of the 32nd International Conference on Machine Learning (ICML).Google Scholar
Adith Swaminathan and Thorsten Joachims. 2015b. The self-normalized estimator for counterfactual learning. In Proceedings of the 28th Conference on Neural Information Processing Systems (NeurIPS).Google ScholarDigital Library
Menghan Wang, Mingming Gong, Xiaolin Zheng, and Kun Zhang. 2018a. Modeling dynamic missingness of implicit feedback for recommendation. In Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS).Google ScholarDigital Library
Xiaojie Wang, Zhicheng Dou, Tetsuya Sakai, and Ji-Rong Wen. 2016. Evaluating search result diversity using intent hierarchies. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR).Google ScholarDigital Library
Xiaojie Wang, Jianzhong Qi, Kotagiri Ramamohanarao, Yu Sun, Bo Li, and Rui Zhang. 2018b. A joint optimization approach for personalized recommendation diversification. In Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD).Google ScholarDigital Library
Xiaojie Wang, Ji-Rong Wen, Zhicheng Dou, Tetsuya Sakai, and Rui Zhang. 2017. Search result diversity evaluation based on intent hierarchies. IEEE Transactions on Knowledge and Data Engineering (TKDE) (2017).Google Scholar
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2018c. Kdgan: Knowledge distillation with generative adversarial networks. In Advances in Neural Information Processing Systems (NeurIPS).Google Scholar
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019 a. Adversarial distillation for learning with privileged provisions. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (2019).Google Scholar
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019 b. Doubly robust joint learning for recommendation on data missing not at random. In Proceedings of the 36th International Conference on Machine Learning (ICML).Google Scholar
Longqi Yang, Eugene Bagdasaryan, Joshua Gruenstein, Cheng-Kang Hsieh, and Deborah Estrin. 2018. Openrec: A modular framework for extensible and adaptable recommendation algorithms. In Proceedings of the 11th ACM International Conference on Web Search and Data Mining (WSDM).Google ScholarDigital Library

Index Terms

Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings
1. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Recommender systems

Recommendations

Selection bias mitigation in recommender system using uninteresting items based on temporal visibility
Highlights
- Modeling pre-use preferences and temporal rating can identify uninteresting items.
Abstract
Most collaborative filtering recommendation algorithms rely too much on the user's historical rating data. However, selection bias is common in explicit feedback data, which makes the learning of user preferences face more challenges. ...
Read More
Unbiased Learning to Rank with Unbiased Propensity Estimation
SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval

Learning to rank with biased click data is a well-known challenge. A variety of methods has been explored to debias click data for learning to rank such as click models, result interleaving and, more recently, the unbiased learning-to-rank framework ...
Read More
Estimation of selected parameters

Modern statistical problems often involve selection of populations (or genes for example) using the observations. After selecting the populations, it is important to estimate the corresponding parameters. These quantities are called the selected ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WSDM '21: Proceedings of the 14th ACM International Conference on Web Search and Data Mining
March 2021
1192 pages
ISBN:9781450382977
DOI:10.1145/3437963
General Chairs:
Liane Lewin-Eytan
Amazon, Israel
,
David Carmel
Amazon, Israel
,
Elad Yom-Tov
Microsoft, Israel
,
Program Chairs:
Eugene Agichtein
Emory University and Amazon, USA
,
Evgeniy Gabrilovich
Google Health, USA
Copyright © 2021 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 8 March 2021
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
propensity estimation
rating prediction
selection bias
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate498of2,863submissions,17%
Upcoming Conference
WSDM '25

Sponsor:

sigir

sigir

sigir

sigir

The Eighteenth ACM International Conference on Web Search and Data Mining

April 7 - 11, 2025

Hannover , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 711
  Total Downloads
- Downloads (Last 12 months)132
- Downloads (Last 6 weeks)23
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings

WSDM '21: Proceedings of the 14th ACM International Conference on Web Search and Data Mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Selection bias mitigation in recommender system using uninteresting items based on temporal visibility

Unbiased Learning to Rank with Unbiased Propensity Estimation

Estimation of selected parameters