ABSTRACT
Recommendation datasets are prone to selection biases due to self-selection behavior of users and item selection process of systems. This makes explicitly combating selection biases an essential problem in training recommender systems. Most previous studies assume no unbiased data available for training. We relax this assumption and assume that a small subset of training data is unbiased. Then, we propose a novel objective that utilizes the unbiased data to adaptively assign propensity weights to biased training ratings. This objective, combined with unbiased performance estimators, alleviates the effects of selection biases on the training of recommender systems. To optimize the objective, we propose an efficient algorithm that minimizes the variance of propensity estimates for better generalized recommender systems. Extensive experiments on two real-world datasets confirm the advantages of our approach in significantly reducing both the error of rating prediction and the variance of propensity estimation.
- Balázs Csanád Csáji et al. 2001. Approximation with artificial neural networks. Faculty of Sciences, Etvs Lornd University, Hungary (2001).Google Scholar
- John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization. Journal of Machine Learning Research (JMLR) (2011).Google Scholar
- Luca Franceschi, Michele Donini, Paolo Frasconi, and Massimiliano Pontil. 2017. Forward and reverse gradient-based hyperparameter optimization. In Proceedings of the 34th International Conference on Machine Learning (ICML).Google ScholarDigital Library
- Luca Franceschi, Paolo Frasconi, Saverio Salzo, Riccardo Grazzi, and Massimiliano Pontil. 2018. Bilevel programming for hyperparameter optimization and meta-learning. In Proceedings of the 35th International Conference on Machine Learning (ICML).Google Scholar
- Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).Google ScholarDigital Library
- José Miguel Hernández-Lobato, Neil Houlsby, and Zoubin Ghahramani. 2014. Probabilistic matrix factorization with non-random missing data. In Proceedings of the 31st International Conference on Machine Learning (ICML).Google Scholar
- Sha Hu, Zhicheng Dou, Xiaojie Wang, Tetsuya Sakai, and Ji-Rong Wen. 2015. Search result diversification based on hierarchical intents. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM).Google ScholarDigital Library
- Xinting Huang, Jianzhong Qi, Yu Sun, and Rui Zhang. 2020 a. Mala: Cross-domain dialogue generation with action learning. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI).Google ScholarCross Ref
- Xinting Huang, Jianzhong Qi, Yu Sun, and Rui Zhang. 2020 b. Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL).Google ScholarCross Ref
- Simon Jenni and Paolo Favaro. 2018. Deep Bilevel Learning. In Proceedings of the 15th European Conference on Computer Vision (ECCV).Google ScholarCross Ref
- Thorsten Joachims, Adith Swaminathan, and Tobias Schnabel. 2017. Unbiased learning-to-rank with biased feedback. In Proceedings of the 10th ACM International Conference on Web Search and Data Mining (WSDM).Google ScholarDigital Library
- Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer (2009).Google ScholarDigital Library
- Guang Ling, Haiqin Yang, Michael R Lyu, and Irwin King. 2012. Response aware model-based collaborative filtering. In Proceedings of the 28th Conference on Uncertainty in Artificial Intelligence (UAI).Google Scholar
- Donghua Liu, Jing Li, Bo Du, Jun Chang, and Rong Gao. 2019. DAML: Dual Attention Mutual Learning between Ratings and Reviews for Item Recommendation. In Proceedings of the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD).Google ScholarDigital Library
- Jiongnan Liu, Zhicheng Dou, Xiaojie Wang, Shuqi Lu, and Ji-Rong Wen. 2020. DVGAN: A Minimax Game for Search Result Diversification Combining Explicit and Implicit Features. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).Google ScholarDigital Library
- Shuqi Lu, Zhicheng Dou, Chenyan Xiong, Xiaojie Wang, and Ji-Rong Wen. 2020. Knowledge Enhanced Personalized Search. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).Google ScholarDigital Library
- Dougal Maclaurin, David Duvenaud, and Ryan P Adams. 2015. Gradient-based hyperparameter optimization through reversible learning. In Proceedings of the 32nd International Conference on Machine Learning (ICML).Google Scholar
- Benjamin M Marlin and Richard S Zemel. 2009. Collaborative prediction and ranking with non-random missing data. In Proceedings of the 3rd ACM Conference on Recommender Systems (RecSys).Google ScholarDigital Library
- Benjamin M Marlin, Richard S Zemel, Sam Roweis, and Malcolm Slaney. 2007. Collaborative filtering and the missing at random assumption. In Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (UAI).Google Scholar
- Fabian Pedregosa. 2016. Hyperparameter optimization with approximate gradient. In Proceedings of the 33rd International Conference on Machine Learning (ICML).Google Scholar
- Mengye Ren, Wenyuan Zeng, Bin Yang, and Raquel Urtasun. 2018. Learning to Reweight Examples for Robust Deep Learning. In Proceedings of the 35th International Conference on Machine Learning (ICML).Google Scholar
- Masahiro Sato, Sho Takemori, Janmajay Singh, and Tomoko Ohkuma. 2020. Unbiased Learning for the Causal Effect of Recommendation. In Fourteenth ACM Conference on Recommender Systems (RecSys).Google Scholar
- Tobias Schnabel and Paul N Bennett. 2020. Debiasing Item-to-Item Recommendations With Small Annotated Datasets. In Fourteenth ACM Conference on Recommender Systems (RecSys).Google Scholar
- Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. 2016. Recommendations as treatments: debiasing learning and evaluation. In Proceedings of the 33rd International Conference on Machine Learning (ICML).Google Scholar
- Amirreza Shaban, Ching-An Cheng, Nathan Hatch, and Byron Boots. 2019. Truncated back-propagation for bilevel optimization. In Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS).Google Scholar
- Ying Shan, T Ryan Hoens, Jian Jiao, Haijing Wang, Dong Yu, and JC Mao. 2016. Deep crossing: Web-scale modeling without manually crafted combinatorial features. In Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD).Google ScholarDigital Library
- Ilya Shenbin, Anton Alekseev, Elena Tutubalina, Valentin Malykh, and Sergey I Nikolenko. 2020. RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback. In Proceedings of the 13th International Conference on Web Search and Data Mining (WSDM).Google ScholarDigital Library
- Harald Steck. 2010. Training and testing of recommender systems on data missing not at random. In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD).Google ScholarDigital Library
- Harald Steck. 2013. Evaluation of recommendations: rating-prediction and ranking. In Proceedings of the 7th ACM Conference on Recommender Systems (RecSys).Google ScholarDigital Library
- Yixin Su, Rui Zhang, Sarah Erfani, and Zhenghua Xu. 2021. Detecting Beneficial Feature Interactions for Recommender Systems. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI).Google ScholarCross Ref
- Adith Swaminathan and Thorsten Joachims. 2015a. Counterfactual risk minimization: learning from logged bandit feedback. In Proceedings of the 32nd International Conference on Machine Learning (ICML).Google Scholar
- Adith Swaminathan and Thorsten Joachims. 2015b. The self-normalized estimator for counterfactual learning. In Proceedings of the 28th Conference on Neural Information Processing Systems (NeurIPS).Google ScholarDigital Library
- Menghan Wang, Mingming Gong, Xiaolin Zheng, and Kun Zhang. 2018a. Modeling dynamic missingness of implicit feedback for recommendation. In Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS).Google ScholarDigital Library
- Xiaojie Wang, Zhicheng Dou, Tetsuya Sakai, and Ji-Rong Wen. 2016. Evaluating search result diversity using intent hierarchies. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR).Google ScholarDigital Library
- Xiaojie Wang, Jianzhong Qi, Kotagiri Ramamohanarao, Yu Sun, Bo Li, and Rui Zhang. 2018b. A joint optimization approach for personalized recommendation diversification. In Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD).Google ScholarDigital Library
- Xiaojie Wang, Ji-Rong Wen, Zhicheng Dou, Tetsuya Sakai, and Rui Zhang. 2017. Search result diversity evaluation based on intent hierarchies. IEEE Transactions on Knowledge and Data Engineering (TKDE) (2017).Google Scholar
- Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2018c. Kdgan: Knowledge distillation with generative adversarial networks. In Advances in Neural Information Processing Systems (NeurIPS).Google Scholar
- Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019 a. Adversarial distillation for learning with privileged provisions. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (2019).Google Scholar
- Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019 b. Doubly robust joint learning for recommendation on data missing not at random. In Proceedings of the 36th International Conference on Machine Learning (ICML).Google Scholar
- Longqi Yang, Eugene Bagdasaryan, Joshua Gruenstein, Cheng-Kang Hsieh, and Deborah Estrin. 2018. Openrec: A modular framework for extensible and adaptable recommendation algorithms. In Proceedings of the 11th ACM International Conference on Web Search and Data Mining (WSDM).Google ScholarDigital Library
Index Terms
- Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings
Recommendations
Selection bias mitigation in recommender system using uninteresting items based on temporal visibility
Highlights- Modeling pre-use preferences and temporal rating can identify uninteresting items.
AbstractMost collaborative filtering recommendation algorithms rely too much on the user's historical rating data. However, selection bias is common in explicit feedback data, which makes the learning of user preferences face more challenges. ...
Unbiased Learning to Rank with Unbiased Propensity Estimation
SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information RetrievalLearning to rank with biased click data is a well-known challenge. A variety of methods has been explored to debias click data for learning to rank such as click models, result interleaving and, more recently, the unbiased learning-to-rank framework ...
Estimation of selected parameters
Modern statistical problems often involve selection of populations (or genes for example) using the observations. After selecting the populations, it is important to estimate the corresponding parameters. These quantities are called the selected ...
Comments