skip to main content
10.1145/3511808.3557431acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Representation Matters When Learning From Biased Feedback in Recommendation

Published: 17 October 2022 Publication History

Abstract

The logged feedback for training recommender systems is usually subject to selection bias, which could not reflect real user preference. Thus, many efforts have been made to learn the de-biased recommender system from biased feedback. However, existing methods for dealing with selection bias are usually affected by the error of propensity weight estimation, have high variance, or assume access to uniform data, which is expensive to be collected in practice. In this work, we address these issues by proposing Learning De-biased Representations (LDR), a framework derived from the representation learning perspective. LDR bridges the gap between propensity weight estimation (WE) and unbiased weighted learning (WL) and provides an end-to-end solution that iteratively conducts WE and WL. We show LDR can effectively alleviate selection bias with bounded variance. We also perform theoretical analysis on the statistical properties of LDR, such as its bias, variance, and generalization performance. Extensive experiments on both semi-synthetic and real-world datasets demonstrate the effectiveness of LDR.

References

[1]
Hana Ajakan, Pascal Germain, Hugo Larochelle, Francc ois Laviolette, and Mario Marchand. 2014. Domain-adversarial neural networks. In arXiv.
[2]
Jyoti Aneja, Alex Schwing, Jan Kautz, and Arash Vahdat. 2021. A contrastive learning approach for training variational autoencoder priors. In Conference on Neural Information Processing Systems.
[3]
Steffen Bickel, Michael Brückner, and Tobias Scheffer. 2007. Discriminative learning for differing training and test distributions. In International Conference on Machine Learning.
[4]
Stephen Bonner and Flavian Vasile. 2018. Causal embeddings for recommendation. In ACM Recommender Systems conference.
[5]
Jiawei Chen, Hande Dong, Yang Qiu, Xiangnan He, Xin Xin, Liang Chen, Guli Lin, and Keping Yang. 2021a. AutoDebias: Learning to Debias for Recommendation. In ACM SIGIR Conference on Research and Development in Information Retrieval.
[6]
Jiawei Chen, Hande Dong, Xiang Wang, Fuli Feng, Meng Wang, and Xiangnan He. 2020. Bias and debias in recommender system: A survey and future directions. In arXiv.
[7]
Zhengyu Chen, Sibo Gai, and Donglin Wang. 2019. Deep Tensor Factorization for Multi-Criteria Recommender Systems. In International Conference on Big Data (Big Data). 1046--1051.
[8]
Zhengyu Chen and Donglin Wang. 2021. Multi-Initialization Meta-Learning with Domain Adaptation. In The International Conference on Acoustics, Speech, & Signal Processing. 1390--1394.
[9]
Zhengyu Chen, Ziqing Xu, and Donglin Wang. 2021b. Deep transfer tensor decomposition with orthogonal constraint for recommender systems. In The Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI. 3.
[10]
Corinna Cortes, Yishay Mansour, and Mehryar Mohri. 2010. Learning Bounds for Importance Weighting. In Conference on Neural Information Processing Systems.
[11]
Imre Csiszár. 1967. Information-type measures of difference of probability distributions and indirect observation. In Studia Sci. Math. Hungar.
[12]
Miroslav Dud'ik, John Langford, and Lihong Li. 2011. Doubly robust policy evaluation and learning. In International Conference on Machine Learning.
[13]
Harrison Edwards and Amos J. Storkey. 2016. Censoring Representations with an Adversary. In International Conference on Learning Representations.
[14]
Benjamin Eysenbach, Shreyas Chaudhari, Swapnil Asawa, Sergey Levine, and Ruslan Salakhutdinov. 2020. Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers. In International Conference on Learning Representations.
[15]
Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, Francc ois Laviolette, Mario Marchand, and Victor Lempitsky. 2016. Domain-adversarial training of neural networks. In The Journal of Machine Learning Research.
[16]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In Conference on Neural Information Processing System.
[17]
Negar Hassanpour and Russell Greiner. 2019. CounterFactual Regression with Importance Sampling Weights. In International Joint Conference on Artificial Intelligence.
[18]
Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In The World Wide Web Conference.
[19]
Kurt Hornik, Maxwell Stinchcombe, and Halbert White. 1989. Multilayer feedforward networks are universal approximators. In Neural networks.
[20]
Fredrik Johansson, Uri Shalit, and David Sontag. 2016. Learning representations for counterfactual inference. In International Conference on Machine Learning.
[21]
Fredrik D Johansson, Nathan Kallus, Uri Shalit, and David Sontag. 2018. Learning weighted representations for generalization across designs. In arXiv.
[22]
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. In Computer.
[23]
Zinan Lin, Dugang Liu, Weike Pan, and Zhong Ming. 2021. Transfer Learning in Collaborative Recommendation for Bias Reduction. In ACM Recommender Systems Conference.
[24]
Dugang Liu, Pengxiang Cheng, Zhenhua Dong, Xiuqiang He, Weike Pan, and Zhong Ming. 2020. A general knowledge distillation framework for counterfactual recommendation via uniform data. In ACM SIGIR Conference on Research and Development in Information Retrieval.
[25]
Dugang Liu, Pengxiang Cheng, Hong Zhu, Zhenhua Dong, Xiuqiang He, Weike Pan, and Zhong Ming. 2021. Mitigating Confounding Bias in Recommendation via Information Bottleneck. In ACM Recommender Systems Conference.
[26]
David Madras, Elliot Creager, Toniann Pitassi, and Richard Zemel. 2018. Learning adversarially fair and transferable representations. In International Conference on Machine Learning.
[27]
Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, and Adrian Vladu. 2018. Towards Deep Learning Models Resistant to Adversarial Attacks. In International Conference on Learning Representations.
[28]
Benjamin M Marlin, Richard S Zemel, Sam Roweis, and Malcolm Slaney. 2007. Collaborative filtering and the missing at random assumption. In UAI.
[29]
Jianmo Ni, Jiacheng Li, and Julian McAuley. 2019. Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In Conference on Empirical Methods in Natural Language Processing.
[30]
Sebastian Nowozin, Botond Cseke, and Ryota Tomioka. 2016. f-gan: Training generative neural samplers using variational divergence minimization. In Conference on Neural Information Processing System.
[31]
Steffen Rendle, Walid Krichene, Li Zhang, and John Anderson. 2020. Neural collaborative filtering vs. matrix factorization revisited. In ACM Recommender Systems Conference.
[32]
Alfréd Rényi et al. 1961. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics.
[33]
Noveen Sachdeva, Yi Su, and Thorsten Joachims. 2020. Off-policy bandits with deficient support. In KDD.
[34]
Yuta Saito. 2020a. Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback. In ACM SIGIR Conference on Research and Development in Information Retrieval.
[35]
Yuta Saito. 2020b. Doubly robust estimator for ranking metrics with post-click conversions. In ACM Recommender Systems Conference.
[36]
Yuta Saito, Suguru Yaginuma, Yuta Nishino, Hayato Sakata, and Kazuhide Nakata. 2020. Unbiased recommender learning from missing-not-at-random implicit feedback. In ACM International Conference on Web Search and Data Mining.
[37]
Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. 2016. Recommendations as treatments: Debiasing learning and evaluation. In International Conference on Machine Learning.
[38]
Uri Shalit, Fredrik D Johansson, and David Sontag. 2017. Estimating individual treatment effect: generalization bounds and algorithms. In International Conference on Machine Learning.
[39]
Xinwei Sun, Botong Wu, Xiangyu Zheng, Chang Liu, Wei Chen, Tao Qin, and Tie-Yan Liu. 2021. Recovering Latent Causal Factor for Generalization to Distributional Shifts. In Conference on Neural Information Processing System.
[40]
Adith Swaminathan and Thorsten Joachims. 2015a. Batch learning from logged bandit feedback through counterfactual risk minimization. In The Journal of Machine Learning Research.
[41]
Adith Swaminathan and Thorsten Joachims. 2015b. The self-normalized estimator for counterfactual learning. In Conference on Neural Information Processing System.
[42]
Takeshi Teshima, Issei Sato, and Masashi Sugiyama. 2020. Few-shot domain adaptation by causal mechanism transfer. In International Conference on Machine Learning.
[43]
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019. Doubly robust joint learning for recommendation on data missing not at random. In International Conference on Machine Learning.
[44]
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2021. Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings. In ACM International Conference on Web Search and Data Mining.
[45]
Zifeng Wang, Xi Chen, Rui Wen, Shao-Lun Huang, Ercan Kuruoglu, and Yefeng Zheng. 2020. Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback. In Conference on Neural Information Processing System.
[46]
Teng Xiao and Donglin Wang. 2021. A General Offline Reinforcement Learning Framework for Interactive Recommendation. In AAAI Conference on Artificial Intelligence. 4512--4520.
[47]
Teng Xiao and Suhang Wang. 2022a. Towards Off-Policy Learning for Ranking Policies with Logged Feedback. In AAAI Conference on Artificial Intelligence. 8700--8707.
[48]
Teng Xiao and Suhang Wang. 2022b. Towards unbiased and robust causal ranking for recommender systems. In ACM Conference on Web Search and Data Mining. 1158--1167.
[49]
Yuan Xie, Boyi Liu, Qiang Liu, Zhaoran Wang, Yuan Zhou, and Jian Peng. 2018. Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy. In International Conference on Learning Representations.
[50]
Da Xu, Chuanwei Ruan, Evren Korpeoglu, Sushant Kumar, and Kannan Achan. 2020a. Adversarial Counterfactual Learning and Evaluation for Recommender System. In Conference on Neural Information Processing System.
[51]
Da Xu, Yuting Ye, and Chuanwei Ruan. 2020b. Understanding the role of importance weighting for deep learning. In International Conference on Learning Representations.
[52]
Jinsung Yoon, James Jordon, and Mihaela Van Der Schaar. 2018. GANITE: Estimation of individualized treatment effects using generative adversarial nets. In International Conference on Learning Representations.
[53]
Yu Zheng, Chen Gao, Xiang Li, Xiangnan He, Yong Li, and Depeng Jin. 2021. Disentangling User Interest and Conformity for Recommendation with Causal Embedding. In The Web Conference.
[54]
Ziwei Zhu, Yun He, Yin Zhang, and James Caverlee. 2020. Unbiased Implicit Recommendation and Propensity Estimation via Combinational Joint Learning. In ACM Recommender Systems Conference.

Cited By

View all
  • (2025)Counterfactual Learning on Graphs: A SurveyMachine Intelligence Research10.1007/s11633-024-1519-z22:1(17-59)Online publication date: 24-Jan-2025
  • (2024)CaDRec: Contextualized and Debiased Recommender ModelProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657799(405-415)Online publication date: 10-Jul-2024
  • (2024)Pareto Graph Self-Supervised LearningICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10447557(6630-6634)Online publication date: 14-Apr-2024
  • Show More Cited By

Index Terms

  1. Representation Matters When Learning From Biased Feedback in Recommendation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management
    October 2022
    5274 pages
    ISBN:9781450392365
    DOI:10.1145/3511808
    • General Chairs:
    • Mohammad Al Hasan,
    • Li Xiong
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. counterfactual learning
    2. logged feedback
    3. unbiased learning

    Qualifiers

    • Research-article

    Funding Sources

    • Army Research Office (ARO)
    • National Science Foundation (NSF)

    Conference

    CIKM '22
    Sponsor:

    Acceptance Rates

    CIKM '22 Paper Acceptance Rate 621 of 2,257 submissions, 28%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)37
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Counterfactual Learning on Graphs: A SurveyMachine Intelligence Research10.1007/s11633-024-1519-z22:1(17-59)Online publication date: 24-Jan-2025
    • (2024)CaDRec: Contextualized and Debiased Recommender ModelProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657799(405-415)Online publication date: 10-Jul-2024
    • (2024)Pareto Graph Self-Supervised LearningICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10447557(6630-6634)Online publication date: 14-Apr-2024
    • (2023)Reconsidering Learning Objectives in Unbiased Recommendation: A Distribution Shift PerspectiveProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599487(2764-2775)Online publication date: 6-Aug-2023
    • (2023)High-efficiency Device-Cloud Collaborative Transformer Model2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)10.1109/CVPRW59228.2023.00214(2204-2210)Online publication date: Jun-2023

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media