skip to main content
10.1145/3447548.3467205acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Purify and Generate: Learning Faithful Item-to-Item Graph from Noisy User-Item Interaction Behaviors

Published: 14 August 2021 Publication History

Abstract

Matching is almost the first and most fundamental step in recommender systems, that is to quickly select hundreds or thousands of related entities from the whole commodity pool. Among all the matching methods, item-to-item (I2I) graph based matching is a handy and highly effective approach and is widely used in most applications, owing to the essential relationships of entities described in a powerful I2I graph. Yet, the I2I graph is not a ready-made product in a data source. To obtain it from users' behaviors, a common practice in the industry is to construct the graph based on the similarity of item embeddings or co-occurrence frequency directly. However, these methods tend to lose the complicated correlations (high-ordered or nonlinear) inside decision-making actions and cannot achieve the global optimal solution. Moreover, the correlations between items are usually contained in users' short-term actions, which are full of noise information (e.g. spurious association, missing connection). It is vitally important to filter out noise while generating the graph. In this paper, we propose a novel framework called Purified Graph Generation (PGG) dedicated to learn faithful I2I graph from sparse and noisy behavior data. We capture the 'confidence value' between user and item to get rid of exception action during decision making, and leverage it to re-sample purified sets that are fed into an unsupervised I2I graph structure learning framework called GPBG. Extensive experimental results from both simulation and real data demonstrate that our method could significantly benefit the performance of I2I graph compared to the typical baselines.

Supplementary Material

MP4 File (purify_and_generate_learning_faithful-yue_he-yancheng_dong-38958174-2KL2.mp4)
Among all the matching methods, item-to-item (I2I) graph based matching is a handy and highly effective approach and is widely used in most applications. Yet, the I2I graph is not a ready-made product in a data source. To obtain it from users' behaviors, a common practice in the industry is to generate the graph in constructive way. However, these methods tend to lose the complicated correlations inside decision-making actions and cannot achieve the global optimal solution. And the correlations between items are usually contained in users' short-term actions, which are full of noise information. It is vitally important to filter out noise while generating the graph. In this paper, we propose a novel framework called Purified Graph Generation (PGG) dedicated to learn faithful I2I graph from sparse and noisy behavior data. We capture the `confidence value' between user to item and leverage it to re-sample purified sets that are fed into an unsupervised I2I graph structure learning framework called GPBG.

References

[1]
Aleksandar Bojchevski, Oleksandr Shchur, Daniel Zügner, and Stephan Günnemann. 2018. Netgan: Generating graphs via random walks. In International Conference on Machine Learning. PMLR, 610--619.
[2]
Nicola De Cao and Thomas Kipf. 2018. MolGAN: An implicit generative model for small molecular graphs. arXiv preprint arXiv:1805.11973 (2018).
[3]
Zhengxiao Du, Xiaowei Wang, Hongxia Yang, Jingren Zhou, and Jie Tang. 2019. Sequential scenario-specific meta learner for online recommendation. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2895--2904.
[4]
Luca Franceschi, Mathias Niepert, Massimiliano Pontil, and Xiao He. 2019. Learning discrete structures for graph neural networks. In International conference on machine learning. PMLR, 1972--1982.
[5]
Matt W Gardner and SR Dorling. 1998. Artificial neural networks (the multilayer perceptron)-a review of applications in the atmospheric sciences. Atmospheric environment, Vol. 32, 14--15 (1998), 2627--2636.
[6]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in neural information processing systems. 2672--2680.
[7]
Xiangnan He, Kuan Deng, Xiang Wang, Yan Li, Yongdong Zhang, and Meng Wang. 2020 b. Lightgcn: Simplifying and powering graph convolution network for recommendation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 639--648.
[8]
Yue He, Peng Cui, Jianxin Ma, Hao Zou, and Philip S. Yu. 2020 a. Learning Stable Graphs from Multiple Environments with Selection Bias. In KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining .
[9]
Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013).
[10]
Thomas N Kipf and Max Welling. 2016. Semi-Supervised Classification with Graph Convolutional Networks. (2016).
[11]
Tengfei Ma, Jie Chen, and Cao Xiao. 2018. Constrained generation of semantically valid graphs via regularizing variational autoencoders. arXiv preprint arXiv:1809.02630 (2018).
[12]
Yutaka Matsuo and Mitsuru Ishizuka. 2004. Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools, Vol. 13, 01 (2004), 157--169.
[13]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. Pytorch: An imperative style, high-performance deep learning library. arXiv preprint arXiv:1912.01703 (2019).
[14]
Darwin Saire Pilco and Ad'in Ram'irez Rivera. 2019. Graph learning network: A structure learning algorithm. arXiv preprint arXiv:1905.12665 (2019).
[15]
Kihyuk Sohn, Honglak Lee, and Xinchen Yan. 2015. Learning structured output representation using deep conditional generative models. Advances in neural information processing systems, Vol. 28 (2015), 3483--3491.
[16]
Rakshit Trivedi, Jiachen Yang, and Hongyuan Zha. 2020. Graphopt: Learning optimization models of graph formation. In International Conference on Machine Learning. PMLR, 9603--9613.
[17]
Jizhe Wang, Pipei Huang, Huan Zhao, Zhibo Zhang, Binqiang Zhao, and Dik Lun Lee. 2018. Billion-scale commodity embedding for e-commerce recommendation in alibaba. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 839--848.
[18]
Ziyang Wang, Wei Wei, Gao Cong, Xiao-Li Li, Xian-Ling Mao, and Minghui Qiu. 2020. Global context enhanced graph neural networks for session-based recommendation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 169--178.
[19]
Jiaxuan You, Rex Ying, Xiang Ren, William Hamilton, and Jure Leskovec. 2018. Graphrnn: Generating realistic graphs with deep auto-regressive models. In International Conference on Machine Learning. PMLR, 5708--5717.

Cited By

View all
  • (2024)UniEmbedding: Learning Universal Multi-Modal Multi-Domain Item Embeddings via User-View Contrastive LearningProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680098(4446-4453)Online publication date: 21-Oct-2024
  • (2024)DISS-CF: Direct Item Session Similarity Enhanced Collaborative Filtering Method for Recommendation2024 IEEE International Conference on Web Services (ICWS)10.1109/ICWS62655.2024.00054(320-329)Online publication date: 7-Jul-2024
  • (2023)TAG: Joint Triple-Hierarchical Attention and GCN for Review-Based Social Recommender SystemIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.319495235:10(9904-9919)Online publication date: 1-Oct-2023
  • Show More Cited By

Index Terms

  1. Purify and Generate: Learning Faithful Item-to-Item Graph from Noisy User-Item Interaction Behaviors

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '21: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining
    August 2021
    4259 pages
    ISBN:9781450383325
    DOI:10.1145/3447548
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 14 August 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. denoise
    2. item2item graph
    3. recommender
    4. structure learning

    Qualifiers

    • Research-article

    Conference

    KDD '21
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)78
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 08 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)UniEmbedding: Learning Universal Multi-Modal Multi-Domain Item Embeddings via User-View Contrastive LearningProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680098(4446-4453)Online publication date: 21-Oct-2024
    • (2024)DISS-CF: Direct Item Session Similarity Enhanced Collaborative Filtering Method for Recommendation2024 IEEE International Conference on Web Services (ICWS)10.1109/ICWS62655.2024.00054(320-329)Online publication date: 7-Jul-2024
    • (2023)TAG: Joint Triple-Hierarchical Attention and GCN for Review-Based Social Recommender SystemIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.319495235:10(9904-9919)Online publication date: 1-Oct-2023
    • (2023)A GCN-based Model for Recommendation Using Local Differential Privacy Method2023 IEEE International Conference on High Performance Computing & Communications, Data Science & Systems, Smart City & Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys)10.1109/HPCC-DSS-SmartCity-DependSys60770.2023.00035(194-201)Online publication date: 17-Dec-2023

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media