Abstract
With the explosive growth of online information, many recommendation methods have been proposed. This research direction is boosted with deep learning architectures, especially the recently proposed graph convolutional networks (GCNs). GCNs have shown tremendous potential in graph embedding learning thanks to its inductive inference property. However, most of the existing GCN-based methods focus on solving tasks in the homogeneous graph settings, and none of them considers heterogeneous graph settings. In this paper, we bridge the gap by developing a novel framework called HeteGraph based on the GCN principles. HeteGraph can handle heterogeneous graphs in the recommender systems. Specifically, we propose a sampling technique and a graph convolutional operation to learn high-quality graph’s node embeddings, which differs from the traditional GCN approaches where a full graph adjacency matrix is needed for the embedding learning. We design two models based on the HeteGraph framework to evaluate two important recommendation tasks, namely item rating prediction and diversified item recommendations. Extensive experiments show the encouraging performance of HeteGraph on the first task and the state-of-the-art performance on the second task.
Similar content being viewed by others
Availability of data and material
Public dataset MovieLens is available at: https://grouplens.org/datasets/movielens
Public dataset Book-Crossing is available at: www2.informatik.uni-freiburg.de/~cziegler/BX.
Code availability
The source code of HeteGraph is available at: http://github.com/heroddaji/dai_hetegraph.
References
Adomavicius G, Tuzhilin A (2005) Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans Knowl Data Eng 17(6):734–749
Koren Y, Bell RM, Volinsky C (2009) Matrix factorization techniques for recommender systems. IEEE Comput 42(8):30–37
Hidasi B, Karatzoglou A (2018) Recurrent neural networks with Top-k gains for session-based recommendations. In: Proceedings of the 27th ACM international conference on information and knowledge management (CIKM 2018), Torino, Italy, pp 843–852
Sedhain S, Menon AK, Sanner S, Xie L (2015) AutoRec: autoencoders meet collaborative filtering. In: Proceedings of the 24th international conference on world wide web companion volume (WWW 2015), Florence, Italy, pp 111–112
Kang W-C, McAuley J (2018) Self-attentive sequential recommendation. In: Proceedings of the IEEE international conference on data mining (ICDM 2018), pp 197–206, Singapore
Liu F, Xue S, Wu J, Zhou C, Hu W, Paris C, Nepal S, Yang J, Yu PS (2020) Deep learning for community detection: Progress, challenges and opportunities. In: Proceedings of the twenty-ninth international joint conference on artificial intelligence, IJCAI-20, pp 4981–4987
Hamilton WL, Ying R, Leskovec J (2017) Representation learning on graphs: methods and applications. IEEE Data Eng Bull 40(3):52–74
Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: Proceedings of the 5th international conference on learning representations (ICLR 2017), Toulon, France
Hamilton WL, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. In: Proceedings of the 31st annual conference on neural information processing systems (NIPS 2017), Long Beach, CA, USA, pp 1025–1035
Ng AY, Jordan MI, Weiss Y (2001) On spectral clustering: analysis and an algorithm. In: Proceedings of the 15th annual conference on neural information processing systems (NIPS 2001), Vancouver, British Columbia, Canada, pp 849–856
Page L, Brin S, Motwani R, Winograd T (1999) The pagerank citation ranking: Bringing order to the web. Technical report, Stanford InfoLab
Kruskal JB (1964) Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrika 29(1):1–27
Perozzi B, Al-Rfou R, Skiena S (2014) DeepWalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining (KDD 2014), New York, NY, USA, pp 701–710
Tang J, Qu M, Wang M, Zhang M, Yan J, Mei Q (2015) LINE: large-scale information network embedding. In: Proceedings of the 24th international conference on world wide web (WWW 2015), Florence, Italy, pp 1067–1077
Wang D, Cui P, Zhu W (2016) Structural Deep Network Embedding. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2016), San Francisco, CA, USA, pp 1225–1234
Grover A, Leskovec J (2016) Node2vec: Scalable Feature Learning for Networks. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (KDD 2016), San Francisco, CA, USA, pp 855–864
Cao S, Lu W, Xu Q (2015) GraRep: Learning Graph Representations with Global Structural Information. In: Proceedings of the 24th ACM international conference on information and knowledge management (CIKM 2015), Melbourne, VIC, Australia, pp 891–900
Yang Z, Cohen WW, Salakhutdinov R (2016) Revisiting Semi-Supervised Learning with Graph Embeddings. In: Proceedings of the 33nd international conference on machine learning (ICML 2016), New York City, NY, USA, pp 40–48
Shervashidze N, Schweitzer P, van Leeuwen EJ, Mehlhorn K, Borgwardt KM (2011) Weisfeiler-lehman graph kernels. J Mach Learn Res 12:2539–2561
Dai H, Dai B, Song L (2016) Discriminative embeddings of latent variable models for structured data. In: Proceedings of the 33nd international conference on machine learning (ICML 2016), New York City, NY, USA, pp 2702–2711
Beck D, Haffari G, Cohn T (2018) Graph-to-Sequence Learning using Gated Graph Neural Networks. In: Proceedings of the 56th annual meeting of the association for computational linguistics (ACL 2018), Melbourne, Australia, pp 273–283
Scarselli F, Gori M, Tsoi AC, Markus H, Gabriele M (2009) The Graph Neural Network Model. IEEE Trans Neural Netw 20(1):61–80
Bruna J, Zaremba W, Szlam A, LeCun Y (2014) Spectral networks and locally connected networks on graphs. In: Proceedings of the 2nd international conference on learning representations (ICLR 2014), Banff, AB, Canada
Bronstein MM, Bruna J, LeCun Y, Szlam A, Vandergheynst P (2017) Geometric deep learning: going beyond Euclidean data. IEEE Signal Process Mag 34(4):18–42
Defferrard M, Bresson X, Vandergheynst P (2016) Convolutional neural networks on graphs with fast localized spectral filtering. In: Proceedings of the 30th annual conference on neural information processing systems (NIPS 2016), Barcelona, Spain, pp 3837–3845
Duvenaud DK, Maclaurin D, Aguilera-Iparraguirre J, Gómez-Bombarelli R, Hirzel T, Aspuru-Guzik A, Adams RP (2015) Convolutional Networks on Graphs for Learning Molecular Fingerprints. In: Proceedings of the 29th annual conference on neural information processing systems (NIPS 2015), Montreal, Quebec, Canada, pp 2224–2232
Monti F, Bronstein MM, Bresson X (2017) Geometric Matrix Completion with Recurrent Multi-Graph Neural Networks. In: Proceedings of the 31st Annual Conference on Neural Information Processing Systems (NIPS 2017), pages 3700–3710, Long Beach, CA, USA
Zitnik M, Agrawal M, Leskovec J (2018) Modeling polypharmacy side effects with graph convolutional networks. Bioinformatics 34(13):i457–i466
van den Berg R, Kipf TN, Welling M (2017) Graph convolutional matrix completion. CoRR, arXiv:abs/1706.02263
Cheng H-T, Koc L, Harmsen J, Shaked T, Chandra T, Aradhye H, Anderson G, Corrado G, Chai W, Ispir M, Anil R, Haque Z, Hong L, Jain V, Liu X, Shah H (September 2016) Wide & Deep Learning for Recommender Systems. In: Proceedings of the 1st Workshop on Deep Learning for Recommender Systems (DLRS@RecSys 2016), Boston, MA, USA, pp 7–10
Ying R, He R, Chen K, Eksombatchai P, Hamilton WL, Leskovec J (August 2018) Graph Convolutional Neural Networks for Web-Scale Recommender Systems. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining (KDD 2018), London, UK, pp 974–983
Pennington J, Socher R, Christopher D (2014) Manning. Glove: Global vectors for word representation. In: EMNLP 2014, ACL, pp 1532–1543
Cho K, van Merrienboer B, Gülçehre Ç, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using RNN encoder-decoder for statistical machine translation. EMNLP 2014:1724–1734
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Proceedings of the 27th annual conference on neural information processing systems (NIPS 2013), pp 3111–3119, Lake Tahoe, Nevada, USA
Linden G, Smith B, York J (2003) Amazon.com Recommendations: Item-to-Item Collaborative Filtering. IEEE Internet Comput 7(1):76–80
Gomez-Uribe CA, Hunt N (2016) The netflix recommender system: algorithms, business value, and innovation. ACM Trans Manag Inf Syst 6(4):13:1–13:19
Wilhelm M, Ramanathan A, Bonomo A, Jain S, Chi EH, Jennifer G (2018) Practical diversified recommendations on youtube with determinantal point processes. In: Proceedings of the 27th ACM international conference on information and knowledge management, CIKM 2018, Torino, Italy, October 22-26, 2018, pp 2165–2173. ACM
Ziegler C-N, McNee SM, Konstan JA, Lausen G (2005) Improving recommendation lists through topic diversification. In: Proceedings of the 14th international conference on world wide web, WWW, Chiba, Japan, pp 22–32
Lemire D, Maclachlan A (2005) Slope one predictors for online rating-based collaborative filtering. In: Proceedings of the 2005 SIAM international conference on data mining, SDM 2005, Newport Beach, CA, USA, pp 471–475
Luo X, Zhou M, Xia Y, Zhu Q (2014) An efficient non-negative matrix-factorization-based approach to collaborative filtering for recommender systems. IEEE Trans Ind Inform 10(2):1273–1284
Sarwar BM, Karypis G, Konstan JA, Riedl J (2001) Item-based collaborative filtering recommendation algorithms. In: Proceedings of the tenth international world wide web conference, WWW 10, pp 285–295
Graves A (2013) Generating sequences with recurrent neural networks. CoRR, arXiv:abs/1308.0850
Zeiler MD (2012) ADADELTA: an adaptive learning rate method. CoRR, arXiv:abs/1212.5701
Duchi JC, Hazan E, Singer Y (2011) Adaptive subgradient methods for online learning and stochastic optimization. J Mach Learn Res 12:2121–2159
Kingma DP, Ba J (2015) Adam: A method for stochastic optimization. In: 3rd International conference on learning representations, ICLR conference track proceedings. San Diego, CA, USA, (May 2015)
Sutskever I, Martens J, Dahl GE, Hinton GE (2013) On the importance of initialization and momentum in deep learning. In: Proceedings of the 30th international conference on machine learning, ICML 2013, vol 28., Atlanta, GA, USA, pp 1139–1147
Funding
This study was funded by Australian Research Council Discovery Project ARC, Grant number DP200102298.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
Author Quan Z. Sheng has received research grants from Company Australian Research Council.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tran, D.H., Sheng, Q.Z., Zhang, W.E. et al. HeteGraph: graph learning in recommender systems via graph convolutional networks. Neural Comput & Applic 35, 13047–13063 (2023). https://doi.org/10.1007/s00521-020-05667-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-020-05667-z