skip to main content
10.1145/3357384.3358061acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

HeteSpaceyWalk: A Heterogeneous Spacey Random Walk for Heterogeneous Information Network Embedding

Published: 03 November 2019 Publication History

Abstract

Heterogeneous information network (HIN) embedding has gained increasing interests recently. However, the current way of random-walk based HIN embedding methods have paid few attention to the higher-order Markov chain nature of meta-path guided random walks, especially to the stationarity issue. In this paper, we systematically formalize the meta-path guided random walk as a higher-order Markov chain process,and present a heterogeneous personalized spacey random walk to efficiently and effectively attain the expected stationary distribution among nodes. Then we propose a generalized scalable framework to leverage the heterogeneous personalized spacey random walk to learn embeddings for multiple types of nodes in an HIN guided by a meta-path, a meta-graph, and a meta-schema respectively. We conduct extensive experiments in several heterogeneous networks and demonstrate that our methods substantially outperform the existing state-of-the-art network embedding algorithms.

References

[1]
Austin R. Benson, David F. Gleich, and Lek-Heng Lim. 2017. The Spacey Random Walk: A Stochastic Process for Higher-Order Data. SIAM Rev., Vol. 59, 2 (2017), 321--345.
[2]
Antoine Bordes, Nicolas Usunier, Alberto Garcia-Duran, Jason Weston, and Oksana Yakhnenko. 2013. Translating Embeddings for Modeling Multi-relational Data. In NIPS. 2787--2795.
[3]
Guillaume Bouchard, Dawei Yin, and Shengbo Guo. 2013. Convex Collective Matrix Factorization. In AISTATS. 144--152.
[4]
Hongyun Cai, Vincent W Zheng, and Kevin Chen-Chuan Chang. 2018. A comprehensive survey of graph embedding: Problems, techniques, and applications. TKDE, Vol. 30, 9 (2018), 1616--1637.
[5]
Shiyu Chang, Wei Han, Jiliang Tang, Guo-Jun Qi, Charu C Aggarwal, and Thomas S Huang. 2015. Heterogeneous network embedding via deep architectures. In KDD. ACM, 119--128.
[6]
Yuxiao Dong, Nitesh V. Chawla, and Ananthram Swami. 2017. metapath2vec: Scalable Representation Learning for Heterogeneous Networks. In KDD . 135--144.
[7]
Yuan Fang, Wenqing Lin, Vincent Wenchen Zheng, Min Wu, Kevin Chen-Chuan Chang, and Xiaoli Li. 2016. Semantic proximity search on graphs with metagraph-based learning. In ICDE . 277--288.
[8]
Tao-Yang Fu, Wang-Chien Lee, and Zhen Lei. 2017. HIN2Vec: Explore Meta-paths in Heterogeneous Information Networks for Representation Learning. In CIKM. 1797--1806.
[9]
Bent Fuglede and Flemming Topsoe. 2004. Jensen-Shannon divergence and Hilbert space embedding. In Information Theory, 2004. ISIT 2004. Proceedings. International Symposium on. IEEE, 31.
[10]
Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable Feature Learning for Networks. In KDD. 855--864.
[11]
Binbin Hu, Chuan Shi, Wayne Xin Zhao, and Philip S. Yu. 2018. Leveraging Meta-path based Context for Top- N Recommendation with A Neural Co-Attention Model. In KDD. ACM, 1531--1540.
[12]
Zhipeng Huang, Yudian Zheng, Reynold Cheng, Yizhou Sun, Nikos Mamoulis, and Xiang Li. 2016. Meta Structure: Computing Relevance in Large Heterogeneous Information Networks. In SIGKDD . 1595--1604.
[13]
Jan Kralj, Marko Robnik-vS ikonja, and Nada Lavravc. 2018. HINMINE: heterogeneous information network mining with information retrieval heuristics. Journal of Intelligent Information Systems, Vol. 50, 1 (2018), 29--61.
[14]
Ni Lao and William W. Cohen. 2010. Relational retrieval using a combination of path-constrained random walks. Machine Learning, Vol. 81, 1 (2010), 53--67.
[15]
Wen Li and Michael K Ng. 2014. On the limiting probability distribution of a transition probability tensor. Linear and Multilinear Algebra, Vol. 62, 3 (2014), 362--385.
[16]
Koji Maruhashi, Fan Guo, and Christos Faloutsos. 2012. MultiAspectForensics: mining large heterogeneous networks using tensor. Int. J. Web Eng. Technol., Vol. 7, 4 (2012), 302--322.
[17]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013a. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).
[18]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013b. Distributed representations of words and phrases and their compositionality. In NIPS . 3111--3119.
[19]
Maximilian Nickel, Kevin Murphy, Volker Tresp, and Evgeniy Gabrilovich. 2016. A Review of Relational Machine Learning for Knowledge Graphs. Proc. IEEE, Vol. 104, 1 (2016), 11--33.
[20]
Maximilian Nickel, Volker Tresp, and Hans-Peter Kriegel. 2012. Factorizing YAGO: scalable machine learning for linked data. In WWW. 271--280.
[21]
Evangelos E. Papalexakis, Christos Faloutsos, and Nicholas D. Sidiropoulos. 2017. Tensors for Data Mining and Data Fusion: Models, Applications, and Scalable Algorithms. ACM TIST, Vol. 8, 2 (2017), 16:1--16:44.
[22]
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: online learning of social representations. In KDD . 701--710.
[23]
Jiezhong Qiu, Yuxiao Dong, Hao Ma, Jian Li, Kuansan Wang, and Jie Tang. 2018. Network embedding as matrix factorization: Unifying deepwalk, line, pte, and node2vec. In WSDM. ACM, 459--467.
[24]
Xiang Ren, Ahmed El-Kishky, Chi Wang, and Jiawei Han. 2015a. Automatic Entity Recognition and Typing from Massive Text Corpora: A Phrase and Network Mining Approach. In KDD. 2319--2320.
[25]
Xiang Ren, Ahmed El-Kishky, Chi Wang, Fangbo Tao, Clare R. Voss, and Jiawei Han. 2015b. ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering. In SIGKDD . 995--1004.
[26]
Jingbo Shang, Meng Qu, Jialu Liu, Lance M Kaplan, Jiawei Han, and Jian Peng. 2016. Meta-path guided embedding for similarity search in large-scale heterogeneous information networks. arXiv preprint arXiv:1610.09769 (2016).
[27]
Chuan Shi, Binbin Hu, Xin Zhao, and Philip Yu. 2018a. Heterogeneous Information Network Embedding for Recommendation. IEEE Transactions on Knowledge and Data Engineering (2018).
[28]
Chuan Shi, Xiangnan Kong, Yue Huang, S Yu Philip, and Bin Wu. 2014. HeteSim: A General Framework for Relevance Measure in Heterogeneous Networks. IEEE Trans. Knowl. Data Eng., Vol. 26, 10 (2014), 2479--2492.
[29]
Chuan Shi, Yitong Li, Jiawei Zhang, Yizhou Sun, and S Yu Philip. 2017. A survey of heterogeneous information network analysis. TKDE, Vol. 29, 1 (2017), 17--37.
[30]
Chuan Shi, Zhiqiang Zhang, Ping Luo, Philip S Yu, Yading Yue, and Bin Wu. 2015. Semantic path based personalized recommendation on weighted heterogeneous information networks. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, 453--462.
[31]
Yu Shi, Qi Zhu, Fang Guo, Chao Zhang, and Jiawei Han. 2018b. Easing Embedding Learning by Comprehensive Transcription of Heterogeneous Information Networks. In KDD. ACM, 2190--2199.
[32]
Ajit P. Singh and Geoffrey J. Gordon. 2008. Relational Learning via Collective Matrix Factorization. In KDD . 650--658.
[33]
Lichao Sun, Lifang He, Zhipeng Huang, Bokai Cao, Congying Xia, Xiaokai Wei, and S Yu Philip. 2018. Joint embedding of meta-path and meta-graph for heterogeneous information networks. In ICBK . IEEE, 131--138.
[34]
Yizhou Sun and Jiawei Han. 2012. Mining heterogeneous information networks: principles and methodologies. Synthesis Lectures on Data Mining and Knowledge Discovery, Vol. 3, 2 (2012), 1--159.
[35]
Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S. Yu, and Tianyi Wu. 2011. PathSim: Meta Path-Based Top-K Similarity Search in Heterogeneous Information Networks. PVLDB (2011), 992--1003.
[36]
Yizhou Sun, Brandon Norick, Jiawei Han, Xifeng Yan, Philip S. Yu, and Xiao Yu. 2012. Integrating meta-path selection with user-guided object clustering in heterogeneous information networks. In KDD. 1348--1356.
[37]
Jian Tang, Meng Qu, and Qiaozhu Mei. 2015a. PTE: Predictive Text Embedding through Large-scale Heterogeneous Text Networks. (2015), 1165--1174.
[38]
Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015b. LINE: Large-scale Information Network Embedding. In WWW. 1067--1077.
[39]
Xiao Yu, Xiang Ren, Yizhou Sun, Quanquan Gu, Bradley Sturt, Urvashi Khandelwal, Brandon Norick, and Jiawei Han. 2014. Personalized entity recommendation: a heterogeneous information network approach. In WSDM . 283--292.
[40]
Chuxu Zhang, Ananthram Swami, and Nitesh V Chawla. 2018a. CARL: Content-Aware Representation Learning for Heterogeneous Networks. arXiv preprint arXiv:1805.04983 (2018).
[41]
Daokun Zhang, Jie Yin, Xingquan Zhu, and Chengqi Zhang. 2018b. MetaGraph2Vec: Complex Semantic Path Augmented Heterogeneous Network Embedding. In PAKDD . Springer, 196--208.
[42]
Huan Zhao, Quanming Yao, Jianda Li, Yangqiu Song, and Dik Lun Lee. 2017. Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks. In KDD. ACM, 635--644.
[43]
Jing Zheng, Jian Liu, Chuan Shi, Fuzhen Zhuang, Jingzhi Li, and Bin Wu. 2017. Recommendation in heterogeneous information network via dual similarity regularization. International Journal of Data Science and Analytics, Vol. 3, 1 (2017), 35--48.

Cited By

View all
  • (2025)Predicting drug combination side effects based on a metapath-based heterogeneous graph neural networkBMC Bioinformatics10.1186/s12859-024-06028-626:1Online publication date: 15-Jan-2025
  • (2025)Navigating complexity: a comprehensive review of heterogeneous information networks and embedding techniquesKnowledge and Information Systems10.1007/s10115-025-02357-xOnline publication date: 13-Feb-2025
  • (2024)LHGCN: A Laminated Heterogeneous Graph Convolutional Network for Modeling User–Item Interaction in E-CommerceSymmetry10.3390/sym1612169516:12(1695)Online publication date: 21-Dec-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management
November 2019
3373 pages
ISBN:9781450369763
DOI:10.1145/3357384
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. heterogeneous networks
  2. network embedding
  3. random walk

Qualifiers

  • Research-article

Funding Sources

  • Early Career Scheme
  • NSFC

Conference

CIKM '19
Sponsor:

Acceptance Rates

CIKM '19 Paper Acceptance Rate 202 of 1,031 submissions, 20%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)46
  • Downloads (Last 6 weeks)4
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Predicting drug combination side effects based on a metapath-based heterogeneous graph neural networkBMC Bioinformatics10.1186/s12859-024-06028-626:1Online publication date: 15-Jan-2025
  • (2025)Navigating complexity: a comprehensive review of heterogeneous information networks and embedding techniquesKnowledge and Information Systems10.1007/s10115-025-02357-xOnline publication date: 13-Feb-2025
  • (2024)LHGCN: A Laminated Heterogeneous Graph Convolutional Network for Modeling User–Item Interaction in E-CommerceSymmetry10.3390/sym1612169516:12(1695)Online publication date: 21-Dec-2024
  • (2024)FlowWalker: A Memory-Efficient and High-Performance GPU-Based Dynamic Graph Random Walk FrameworkProceedings of the VLDB Endowment10.14778/3659437.365943817:8(1788-1801)Online publication date: 1-Apr-2024
  • (2024)On Efficient Large Sparse Matrix Chain MultiplicationProceedings of the ACM on Management of Data10.1145/36549592:3(1-27)Online publication date: 30-May-2024
  • (2024)A Learning-path based Supervised Method for Concept Prerequisite Relations Extraction in Educational DataProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679597(2168-2177)Online publication date: 21-Oct-2024
  • (2024)Coarse-to-Fine Robust Heterogeneous Network Representation Learning Without MetapathIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.344572411:6(5773-5789)Online publication date: Nov-2024
  • (2024)Heterogeneous Graph Contrastive Learning With Metapath-Based AugmentationsIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2023.33223418:1(1003-1014)Online publication date: Feb-2024
  • (2024)A Heterogeneous Graph Neural Network With Attribute Enhancement and Structure-Aware AttentionIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.323903411:1(829-838)Online publication date: Feb-2024
  • (2024)GERWR: Identifying the Key Pathogenicity- Associated sRNAs of Magnaporthe Oryzae Infection in Rice Based on Graph Embedding and Random Walk With RestartIEEE/ACM Transactions on Computational Biology and Bioinformatics10.1109/TCBB.2023.334808021:2(227-239)Online publication date: Mar-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media