research-article

Collective Representation Learning on Spatiotemporal Heterogeneous Information Networks

Authors:

Dakshak Keerthi Chandra,

Jennifer Leopold,

Yanjie FuAuthors Info & Claims

SIGSPATIAL '19: Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pages 319 - 328

https://doi.org/10.1145/3347146.3359104

Published: 05 November 2019 Publication History

Abstract

Representation learning is a technique that is used to capture the underlying latent features of complex data. Representation learning on networks has been widely implemented for learning network structure and embedding it in a low dimensional vector space. In recent years, network embedding using representation learning has attracted increasing attention, and many deep architectures have been widely proposed. However, existing network embedding techniques ignore the multi-class spatial and temporal relationships that crucially reflect the complex nature among vertices and links in spatiotemporal heterogeneous information networks(SHINs).

To address this problem, in this paper, we present two types of collective representation learning models for spatiotemporal heterogeneous information network embedding (SHNE). 1) We propose a model called Multilingual SHNE (M-SHNE); the proposed model leverages the use of random walks along with multilingual word embedding technique used in natural language processing (NLP) to collectively learn the spatiotemporal proximity measures between vertices in SHINs and preserve it in a low dimensional vector space. 2) We propose a second method called Meta path Constrained Random walk SHNE (MCR-SHNE) that combines the advantage of meta path counting algorithm, path constrained random walks, and word embedding technique to generate lower dimensional embeddings that preserve the spatiotemporal proximity measures in SHINs. Experimental results demonstrate the effectiveness of our two proposed models over state-of-the-art algorithms on real-world datasets.

References

[1]

Marco Baroni, Georgiana Dinu, and Germán Kruszewski. 2014. Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vol. 1. 238--247.

[2]

Mikhail Belkin and Partha Niyogi. 2002. Laplacian eigenmaps and spectral techniques for embedding and clustering. In Advances in neural information processing systems. 585--591.

[3]

Ulrik Brandes. 2001. A faster algorithm for betweenness centrality. Journal of mathematical sociology 25, 2 (2001), 163--177.

[4]

José Camacho-Collados, Mohammad Taher Pilehvar, and Roberto Navigli. 2016. Nasari: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities. Artificial Intelligence 240 (2016), 36--64.

[5]

Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2015. Grarep: Learning graph representations with global structural information. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, 891--900.

Digital Library

[6]

Yanjie Fu, Yong Ge, Yu Zheng, Zijun Yao, Yanchi Liu, Hui Xiong, and Jing Yuan. 2014. Sparse real estate ranking with online user reviews and offline moving behaviors. In 2014 IEEE International Conference on Data Mining. IEEE, 120--129.

Digital Library

[7]

Yanjie Fu, Guannan Liu, Spiros Papadimitriou, Hui Xiong, Yong Ge, Hengshu Zhu, and Chen Zhu. 2015. Real estate ranking via mixed land-use latent models. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 299--308.

Digital Library

[8]

Yanjie Fu, Hui Xiong, Yong Ge, Zijun Yao, Yu Zheng, and Zhi-Hua Zhou. 2014. Exploiting geographic dependencies for real estate appraisal: a mutual perspective of ranking and clustering. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 1047--1056.

Digital Library

[9]

Yanjie Fu, Hui Xiong, Yong Ge, Yu Zheng, Zijun Yao, and Zhi-Hua Zhou. 2016. Modeling of geographic dependencies for real estate ranking. ACM Transactions on Knowledge Discovery from Data (TKDD) 11, 1 (2016), 11.

[10]

Yoav Goldberg and Omer Levy. 2014. word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method. arXiv preprint arXiv:1402.3722 (2014).

[11]

Palash Goyal and Emilio Ferrara. 2018. Graph embedding techniques, applications, and performance: A survey. Knowledge-Based Systems 151 (2018), 78--94.

[12]

Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 855--864.

Digital Library

[13]

Joseph B Kruskal. 1964. Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrika 29, 1 (1964), 1--27.

[14]

Ni Lao and William W Cohen. 2010. Relational retrieval using a combination of path-constrained random walks. Machine learning 81, 1 (2010), 53--67.

[15]

Ni Lao, Tom Mitchell, and William W Cohen. 2011. Random walk inference and learning in a large scale knowledge base. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 529--539.

Digital Library

[16]

Linyuan Lü and Tao Zhou. 2011. Link prediction in complex networks: A survey. Physica A: statistical mechanics and its applications 390, 6 (2011), 1150--1170.

[17]

Thang Luong, Hieu Pham, and Christopher D Manning. 2015. Bilingual word representations with monolingual quality in mind. In Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing. 151--159.

[18]

Changping Meng, Reynold Cheng, Silviu Maniu, Pierre Senellart, and Wangda Zhang. 2015. Discovering meta-paths in large heterogeneous information networks. In Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 754--764.

Digital Library

[19]

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).

[20]

Tomas Mikolov, Quoc V Le, and Ilya Sutskever. 2013. Exploiting similarities among languages for machine translation. arXiv preprint arXiv:1309.4168 (2013).

[21]

Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). 1532--1543.

[22]

Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 701--710.

Digital Library

[23]

Pushpendre Rastogi, Benjamin Van Durme, and Raman Arora. 2015. Multiview LSA: Representation learning via generalized CCA. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 556--566.

[24]

Yizhou Sun, Rick Barber, Manish Gupta, Charu C Aggarwal, and Jiawei Han. 2011. Co-author relationship prediction in heterogeneous bibliographic networks. In Advances in Social Networks Analysis and Mining (ASONAM), 2011 International Conference on. IEEE, 121--128.

Digital Library

[25]

Yizhou Sun and Jiawei Han. 2012. Mining heterogeneous information networks: principles and methodologies. Synthesis Lectures on Data Mining and Knowledge Discovery 3, 2 (2012), 1--159.

Digital Library

[26]

Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S Yu, and Tianyi Wu. 2011. Pathsim: Meta path-based top-k similarity search in heterogeneous information networks. Proceedings of the VLDB Endowment 4, 11 (2011), 992--1003.

Digital Library

[27]

Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 1067--1077.

Digital Library

[28]

Xiao Wang, Peng Cui, Jing Wang, Jian Pei, Wenwu Zhu, and Shiqiang Yang. 2017. Community Preserving Network Embedding. In AAAI. 203--209.

[29]

Wolfgang Woess. 2000. Random walks on infinite graphs and groups. Vol. 138. Cambridge university press.

[30]

Will Y Zou, Richard Socher, Daniel Cer, and Christopher D Manning. 2013. Bilingual word embeddings for phrase-based machine translation. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. 1393--1398.

Cited By

Chen RLei JYao HLi TLi S(2025)Anchor-Enhanced Geographical Entity Representation LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.332982236:1(924-938)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3329822
Corcoran PSpasić I(2023)Self-Supervised Representation Learning for Geographical Data—A Systematic Literature ReviewISPRS International Journal of Geo-Information10.3390/ijgi1202006412:2(64)Online publication date: 12-Feb-2023
https://doi.org/10.3390/ijgi12020064
Shin YSeong GKim NKim SYoon Y(2023)Understanding Urban Economic Status through GNN-based Urban Representation Learning Using Mobility DataProceedings of the 1st ACM SIGSPATIAL International Workshop on Advances in Urban-AI10.1145/3615900.3628786(71-80)Online publication date: 13-Nov-2023
https://dl.acm.org/doi/10.1145/3615900.3628786
Show More Cited By

Collective Representation Learning on Spatiotemporal Heterogeneous Information Networks
1. Information systems
  1. Information systems applications

Recommendations

Bilingual embeddings with random walks over multilingual wordnets
Abstract
Bilingual word embeddings represent words of two languages in the same space, and allow to transfer knowledge from one language to the other without machine translation. The main approach is to train monolingual embeddings first and ...
Representation Learning on Heterogeneous Spatiotemporal Networks
A Deep Spatiotemporal Trajectory Representation Learning Framework for Clustering
Learning trajectory representations is essential in many Location Based Services (LBS) applications. Most traditional methods extract trajectory representations based on manually defined features, while deep learning-based methods can reduce part of the ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGSPATIAL '19: Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

November 2019

648 pages

ISBN:9781450369091

DOI:10.1145/3347146

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGSPATIAL: ACM Special Interest Group on Spatial Information

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 November 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

SIGSPATIAL '19

Sponsor:

SIGSPATIAL

SIGSPATIAL '19: 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

November 5 - 8, 2019

IL, Chicago, USA

Acceptance Rates

SIGSPATIAL '19 Paper Acceptance Rate 34 of 161 submissions, 21%;

Overall Acceptance Rate 257 of 1,238 submissions, 21%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

9
Total Citations
View Citations
248
Total Downloads

Downloads (Last 12 months)10
Downloads (Last 6 weeks)1

Reflects downloads up to 15 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Chen RLei JYao HLi TLi S(2025)Anchor-Enhanced Geographical Entity Representation LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.332982236:1(924-938)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3329822
Corcoran PSpasić I(2023)Self-Supervised Representation Learning for Geographical Data—A Systematic Literature ReviewISPRS International Journal of Geo-Information10.3390/ijgi1202006412:2(64)Online publication date: 12-Feb-2023
https://doi.org/10.3390/ijgi12020064
Shin YSeong GKim NKim SYoon Y(2023)Understanding Urban Economic Status through GNN-based Urban Representation Learning Using Mobility DataProceedings of the 1st ACM SIGSPATIAL International Workshop on Advances in Urban-AI10.1145/3615900.3628786(71-80)Online publication date: 13-Nov-2023
https://dl.acm.org/doi/10.1145/3615900.3628786
Li TXi YWang HLi YTarkoma SHui P(2023)Learning Representations of Satellite Imagery by Leveraging Point-of-InterestsACM Transactions on Intelligent Systems and Technology10.1145/358934414:4(1-32)Online publication date: 8-May-2023
https://dl.acm.org/doi/10.1145/3589344
Wang DFu YLiu KChen FWang PLu C(2023)Automated Urban Planning for Reimagining City Configuration via Adversarial Learning: Quantification, Generation, and EvaluationACM Transactions on Spatial Algorithms and Systems10.1145/35243029:1(1-24)Online publication date: 17-Jan-2023
https://dl.acm.org/doi/10.1145/3524302
Li HLu HJensen CTang BCheema M(2022)Spatial Data Quality in the Internet of Things: Management, Exploitation, and ProspectsACM Computing Surveys10.1145/349833855:3(1-41)Online publication date: 3-Feb-2022
https://dl.acm.org/doi/10.1145/3498338
Fang LZhang LWu HXu TZhou DChen E(2021)Patent2Vec: Multi-view representation learning on patent-graphs for patent classificationWorld Wide Web10.1007/s11280-021-00885-4Online publication date: 16-Jun-2021
https://doi.org/10.1007/s11280-021-00885-4
Damiani MAcquaviva AHachem FRossini M(2020)Learning Behavioral Representations of Human MobilityProceedings of the 28th International Conference on Advances in Geographic Information Systems10.1145/3397536.3422255(367-376)Online publication date: 3-Nov-2020
https://dl.acm.org/doi/10.1145/3397536.3422255
Keerthi Chandra DWang PLeopold JFu Yd'Aquin MDietze SHauff CCurry ECudre Mauroux P(2020)Collective Embedding with Feature Importance: A Unified Approach for Spatiotemporal Network EmbeddingProceedings of the 29th ACM International Conference on Information & Knowledge Management10.1145/3340531.3412030(615-624)Online publication date: 19-Oct-2020
https://dl.acm.org/doi/10.1145/3340531.3412030

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten