skip to main content
10.1145/3472720.3483621acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
research-article

Random-Walk Graph Embeddings and the Influence of Edge Weighting Strategies in Community Detection Tasks

Published: 28 October 2021 Publication History

Abstract

Graph embedding methods have been developed over recent years with the goal of mapping graph data structures into low dimensional vector spaces so that conventional machine learning tasks can be efficiently evaluated. In particular, random walk based methods sample the graph using random walk sequences that capture a graph's structural properties. In this work, we study the influence of edge weighting strategies that bias the random walk process and we are able to demonstrate that under several settings the biased random walks enhance downstream community detection tasks.

References

[1]
Lada A Adamic and Eytan Adar. 2003. Friends and neighbors on the web. Social networks, Vol. 25, 3 (2003), 211--230.
[2]
Smriti Bhagat, Graham Cormode, and S Muthukrishnan. 2011. Node classification in social networks. In Social network data analytics . Springer, 115--148.
[3]
Christopher Bishop. 2006. Pattern Recognition and Machine Learning. Pattern Recognition and Machine Learning (2006).
[4]
Pasquale De Meo, Emilio Ferrara, Giacomo Fiumara, and Alessandro Provetti. 2011. Generalized louvain method for community detection in large networks. In 2011 11th international conference on intelligent systems design and applications. IEEE, 88--93.
[5]
Pasquale De Meo, Emilio Ferrara, Giacomo Fiumara, and Angela Ricciardello. 2012. A novel measure of edge centrality in social networks. Knowledge-based systems, Vol. 30 (2012), 136--150.
[6]
Santo Fortunato and Marc Barthelemy. 2007. Resolution limit in community detection. Proceedings of the national academy of sciences, Vol. 104, 1 (2007), 36--41.
[7]
Palash Goyal and Emilio Ferrara. 2018. Graph embedding techniques, applications, and performance: A survey. Knowledge-Based Systems, Vol. 151 (2018), 78--94.
[8]
Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable Feature Learning for Networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, August 13--17, 2016. ACM, 855--864.
[9]
Muhammad Aqib Javed, Muhammad Shahzad Younis, Siddique Latif, Junaid Qadir, and Adeel Baig. 2018. Community detection in networks: A multidisciplinary review. Journal of Network and Computer Applications, Vol. 108 (2018), 87--111.
[10]
Glen Jeh and Jennifer Widom. 2002. SimRank: a measure of structural-context similarity. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining . 538--543.
[11]
Di Jin, Zhizhi Yu, Pengfei Jiao, Shirui Pan, Philip S Yu, and Weixiong Zhang. 2021. A survey of community detection approaches: From statistical modeling to deep learning. arXiv preprint arXiv:2101.01669 (2021).
[12]
Alireza Khadivi, Ali Ajdari Rad, and Martin Hasler. 2011. Network community-detection enhancement by proper weighting. Physical Review E, Vol. 83, 4 (2011), 046104.
[13]
Andrea Lancichinetti, Santo Fortunato, and Filippo Radicchi. 2008. Benchmark graphs for testing community detection algorithms. Physical review E, Vol. 78, 4 (2008), 046110.
[14]
Jure Leskovec and Andrej Krevl. 2014. SNAP Datasets: Stanford Large Network Dataset Collection. http://snap.stanford.edu/data .
[15]
Linyuan Lü and Tao Zhou. 2011. Link prediction in complex networks: A survey. Physica A: statistical mechanics and its applications, Vol. 390, 6 (2011), 1150--1170.
[16]
Xiaoyan Lu, Konstantin Kuzmin, Mingming Chen, and Boleslaw K Szymanski. 2018. Adaptive modularity maximization via edge weighting scheme. Information Sciences, Vol. 424 (2018), 55--68.
[17]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013a. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).
[18]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013b. Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems, Vol. 26 (2013), 3111--3119.
[19]
Symeon Papadopoulos, Yiannis Kompatsiaris, Athena Vakali, and Ploutarchos Spyridonos. 2012. Community detection in social media. Data Mining and Knowledge Discovery, Vol. 24, 3 (2012), 515--554.
[20]
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: online learning of social representations. In The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '14, New York, NY, USA - August 24 - 27, 2014. ACM, 701--710.
[21]
Nguyen Xuan Vinh, Julien Epps, and James Bailey. 2010. Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance. The Journal of Machine Learning Research, Vol. 11 (2010), 2837--2854.
[22]
Bernard L Welch. 1947. The generalization of ?Student's' problem when several different population variances are involved. Biometrika, Vol. 34, 1--2 (1947), 28--35.
[23]
Jaewon Yang and Jure Leskovec. 2015. Defining and evaluating network communities based on ground-truth. Knowledge and Information Systems, Vol. 42, 1 (2015), 181--213.

Cited By

View all
  • (2023)Modified SkipGram Negative Sampling Model for Faster Convergence of Graph EmbeddingDeep Learning Theory and Applications10.1007/978-3-031-37317-6_1(1-16)Online publication date: 7-Jul-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
OASIS '21: Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks
October 2021
44 pages
ISBN:9781450386326
DOI:10.1145/3472720
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 October 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. community detection
  2. deepwalk
  3. edge weighting
  4. latent representation
  5. node2vec

Qualifiers

  • Research-article

Funding Sources

Conference

HT '21
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Modified SkipGram Negative Sampling Model for Faster Convergence of Graph EmbeddingDeep Learning Theory and Applications10.1007/978-3-031-37317-6_1(1-16)Online publication date: 7-Jul-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media