Edge Representation Learning for Community Detection in Large Scale Information Networks

Li, Suxue; Zhang, Haixia; Wu, Dalei; Zhang, Chuanting; Yuan, Dongfeng

doi:10.1007/978-3-319-73521-4_4

Suxue Li¹⁷,
Haixia Zhang¹⁷,
Dalei Wu¹⁸,
Chuanting Zhang¹⁷ &
…
Dongfeng Yuan¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10731))

Included in the following conference series:

International Workshop on Mobility Analytics for Spatio-temporal and Social Data

1054 Accesses
1 Citations

Abstract

It is found that networks in real world divide naturally into communities or modules. Many community detection algorithms have been developed to uncover the community structure in networks. However, most of them focus on non-overlapping communities and the applicability of these work is limited when it comes to real world networks, which inherently are overlapping in most cases, e.g. Facebook and Weibo. In this paper, we propose an overlapping community detection algorithm based on edge representation learning. Firstly, we sample a series of edge sequences using random walks on graph, then a mapping function from edge to feature vectors is automatically learned in an unsupervised way. At last we employ the traditional clustering algorithms, e.g. K-means and its variants, on the learned representations to carry out community detection. To demonstrate the effectiveness of our proposed method, extensive experiments are conducted on a group of synthetic networks and two real world networks with ground truth. Experiment results show that our proposed method outperforms traditional algorithms in terms of evaluation metrics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 44.99; Price excludes VAT (USA)

Softcover Book: USD 60.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Adhikari, B., Zhang, Y., Ramakrishnan, N., Prakash, B.A.: Distributed representation of subgraphs. arXiv preprint arXiv:1702.06921 (2017)
Bansal, S., Bhowmick, S., Paymal, P.: Fast community detection for dynamic complex networks. In: da F. Costa, L., Evsukoff, A., Mangioni, G., Menezes, R. (eds.) CompleNet 2010. CCIS, vol. 116, pp. 196–207. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-25501-4_20
Chapter Google Scholar
Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008(10), P10008 (2008)
Article Google Scholar
Danon, L., Diaz-Guilera, A., Duch, J., Arenas, A.: Comparing community structure identification. J. Stat. Mech. Theory Exp. 2005(09), P09008 (2005)
Article Google Scholar
Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3), 75–174 (2010)
Article MathSciNet Google Scholar
Fortunato, S., Hric, D.: Community detection in networks: a user guide. Phys. Rep. 659, 1–44 (2016)
Article MathSciNet Google Scholar
Grover, A., Leskovec, J.: node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 855–864. ACM (2016)
Google Scholar
Jain, A.K.: Data clustering: 50 years beyond k-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)
Article Google Scholar
Kernighan, B.W., Lin, S.: An efficient heuristic procedure for partitioning graphs. Bell Syst. Tech. J. 49(2), 291–307 (1970)
Article MATH Google Scholar
Lancichinetti, A., Fortunato, S., Radicchi, F.: Benchmark graphs for testing community detection algorithms. Phys. Rev. E 78(4), 046110 (2008)
Article Google Scholar
Leal, T.P., Goncalves, A.C., Vieira, V.d.F., Xavier, C.R.: Decode-differential evolution algorithm for community detection. In: 2013 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 4635–4640. IEEE (2013)
Google Scholar
Leskovec, J., Krevl, A.: SNAP Datasets: Stanford large network dataset collection, June 2014. http://snap.stanford.edu/data
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Newman, M.E.: The structure and function of complex networks. SIAM Rev. 45(2), 167–256 (2003)
Article MathSciNet MATH Google Scholar
Newman, M.E.: Finding community structure in networks using the eigenvectors of matrices. Phys. Rev. E 74(3), 036104 (2006)
Article MathSciNet Google Scholar
Newman, M.E.: Modularity and community structure in networks. Proc. Nat. Acad. Sci. 103(23), 8577–8582 (2006)
Article Google Scholar
Perozzi, B., Al-Rfou, R., Skiena, S.: Deepwalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 701–710. ACM (2014)
Google Scholar
Raghavan, U.N., Albert, R., Kumara, S.: Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76(3), 036106 (2007)
Article Google Scholar
Rosenberg, A., Hirschberg, J.: V-measure: a conditional entropy-based external cluster evaluation measure. EMNLP-CoNLL 7, 410–420 (2007)
Google Scholar
Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: Line: large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web, pp. 1067–1077. ACM (2015)
Google Scholar
Vinh, N.X., Epps, J., Bailey, J.: Information theoretic measures for clusterings comparison: is a correction for chance necessary? In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 1073–1080. ACM (2009)
Google Scholar
Wang, S., Tang, J., Aggarwal, C., Chang, Y., Liu, H.: Signed network embedding in social media. In: SDM (2017)
Google Scholar
Yang, L., Cao, X., Jin, D., Wang, X., Meng, D.: A unified semi-supervised community detection framework using latent space graph regularization. IEEE Trans. Cybern. 45(11), 2585–2598 (2015)
Article Google Scholar
Zachary, W.W.: An information flow model for conflict and fission in small groups. J. Anthropol. Res. 33(4), 452–473 (1977)
Article Google Scholar

Download references

Acknowledgments

The work presented in this paper was supported in part by the Special Project for Independent Innovation and Achievement Transformation of Shandong Province (2013ZHZX2C0102, 2014ZZCX03401).

Author information

Authors and Affiliations

Shandong Provincial Key Laboratory of Wireless Communication Technologies, Shandong University, Jinan, 250100, Shandong, China
Suxue Li, Haixia Zhang, Chuanting Zhang & Dongfeng Yuan
The University of Tennessee at Chattanooga, Chattanooga, TN, 37403, USA
Dalei Wu

Authors

Suxue Li
View author publications
You can also search for this author in PubMed Google Scholar
Haixia Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Dalei Wu
View author publications
You can also search for this author in PubMed Google Scholar
Chuanting Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Dongfeng Yuan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dongfeng Yuan .

Editor information

Editors and Affiliations

University of Piraeus, Piraeus, Greece
Christos Doulkeridis
University of Piraeus, Piraeus, Greece
George A. Vouros
Shenzhen Institutes of Advanced Technology, Shenzhen, China
Qiang Qu
Institute of Computing Technology, Beijing, China
Shuhui Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, S., Zhang, H., Wu, D., Zhang, C., Yuan, D. (2018). Edge Representation Learning for Community Detection in Large Scale Information Networks. In: Doulkeridis, C., Vouros, G., Qu, Q., Wang, S. (eds) Mobility Analytics for Spatio-Temporal and Social Data. MATES 2017. Lecture Notes in Computer Science(), vol 10731. Springer, Cham. https://doi.org/10.1007/978-3-319-73521-4_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-73521-4_4
Published: 28 December 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73520-7
Online ISBN: 978-3-319-73521-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics