Abstract
Traditionally, community detection and network embedding are two separate tasks. Network embedding aims to output a vector representation for each node in the network, and community detection aims to find all densely connected groups of nodes and well separate them from others. Most of the existing approaches do community detection and network embedding in a separate manner, and ignore node attributes information, which leads to poor results. In this paper, we propose a novel model that jointly solves the network embedding and community detection problems together. The model can make use of the network local information, the global information and node attributes information collaboratively. We empirically show that by jointly solving these two problems together, the model can greatly improve the ability of community detection, but also learn better network embedding than the advanced baseline methods. We evaluate the proposed model on several datasets, and the experimental results have shown the effectiveness and advancement of our model.
Similar content being viewed by others
References
Girvan M, Newman ME (2002) Community structure in social and biological networks. Proc Natl Acad Sci U S A 99(12):7821–7826
Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E Stat Nonlinear Soft Matter Phys 69(2):026113
Blondel VD, Guillaume JL, Lambiotte R et al (2008) Fast unfolding of communities in large networks. J Stat Mech 2008(10):155–168
Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: online learning of social representations. In: The ACM SIGKDD International Conference, pp 701–710. ACM
Tang J, Qu M, Wang M, Zhang M, Yan J, Mei Q (2015) Line: large-scale information network embedding. In: International Conference on World Wide Web, pp. 1067–1077
Grover A, Leskovec J (2016) node2vec: scalable feature learning for networks. In: The ACM SIGKDD International Conference
Xun G (2017) Collaboratively improving topic discovery and word embeddings by coordinating global and local contexts. In: ACM Sigkdd International Conference ACM
Yang C, Liu Z, Zhao D, Sun M, Chang EY (2015) Network representation learning with rich text information. In: International Conference on Artificial Intelligence, pp 2111–2117
Wang D, Li T, Zhu S, Ding C (2008) Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization. In: Proceedings of the 31st international ACM SIGIR conference on Research and development in information retrieval, pp 307–314. ACM
Yang Z; Hao T, Dikmen O, Chen X, Oja E (2012) Clustering by nonnegative matrix factorization using graph random walk. In: Advances in Neural Information Processing Systems, pp 1079–1087
Wang X, Jin D, Cao X, Yang L, Zhang W (2016) Semantic community identification in large attribute networks. In: Thirtieth AAAI Conference on Artificial Intelligence
Meng W, Chaokun W, Jeffrey XY, Jun Z (2015) Community detection in social networks: an in-depth benchmarking study with a procedure-oriented framework. In: PVLDB 8, 10 (June 2015), pp 998–1009
Wang F, Li T, Wang X et al (2011) Community discovery using nonnegative matrix factorization. Data Min Knowl Disc 22(3):493–521
Yang J, Mcauley J, Leskovec J (2014) Community detection in networks with node attributes
Karrer B, Newman MEJ (2011) Stochastic blockmodels and community structure in networks. Phys Rev E Stat Nonlinear Soft Matter Phys 83(2):016107
Yang Z, Hao T, Dikmen O et al (2012) Clustering by nonnegative matrix factorization using graph random walk. In: International Conference on Neural Information Processing Systems. Curran Associates Inc
Mrinmaya S, Avinava D, Shashank S, Eric PX, Eduard H (2014) Spatial compactness meets topical consistency: jointly modeling links and content for community detection. In: WSDM, pp 503–512
Sun Y, Aggarwal CC, Han J (2012) Relation strength-aware clustering of heterogeneous information networks with incomplete attributes. In: PVLDB 5, 5 (Jan. 2012), pp 394–405
Cai HY, Zheng VW, Zhu F, Chen-Chuan Chang K, Huang Z (2017) From community detection to community profiling. In: PVLDB 10, 7 (2017), pp 817–828
Xie J, Kelley K, Szymanski BK (2013) Overlapping community detection in networks: the state-of-the-art and comparative study. In: ACM CSUR 45, 4 (2013), 43:1–43:35
Atzmueller M, Doerfel S, Mitzlaff F (2016) Description-oriented community detection using exhaustive subgroup discovery. Inform Sci 329(2):965–984
He D, Feng Z, Jin, D, Wang X, and Zhang W (2017) Joint identification of network communities and semantics via integrative modeling of network topologies and node contents. In: AAAI 2017, pp 116–124
Kozdoba M, Mannor S (2015) Community detection via measure space embedding. In: NIPS. Pp 2890–2898
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: NIPS, pp 3111–3119
Belkin M, Niyogi P (2003) Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput 15(6):1373–1396
Li P, Hastie TJ, Church KE (2006) Very sparse random projections. In: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 287–296
Hu Q, Xie S, Lin S, Wang S, Philip SY (2016) Clustering embedded approaches for efficient information network inference. Data Sci Eng 1(1):29–40
Cheng W, Greaves C, Warren M (2006) From n-gram to skip-gram to concgram. Int J Corp Linguist 11(4):411–433
Cao S, Lu W, Xu Q (2015) Grarep: learning graph representations with global structural information. In: KDD
Wang D, Cui P, Zhu W (2016) Structural deep network embedding In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, pp 1225–1234
Bruna J, Zaremba W, Szlam A, LeCun Y (2014) Spectral networks and locally connected networks on graphs. In: ICLR
Kipf TN, Welling M (2016) Variational graph auto-encoders. ArXiv:1611.07308
Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: ICLR
Huang X, Li J, Hu X (2017) Label Informed Attributed Network Embedding. In: WSDM. ACM, pp 731–739
Liang J, Jacobs P, Sun J et al (2018) Semi-supervised embedding in attributed networks with outliers
Zhang Z, Yang H, Bu J, Zhou S, Yu P, Zhang J, Ester M, Wang C (2018) ANRL: Attributed network representation learning via deep neural networks. In: IJCAI 2018
Gao H, Huang H (2018) Deep attributed network embedding. In: IJCAI 2018
Wu W, Li B, Chen L, Zhang C (2018) Efficient attributed network embedding via recursive randomized hashing. IJCAI 2018
Liu J, He Z, Wei L, Huang Y (2018) Content to node: self-translation network embedding. In: SIGKDD, 2018
Tian F, Gao B, Cui Q, Chen E, Liu T-Y (2014) Learning Deep Representations for Graph Clustering. In: AAAI, pp 1293–1299
Wang X, Cui P, Wang J, Pei J, Zhu W, Yang S (2017) Community preserving network embedding. In: AAAI 2017
Cavallari S, Zheng VW, Cai H et al (2017) Learning community embedding with community detection and node embedding on graphs. In: ACM on Conference on Information and Knowledge Management. ACM, 2017, pp 377–386
Tu C, Zeng X, Wang H, et al. (2016) A unified framework for community detection and network representation learning. IEEE Trans Knowl Data Eng PP(99):1–1.
Li Y, Sha C, Huang X, Zhang Y (2018) Community detection in attributed graphs: an embedding approach. In: AAAI 2018
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119
Bojchevski A, Gunnemann S (2018) Bayesian robust attributed graph clustering: joint learning of partial anomalies and group structure. In: AAAI 2018
Fawcett T (2006) An introduction to roc analysis. Pattern Recogn Lett 27(8):861–874
van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. JMLR 9(2008):2579–2605
Acknowledgements
Our work is supported by the National Key Research Development Program of China (No. 2017YFB0802800). The authors would like to thank the Editor-in-Chief and anonymous reviewers for their insightful and constructive commendations that have led to an improved version of this paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ding, Y., Wei, H., Hu, G. et al. Unifying community detection and network embedding in attributed networks. Knowl Inf Syst 63, 1221–1239 (2021). https://doi.org/10.1007/s10115-021-01557-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-021-01557-5