Skip to main content
Log in

Unifying community detection and network embedding in attributed networks

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Traditionally, community detection and network embedding are two separate tasks. Network embedding aims to output a vector representation for each node in the network, and community detection aims to find all densely connected groups of nodes and well separate them from others. Most of the existing approaches do community detection and network embedding in a separate manner, and ignore node attributes information, which leads to poor results. In this paper, we propose a novel model that jointly solves the network embedding and community detection problems together. The model can make use of the network local information, the global information and node attributes information collaboratively. We empirically show that by jointly solving these two problems together, the model can greatly improve the ability of community detection, but also learn better network embedding than the advanced baseline methods. We evaluate the proposed model on several datasets, and the experimental results have shown the effectiveness and advancement of our model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. https://linqs.soe.ucsc.edu/data.

  2. http://mlg.ucd.ie/networks/.

References

  1. Girvan M, Newman ME (2002) Community structure in social and biological networks. Proc Natl Acad Sci U S A 99(12):7821–7826

    Article  MathSciNet  Google Scholar 

  2. Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E Stat Nonlinear Soft Matter Phys 69(2):026113

    Article  Google Scholar 

  3. Blondel VD, Guillaume JL, Lambiotte R et al (2008) Fast unfolding of communities in large networks. J Stat Mech 2008(10):155–168

    Article  Google Scholar 

  4. Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: online learning of social representations. In: The ACM SIGKDD International Conference, pp 701–710. ACM

  5. Tang J, Qu M, Wang M, Zhang M, Yan J, Mei Q (2015) Line: large-scale information network embedding. In: International Conference on World Wide Web, pp. 1067–1077

  6. Grover A, Leskovec J (2016) node2vec: scalable feature learning for networks. In: The ACM SIGKDD International Conference

  7. Xun G (2017) Collaboratively improving topic discovery and word embeddings by coordinating global and local contexts. In: ACM Sigkdd International Conference ACM

  8. Yang C, Liu Z, Zhao D, Sun M, Chang EY (2015) Network representation learning with rich text information. In: International Conference on Artificial Intelligence, pp 2111–2117

  9. Wang D, Li T, Zhu S, Ding C (2008) Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization. In: Proceedings of the 31st international ACM SIGIR conference on Research and development in information retrieval, pp 307–314. ACM

  10. Yang Z; Hao T, Dikmen O, Chen X, Oja E (2012) Clustering by nonnegative matrix factorization using graph random walk. In: Advances in Neural Information Processing Systems, pp 1079–1087

  11. Wang X, Jin D, Cao X, Yang L, Zhang W (2016) Semantic community identification in large attribute networks. In: Thirtieth AAAI Conference on Artificial Intelligence

  12. Meng W, Chaokun W, Jeffrey XY, Jun Z (2015) Community detection in social networks: an in-depth benchmarking study with a procedure-oriented framework. In: PVLDB 8, 10 (June 2015), pp 998–1009

  13. Wang F, Li T, Wang X et al (2011) Community discovery using nonnegative matrix factorization. Data Min Knowl Disc 22(3):493–521

    Article  MathSciNet  Google Scholar 

  14. Yang J, Mcauley J, Leskovec J (2014) Community detection in networks with node attributes

  15. Karrer B, Newman MEJ (2011) Stochastic blockmodels and community structure in networks. Phys Rev E Stat Nonlinear Soft Matter Phys 83(2):016107

    Article  MathSciNet  Google Scholar 

  16. Yang Z, Hao T, Dikmen O et al (2012) Clustering by nonnegative matrix factorization using graph random walk. In: International Conference on Neural Information Processing Systems. Curran Associates Inc

  17. Mrinmaya S, Avinava D, Shashank S, Eric PX, Eduard H (2014) Spatial compactness meets topical consistency: jointly modeling links and content for community detection. In: WSDM, pp 503–512

  18. Sun Y, Aggarwal CC, Han J (2012) Relation strength-aware clustering of heterogeneous information networks with incomplete attributes. In: PVLDB 5, 5 (Jan. 2012), pp 394–405

  19. Cai HY, Zheng VW, Zhu F, Chen-Chuan Chang K, Huang Z (2017) From community detection to community profiling. In: PVLDB 10, 7 (2017), pp 817–828

  20. Xie J, Kelley K, Szymanski BK (2013) Overlapping community detection in networks: the state-of-the-art and comparative study. In: ACM CSUR 45, 4 (2013), 43:1–43:35

  21. Atzmueller M, Doerfel S, Mitzlaff F (2016) Description-oriented community detection using exhaustive subgroup discovery. Inform Sci 329(2):965–984

    Article  Google Scholar 

  22. He D, Feng Z, Jin, D, Wang X, and Zhang W (2017) Joint identification of network communities and semantics via integrative modeling of network topologies and node contents. In: AAAI 2017, pp 116–124

  23. Kozdoba M, Mannor S (2015) Community detection via measure space embedding. In: NIPS. Pp 2890–2898

  24. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: NIPS, pp 3111–3119

  25. Belkin M, Niyogi P (2003) Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput 15(6):1373–1396

    Article  Google Scholar 

  26. Li P, Hastie TJ, Church KE (2006) Very sparse random projections. In: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 287–296

  27. Hu Q, Xie S, Lin S, Wang S, Philip SY (2016) Clustering embedded approaches for efficient information network inference. Data Sci Eng 1(1):29–40

    Article  Google Scholar 

  28. Cheng W, Greaves C, Warren M (2006) From n-gram to skip-gram to concgram. Int J Corp Linguist 11(4):411–433

    Article  Google Scholar 

  29. Cao S, Lu W, Xu Q (2015) Grarep: learning graph representations with global structural information. In: KDD

  30. Wang D, Cui P, Zhu W (2016) Structural deep network embedding In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, pp 1225–1234

  31. Bruna J, Zaremba W, Szlam A, LeCun Y (2014) Spectral networks and locally connected networks on graphs. In: ICLR

  32. Kipf TN, Welling M (2016) Variational graph auto-encoders. ArXiv:1611.07308

  33. Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: ICLR

  34. Huang X, Li J, Hu X (2017) Label Informed Attributed Network Embedding. In: WSDM. ACM, pp 731–739

  35. Liang J, Jacobs P, Sun J et al (2018) Semi-supervised embedding in attributed networks with outliers

  36. Zhang Z, Yang H, Bu J, Zhou S, Yu P, Zhang J, Ester M, Wang C (2018) ANRL: Attributed network representation learning via deep neural networks. In: IJCAI 2018

  37. Gao H, Huang H (2018) Deep attributed network embedding. In: IJCAI 2018

  38. Wu W, Li B, Chen L, Zhang C (2018) Efficient attributed network embedding via recursive randomized hashing. IJCAI 2018

  39. Liu J, He Z, Wei L, Huang Y (2018) Content to node: self-translation network embedding. In: SIGKDD, 2018

  40. Tian F, Gao B, Cui Q, Chen E, Liu T-Y (2014) Learning Deep Representations for Graph Clustering. In: AAAI, pp 1293–1299

  41. Wang X, Cui P, Wang J, Pei J, Zhu W, Yang S (2017) Community preserving network embedding. In: AAAI 2017

  42. Cavallari S, Zheng VW, Cai H et al (2017) Learning community embedding with community detection and node embedding on graphs. In: ACM on Conference on Information and Knowledge Management. ACM, 2017, pp 377–386

  43. Tu C, Zeng X, Wang H, et al. (2016) A unified framework for community detection and network representation learning. IEEE Trans Knowl Data Eng PP(99):1–1.

  44. Li Y, Sha C, Huang X, Zhang Y (2018) Community detection in attributed graphs: an embedding approach. In: AAAI 2018

  45. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119

  46. Bojchevski A, Gunnemann S (2018) Bayesian robust attributed graph clustering: joint learning of partial anomalies and group structure. In: AAAI 2018

  47. Fawcett T (2006) An introduction to roc analysis. Pattern Recogn Lett 27(8):861–874

    Article  MathSciNet  Google Scholar 

  48. van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. JMLR 9(2008):2579–2605

    MATH  Google Scholar 

Download references

Acknowledgements

Our work is supported by the National Key Research Development Program of China (No. 2017YFB0802800). The authors would like to thank the Editor-in-Chief and anonymous reviewers for their insightful and constructive commendations that have led to an improved version of this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hao Wei.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ding, Y., Wei, H., Hu, G. et al. Unifying community detection and network embedding in attributed networks. Knowl Inf Syst 63, 1221–1239 (2021). https://doi.org/10.1007/s10115-021-01557-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-021-01557-5

Keywords

Navigation