Is the simple assignment enough? Exploring the interpretability for community detection

Zhao, Qiqi; Ma, Huifang; Li, Xiaohong; Li, Zhixin

doi:10.1007/s13042-021-01384-8

Is the simple assignment enough? Exploring the interpretability for community detection

Original Article
Published: 07 August 2021

Volume 12, pages 3463–3474, (2021)
Cite this article

International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Qiqi Zhao¹,
Huifang Ma ORCID: orcid.org/0000-0002-5104-8982^1,2,
Xiaohong Li¹ &
…
Zhixin Li²

237 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

The maximum likelihood estimation is a probabilistic inferencing model of community connectivity in large networks. In general, only the adjacency matrix is utilized to perform community structure parameter inference. Although there are recent examples that combine connectivity and attribute information for community detection, our model is an enhanced overlapping community detection model that combines adjacency spectral embedding with maximum likelihood estimation. This provides the flexibility of complex networks to increase connectivity information through measurements from attribute embedding. The attribute information can be effectively captured and transformed by attribute embedding to encode the combination with structure information. Then, the link strength among communities is designed to adjust the impact of these structural information on community generation based on the contribution of the structure to the clusters, and the node assignment allow for the nature of the real network (overlapping and outliers). Experiments highlight attributed networks in which attributed community detection task provides satisfactory performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Community detection in attributed networks using neighborhood information

Article 10 April 2024

A novel method of spectral clustering in attributed networks by constructing parameter-free affinity matrix

Article 17 November 2021

Community detection in attributed networks considering both structural and attribute similarities: two mathematical programming approaches

Article 20 February 2019

Notes

Throughout text, we use the words {{nodes, vertices, objects}, {characteristic, intuition, nature, property}, {circles, cluster, communities}} interchangeably.
http://snap.stanford.edu/data
http://snap.stanford.edu/data/com-Youtube.html.

References

Peng C, Zhang Z, Wong KC et al (2017) A scalable community detection algorithm for large graphs using stochastic block models. Intell Data Anal 21(6):1463–1485
Article Google Scholar
Yang J, McAuley J, Leskovec J (2013) Community detection in networks with node attributes. IEEE 13th international conference on data mining. IEEE, 2013 pp 1151–1156
Zhang F, Zhang Y, Qin L et al (2016) When engagement meets similarity: efficient (k, r)-core computation on social networks[J]. arXiv preprint arXiv:611.03254.
Whang JJ, Gleich DF, Dhillon IS (2016) Overlapping community detection using neighborhood-inflated seed expansion. IEEE Trans Knowl Data Eng 28(5):1272–1284
Article Google Scholar
Yang J, Leskovec J (2013) Overlapping community detection at scale: a nonnegative matrix factorization approach. Proceedings of the sixth ACM international conference on web search and data mining. 2013:587–596
You X, Ma Y, Liu Z (2020) A three-stage algorithm on community detection in social networks. Knowl-Based Syst 187:104822
Article Google Scholar
Pan X, Xu G, Wang B et al (2019) A novel community detection algorithm based on local similarity of clustering coefficient in social networks. IEEE Access 7:121586–121598
Article Google Scholar
Li J, Kumar CA, Mei C et al (2017) Comparison of reduction in formal decision contexts. Int J Approximate Reasoning 80:100–122
Article MathSciNet Google Scholar
Xia S, Zhang Z, Li W et al (2020) GBNRS: a novel rough set algorithm for fast adaptive attribute reduction in classification. IEEE Trans Knowl Data Eng pp 1–1
Zhang H, Wang CD, Lai JH et al (2019) Community detection using multilayer edge mixture model. Knowl Inform Syst 60(2):757–779
Article Google Scholar
Whang JJ, Hou Y, Gleich DF et al (2018) Non-exhaustive, overlapping clustering. IEEE trans pattern anal mach intell 41(11):2644–2659
Article Google Scholar
Xia S, Peng D, Meng D et al (2020) A fast adaptive k-means with no bounds. IEEE Trans Pattern Anal Mach Intell 1–1
Xia S, Chen B, Wang G et al (2021) mCRF and mRD: two classification methods based on a novel multiclass label noise filtering learning framework. IEEE Trans Neural Networks Learn Syst 1–15
Xia S, Liu Y, Ding X et al (2019) Granular ball computing classifiers for efficient, scalable and robust learning. Inform Sci 483:136–152
Article MathSciNet Google Scholar
Liu Z, Xiang B, Guo W et al (2019) Overlapping community detection algorithm based on coarsening and local overlapping modularity. IEEE Access 7:57943–57955
Article Google Scholar
Zhe C, Sun A, Xiao X (2019) Community detection on large complex attribute network. Proceedings of the 25th ACM SIGKDD int conf knowl discov & data min 2041–2049
Krogan NJ, Cagney G, Yu H et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 440(7084):637–643
Article Google Scholar
Whang JJ, Dhillon IS, Gleich DF (2015) Non-exhaustive, overlapping k-means. Proceedings of the 2015 SIAM international conference on data mining Soc Ind and Appl Math 936–944
Rodríguez CE, Núr̃ez-Antonio G, Escarela G (2020) A Bayesian mixture model for clustering circular data. Comput Statistics Data Anal, 143:106842
Article MathSciNet Google Scholar
Guo X, Su J, Zhou H et al (2019) Community detection based on genetic algorithm using local structural similarity. IEEE Access 7:134583–134600
Article Google Scholar
Ma X, Yang P, Guan S (2019) Overlapping community detection algorithm based on edge strength. IEEE Access 7:126642–126650
Article Google Scholar
Bertsimas D, Nohadani O (2019) Robust maximum likelihood estimation. INFORMS J Comput 31(3):445–458
Article MathSciNet Google Scholar
Sussman DL, Tang M, Fishkind DE et al (2012) A consistent adjacency spectral embedding for stochastic blockmodel graphs. J Am Stat Assoc 107(499):1119–1128
Article MathSciNet Google Scholar
Perozzi B, Akoglu L (2016) Scalable anomaly ranking of attributed neighborhoods. Proceedings of the 2016 SIAM International Conference on Data Mining. Soc Ind Appl Math 207–215
Zhao Q, Ma H, Li X et al (2020) NotMle: community detection in an inference way. 2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI). IEEE 736–741
Newman MEJ (2012) Communities, modules and large-scale structure in networks. Nature phys 8(1):25–31
Article Google Scholar
Mehta N, Duke L C, Rai P (2019) Stochastic blockmodels meet graph neural networks. Int Conf Mach Learn. PMLR, 4466–4474
Qiao M, Yu J, Bian W et al (2018) Adapting stochastic block models to power-law degree distributions. IEEE trans cybern, 49(2):626–637
Article Google Scholar
Airoldi E M, Blei DM, Fienberg SE et al (2008) Mixed membership stochastic blockmodels. J mach learn res 9(2008):1981–2014
Cai X, Huang D, Wang CD et al (2020) Spectral clustering by subspace randomization and graph fusion for high-dimensional data. Adv Knowl Discov Data Min 12084:330–342
Zachary WW (1977) An information flow model for conflict and fission in small groups. J anthropol res 33(4):452–473
Article Google Scholar
You X, Ma Y, Liu Z (2020) A three-stage algorithm on community detection in social networks. Knowl-Based Syst 187:104822
Newman MEJ, Clauset A (2016) Structure and inference in annotated networks. Nature communications 7(1):1–11
Article Google Scholar
Arthur D, Vassilvitskii S (2007) k-means++: the advantages of careful seeding. Soc Appl Math 1027–1035
Frénay B, Verleysen M (2013) Classification in the presence of label noise: a survey. IEEE trans neural networks learn syst 25(5):845–869
Article Google Scholar
Xia S, Wang G, Chen Z et al (2019) Complete random forest based class noise filtering learning for improving the generalizability of classifiers. IEEE Trans Knowl Data Eng 31(11):2063–2078
Article Google Scholar
Ruan Y, Fuhry D, Parthasarathy S (2013) Efficient community detection in large networks using content and links. Proceedings of the 22nd int conf World Wide Web 1089–1098
Ahn YY, Bagrow JP, Lehmann S (2010) Link communities reveal multiscale complexity in networks. Nature 466(7307):761–764
Article Google Scholar
Arasteh M, Alizadeh S (2019) A fast divisive community detection algorithm based on edge degree betweenness centrality. Appl Intelli 49(2):689–702
Article Google Scholar
Lancichinetti A, Fortunato S (2009) Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Phys Rev E 80(1):016118
Article Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (61762078, 61363058, 61966004), Research Fund of Guangxi Key Lab of Multi-source Information Mining and Security (MIMS18-08), Northwest Normal University young teachers research capacity promotion plan (NWNU-LKQN2019-2), Research Fund of Guangxi Key Laboratory of Trusted Software (kx202003), and Gansu Innovation college fund project (2020B-089).

Author information

Authors and Affiliations

College of Computer Science and Engineering, Northwest Normal University, Lanzhou, 730070, Gansu, China
Qiqi Zhao, Huifang Ma & Xiaohong Li
Guangxi Key Lab of Multi-source Information Mining and Security, Guangxi Normal University, Guilin, 541004, Guangxi, China
Huifang Ma & Zhixin Li

Authors

Qiqi Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Huifang Ma
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohong Li
View author publications
You can also search for this author in PubMed Google Scholar
Zhixin Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Huifang Ma.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhao, Q., Ma, H., Li, X. et al. Is the simple assignment enough? Exploring the interpretability for community detection. Int. J. Mach. Learn. & Cyber. 12, 3463–3474 (2021). https://doi.org/10.1007/s13042-021-01384-8

Download citation

Received: 16 October 2020
Accepted: 19 July 2021
Published: 07 August 2021
Issue Date: December 2021
DOI: https://doi.org/10.1007/s13042-021-01384-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Is the simple assignment enough? Exploring the interpretability for community detection

Abstract

Access this article

Similar content being viewed by others

Community detection in attributed networks using neighborhood information

A novel method of spectral clustering in attributed networks by constructing parameter-free affinity matrix

Community detection in attributed networks considering both structural and attribute similarities: two mathematical programming approaches

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Is the simple assignment enough? Exploring the interpretability for community detection

Abstract

Access this article

Similar content being viewed by others

Community detection in attributed networks using neighborhood information

A novel method of spectral clustering in attributed networks by constructing parameter-free affinity matrix

Community detection in attributed networks considering both structural and attribute similarities: two mathematical programming approaches

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation