Abstract
The stochastic blockmodel (SBM) has recently attracted attention in social network analysis and statistical machine learning, because it enables us to decompose and then analyze an exploratory network without any a priori knowledge of its intrinsic structure. However, prohibitive computational cost limits SBM learning algorithms capable of model selection to small networks with hundreds of nodes. This paper presents a fine-grained SBM and a fast learning algorithm for it, named FSL, which combines the component-wise EM (CEM) algorithm with the minimum message length (MML) criterion so that parameter estimation and model evaluation are carried out in parallel. FSL significantly reduces the time complexity of learning and scales to networks with thousands of nodes. The experimental results indicate that FSL achieves the best tradeoff between effectiveness and efficiency, greatly reducing learning time while preserving competitive learning accuracy. Moreover, the proposed method shows excellent generalization ability when applied to link prediction.
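For readers new to this model family, the following minimal sketch samples a small undirected network from a standard SBM and measures the edge density between blocks. It illustrates only the textbook generative model, not the fine-grained SBM or the FSL algorithm proposed in the paper. The function and parameter names (`sample_sbm`, `block_densities`, `block_sizes`, `edge_prob`) are assumptions made for this example.

```python
import random


def sample_sbm(block_sizes, edge_prob, seed=0):
    """Sample an undirected graph from a standard stochastic blockmodel.

    block_sizes: number of nodes in each block (illustrative name).
    edge_prob:   symmetric K x K matrix; edge_prob[a][b] is the probability
                 of an edge between a node of block a and a node of block b.
    Returns (labels, edges), where labels[i] is the block of node i and
    edges is a list of pairs (i, j) with i < j.
    """
    rng = random.Random(seed)
    # Assign consecutive node ids to blocks 0, 1, ..., K-1.
    labels = [k for k, size in enumerate(block_sizes) for _ in range(size)]
    n = len(labels)
    edges = []
    for i in range(n):
        for j in range(i + 1, n):
            # Each node pair is linked independently, with a probability
            # that depends only on the blocks of its two endpoints.
            if rng.random() < edge_prob[labels[i]][labels[j]]:
                edges.append((i, j))
    return labels, edges


def block_densities(labels, edges, num_blocks):
    """Empirical edge density for every pair of blocks."""
    sizes = [labels.count(k) for k in range(num_blocks)]
    counts = [[0] * num_blocks for _ in range(num_blocks)]
    for i, j in edges:
        a, b = labels[i], labels[j]
        counts[a][b] += 1
        if a != b:
            counts[b][a] += 1
    density = [[0.0] * num_blocks for _ in range(num_blocks)]
    for a in range(num_blocks):
        for b in range(num_blocks):
            # Number of distinct node pairs available between the blocks.
            if a == b:
                possible = sizes[a] * (sizes[a] - 1) // 2
            else:
                possible = sizes[a] * sizes[b]
            if possible > 0:
                density[a][b] = counts[a][b] / possible
    return density


if __name__ == "__main__":
    # Two blocks that are dense inside and sparse between each other.
    labels, edges = sample_sbm([30, 30], [[0.5, 0.05], [0.05, 0.5]], seed=1)
    for row in block_densities(labels, edges, 2):
        print([round(x, 3) for x in row])
```

Learning an SBM is the inverse task: given only the edges, recover the block labels, the connection probabilities and, when model selection is required, the number of blocks.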
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Zhao, X., Yang, B., Chen, H. (2014). Efficiently and Fast Learning a Fine-grained Stochastic Blockmodel from Large Networks. In: Tseng, V.S., Ho, T.B., Zhou, ZH., Chen, A.L.P., Kao, HY. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2014. Lecture Notes in Computer Science, vol 8443. Springer, Cham. https://doi.org/10.1007/978-3-319-06608-0_31
DOI: https://doi.org/10.1007/978-3-319-06608-0_31
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06607-3
Online ISBN: 978-3-319-06608-0
eBook Packages: Computer Science, Computer Science (R0)