Skip to main content

Efficiently and Fast Learning a Fine-grained Stochastic Blockmodel from Large Networks

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8443))

Abstract

Stochastic blockmodel (SBM) has recently come into the spotlight in the domains of social network analysis and statistical machine learning, as it enables us to decompose and then analyze an exploratory network without knowing any priori information about its intrinsic structure. However, the prohibitive computational cost limits SBM learning algorithm with the capability of model selection to small network with hundreds of nodes. This paper presents a fine-gained SBM and its fast learning algorithm, named FSL, which ingeniously combines the component-wise EM (CEM) algorithm and minimum message length (MML) together to achieve the parallel learning of parameter estimation and model evaluation. The FSL significantly reduces the time complexity of the learning algorithm, and scales to network with thousands of nodes. The experimental results indicate that the FSL can achieve the best tradeoff between effectiveness and efficiency through greatly reducing learning time while preserving competitive learning accuracy. Moreover, it is noteworthy that our proposed method shows its excellent generalization ability through the application of link prediction.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Holland, P., Laskey, K., Leinhardt, S.: Stochastic blockmodels: First steps. Social Networks 5(2), 109–137 (1983)

    Article  MathSciNet  Google Scholar 

  2. Airoldi, E., Blei, D., Fienberg, S., Xing, E.: Mixed membership stochastic blockmodels. The Journal of Machine Learning Research 9, 1981–2014 (2008)

    MATH  Google Scholar 

  3. Latouche, P., Birmel, E., Ambroise, C.: Overlapping stochastic block models with application to the French political blogosphere. The Annals of Applied Statistics 5(1), 309–336 (2011)

    Article  MATH  MathSciNet  Google Scholar 

  4. Newman, M., Leicht, E.: Mixture models and exploratory analysis in networks. Proceedings of the National Academy of Sciences of the United States of America 104(23), 9564–9569 (2007)

    Article  MATH  Google Scholar 

  5. Karrer, B., Newman, M.: Stochastic blockmodels and community structure in networks. Physical Review E 83(1), 016107 (2011)

    Google Scholar 

  6. Yang, B., Liu, J., Liu, D.: Characterizing and Extracting Multiplex Patterns in Complex Networks. IEEE Transactions on Systems Man and Cybernetics, Part B-Cybernetics 42(2), 469–481 (2012)

    Article  Google Scholar 

  7. Shen, H., Cheng, X., Guo, J.: Exploring the structural regularities in networks. Physical Review E 84(5), 056111 (2011)

    Google Scholar 

  8. Zhu, Y., Liu, D., Chen, G., Jia, H., Yu, H.: Mathematical modeling for active and dynamic diagnosis of crop diseases based on Bayesian networks and incremental learning. Mathematical and Computer Modelling 58(3), 514–523 (2013)

    Article  Google Scholar 

  9. Daudin, J., Picard, F., Robin, S.: A mixture model for random graphs. Statistics and Computing 18(2), 173–183 (2008)

    Article  MathSciNet  Google Scholar 

  10. Hofman, J., Wiggins, C.: Bayesian approach to network modularity. Physical Review Letters 100(25), 258701 (2008)

    Google Scholar 

  11. Latouche, P., Birmele, E., Ambroise, C.: Variational Bayesian inference and complexity control for stochastic block models. Statistical Modelling 12(1), 93–115 (2012)

    Article  MathSciNet  Google Scholar 

  12. Figueiredo, M., Jain, A.: Unsupervised learning of finite mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(3), 381–396 (2002)

    Article  Google Scholar 

  13. Celeux, G., Chretien, S., Forbes, F., Mkhadri, A.: A component-wise EM algorithm for mixtures. Journal of Computational and Graphical Statistics 10(4), 697–712 (2001)

    Article  MathSciNet  Google Scholar 

  14. Girvan, M., Newman, M.: Community structure in social and biological networks. Proceedings of the National Academy of Sciences of the United States of America 99(12), 7821–7826 (2002)

    Article  MATH  MathSciNet  Google Scholar 

  15. Lusseau, D., Schneider, K., Boisseau, O., Haase, P., Slooten, E., Dawson, S.: The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations - Can geographic isolation explain this unique trait? Behavioral Ecology and Sociobiology 54(4), 396–405 (2003)

    Article  Google Scholar 

  16. Batageli, V., Mrvar, A.: Pajek datasets, http://vlado.fmf.uni-lj.si/pub/networks/data/default.htm

  17. Duch, J., Arenas, A.: Community detection in complex networks using extremal optimization. Physical Review E 72(2), 027104 (2005)

    Google Scholar 

  18. Lu, L., Zhou, T.: Link prediction in complex networks: A survey. Physica A-Statistical Mechanics and Its Applications 390(6), 1150–1170 (2011)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhao, X., Yang, B., Chen, H. (2014). Efficiently and Fast Learning a Fine-grained Stochastic Blockmodel from Large Networks. In: Tseng, V.S., Ho, T.B., Zhou, ZH., Chen, A.L.P., Kao, HY. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2014. Lecture Notes in Computer Science(), vol 8443. Springer, Cham. https://doi.org/10.1007/978-3-319-06608-0_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-06608-0_31

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-06607-3

  • Online ISBN: 978-3-319-06608-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics