Skip to main content
Log in

Ego-network probabilistic graphical model for discovering on-line communities

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Community discovery is a leading research topic in social network analysis. In this paper, we present an ego-network probabilistic graphical model (ENPGM) which encodes users’ feature similarities and the causal dependencies between users’ profiles, communities, and ego networks. The model comprises three parts: a profile similarity probabilistic graph, social circle vector, and relationship probabilistic vector. Using Bayesian networks, the profile similarity probabilistic graph considers information about both the features of individuals and network structures with low memory usage. The social circle vector is proposed to describe both the alters belonging to a community and the features causing the community to emerge. The relationship probabilistic vector represents the probability that an ego network forms when given a set of user profiles and a set of circles. We then propose a parameter-learning algorithm and the ego-network probabilistic criterion (ENPC) for extracting communities from ego networks with some missing feature values. The ENPC score balances both the positive and negative impacts of social circles on the probabilities of forming an ego network. Experimental results using Facebook, Twitter, and Google+ datasets indicate that the ENPGM and community learning algorithms can predict social circles with similar quality to the ground-truth communities.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Airoldi EM, Blei DM, Fienberg SE, Xing EP (2008) Mixed membership stochastic blockmodels. J Mach Learn Res 9:1981–2014

    MATH  Google Scholar 

  2. Arnaboldi V, Conti M, Passarella A, Dunbar RI (2017) Online social networks and information diffusion: The role of ego networks. Online Soc Netw Media 1:44–55

    Article  Google Scholar 

  3. Bastian M, Heymann S, Jacomy M et al (2009) Gephi: an open source software for exploring and manipulating networks. ICWSM 8:361–362

    Google Scholar 

  4. Bezdek JC (2013) Pattern recognition with fuzzy objective function algorithms. Springer Science & Business Media, Berlin

    MATH  Google Scholar 

  5. Biswas A, Biswas B (2015) Investigating community structure in perspective of ego network. Expert Syst Appl 42(20):6913–6934

    Article  Google Scholar 

  6. Buglass SL, Binder JF, Betts LR, Underwood JD (2017) Looking for trouble: A multilevel analysis of disagreeable contacts in online social networks. Comput Hum Behav 70:234–243

    Article  Google Scholar 

  7. Burton SH, Giraud-Carrier CG (2014) Discovering social circles in directed graphs. ACM Trans Knowl Discov Data 8(4):21

    Article  Google Scholar 

  8. Cheraghchi HS, Zakerolhosseini A (2017) Toward a novel art inspired incremental community mining algorithm in dynamic social network. Appl Intell 46(2):409–426

    Article  Google Scholar 

  9. Cruz-Ramírez N, Acosta-Mesa HG, Barrientos-Martínez RE, Nava-Fernández LA (2006) How good are the bayesian information criterion and the minimum description length principle for model selection? a bayesian network analysis. In: Mexican International Conference on Artificial Intelligence, pp. 494–504. Springer

  10. Daly R, Shen Q, Aitken S (2011) Learning bayesian networks: approaches and issues. Knowl Eng Rev 26(02):99–157. https://doi.org/10.1017/S0269888910000251

    Article  Google Scholar 

  11. De Laat M, Lally V, Lipponen L, Simons RJ (2007) Investigating patterns of interaction in networked learning and computer-supported collaborative learning: A role for social network analysis. Int J Comput-Support Collab Learn 2(1):87–103

    Article  Google Scholar 

  12. Dinakar S, Andhale P, Rege M (2015) Sentiment analysis of social network content. In: 2015 IEEE International Conference on Information Reuse and Integration (IRI), pp. 189–192. IEEE

  13. Dotsika F, Watkins A (2017) Identifying potentially disruptive trends by means of keyword network analysis. Technological Forecasting and Social Change

  14. Fagnan J, Zaïane O, Barbosa D (2014) Using triads to identify local community structure in social networks. In: 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 108–112. IEEE

  15. Frey BJ, Dueck D (2007) Clustering by passing messages between data points. Science 315(5814):972–976

    Article  MathSciNet  MATH  Google Scholar 

  16. Gibbons JD, Chakraborti S (2011) Nonparametric statistical inference. Springer, Berlin

    Book  MATH  Google Scholar 

  17. Girvan M, Newman ME (2002) Community structure in social and biological networks. Proc Natl Acad Sci 99(12):7821–7826

    Article  MathSciNet  MATH  Google Scholar 

  18. Gonzalez-Pardo A, Jung JJ, Camacho D (2017) ACO-based clustering for ego network analysis. Futur Gener Comput Syst 66:160–170

    Article  Google Scholar 

  19. Handcock MS, Raftery AE, Tantrum JM (2007) Model-based clustering for social networks. J R Stat Soc A Stat Soc 170(2):301–354

    Article  MathSciNet  Google Scholar 

  20. Heer J, Perer A (2014) Orion: A system for modeling, transformation and visualization of multidimensional heterogeneous networks. Inf Vis 13(2):111–133

    Article  Google Scholar 

  21. Jianqiang Z, Xiaolin G, Feng T (2017) A new method of identifying influential users in the micro-blog networks. IEEE Access 5:3008–3015

    Article  Google Scholar 

  22. Kamal A, Elahi ML, Poon B, Amin MA (2015) Fusion based approach to discovering social circles in EGO networks. In: 2015 International Conference on Machine Learning and Cybernetics (ICMLC), vol. 2, pp. 511–517. IEEE

  23. Khan GF, Wood J (2015) Information technology management domain: Emerging themes and keyword analysis. Scientometrics 105(2):959–972

    Article  Google Scholar 

  24. Lam W, Segre AM (2002) A distributed learning algorithm for bayesian inference networks. IEEE Trans Knowl Data Eng 14(1):93–105

    Article  Google Scholar 

  25. Lauritzen SL, Spiegelhalter DJ (1988) Local computations with probabilities on graphical structures and their application to expert systems. J R Stat Soc. Series B (Methodological) 157–224

  26. Leskovec J, Krevl A (2014) SNAP Datasets: Stanford large network dataset collection. http://snap.stanford.edu/data. Accessed: 2017-05-02

  27. Leskovec J, Mcauley J (2012) Learning to discover social circles in ego networks. In: Advances in Neural Information Processing Systems, pp 539–547. Curran Associates, Inc

  28. Li Z, He L, Li Y (2016) A novel multiobjective particle swarm optimization algorithm for signed network community detection. Appl Intell 44(3):621–633

    Article  Google Scholar 

  29. Lim KH, Datta A (2013) A seed-centric community detection algorithm based on an expanding ring search. In: Proceedings of the First Australasian Web Conference-Volume 144, pp 21–25. Australian Computer Society, Inc

  30. MacQueen J et al (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol 1, pp 281–297. Oakland, CA, USA

  31. Mcauley J, Leskovec J (2014) Discovering social circles in ego networks. ACM Trans Knowl Discov Data 8(1):4

    Article  Google Scholar 

  32. Newman ME (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066,133

    Article  Google Scholar 

  33. Nguyen TH, Shirai K, Velcin J (2015) Sentiment analysis on social media for stock movement prediction. Expert Syst Appl 42(24):9603–9611

    Article  Google Scholar 

  34. Obar JA, Wildman S (2015) Social media definition and the governance challenge: An introduction to the special issue. Telecommun Policy 9(39):745–750

    Article  Google Scholar 

  35. Petkos G, Papadopoulos S, Kompatsiaris Y (2015) Social circle discovery in ego-networks by mining the latent structure of user connections and profile attributes. In: 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 880–887. IEEE

  36. Rendón E, Abundez I, Arizmendi A, Quiroz EM (2011) Internal versus external cluster validation indexes. Int J Comput Commun 5(1):27–34

    Google Scholar 

  37. Rissanen J (1986) Stochastic complexity and modeling. Ann Stat, pp. 1080–1100

  38. Rokach L, Maimon O (2005) Clustering methods. Springer, Berlin

    Book  Google Scholar 

  39. Rosa JLG, Piazentin DR (2016) A new cognitive filtering approach based on freeman k3 neural networks. Appl Intell 2(45):363–382

    Article  Google Scholar 

  40. Shannon CE, Weaver W (1948) A mathematical theory of communication. Bell system technical journal 27(379-423):623–656

    Article  MathSciNet  MATH  Google Scholar 

  41. Stark TH, Krosnick JA (2017) Gensi: A new graphical tool to collect ego-centered network data. Soc Networks 48: 36–45

    Article  Google Scholar 

  42. Strehl A, Ghosh J (2002) Cluster ensembles—a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3(Dec):583–617

    MathSciNet  MATH  Google Scholar 

  43. Tabarzad MA, Hamzeh A (2017) A heuristic local community detection method (hlcd). Appl Intell 46 (1):62–78

    Article  Google Scholar 

  44. Tsamardinos I, Brown LE, Aliferis CF (2006) The max-min hill-climbing bayesian network structure learning algorithm. Mach Learn 65(1):31–78

    Article  Google Scholar 

  45. Verma A, Bharadwaj KK (2017) Identifying community structure in a multi-relational network employing non-negative tensor factorization and ga k-means clustering. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 7(1)

  46. Wang M, Zuo W, Wang Y (2016) An improved density peaks-based clustering method for social circle discovery in social networks. Neurocomputing 179:219–227

    Article  Google Scholar 

  47. Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3):645–678

    Article  Google Scholar 

  48. Yakoubi Z, Kanawati R (2014) Licod: A leader-driven algorithm for community detection in complex networks. Vietnam J Comput Sci 1(4):241–256

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (General Program) under Grant No. 61572253, the 13th Five-Year Plan Equipment Pre-Research Projects Fund under Grant No. 61402420101HK02001, and the Aviation Science Fund under Grant No. 2016ZC52030.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yi Zhuang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ding, F., Zhuang, Y. Ego-network probabilistic graphical model for discovering on-line communities. Appl Intell 48, 3038–3052 (2018). https://doi.org/10.1007/s10489-018-1137-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-018-1137-y

Keywords

Navigation