Abstract
Community discovery is a leading research topic in social network analysis. In this paper, we present an ego-network probabilistic graphical model (ENPGM) which encodes users’ feature similarities and the causal dependencies between users’ profiles, communities, and ego networks. The model comprises three parts: a profile similarity probabilistic graph, social circle vector, and relationship probabilistic vector. Using Bayesian networks, the profile similarity probabilistic graph considers information about both the features of individuals and network structures with low memory usage. The social circle vector is proposed to describe both the alters belonging to a community and the features causing the community to emerge. The relationship probabilistic vector represents the probability that an ego network forms when given a set of user profiles and a set of circles. We then propose a parameter-learning algorithm and the ego-network probabilistic criterion (ENPC) for extracting communities from ego networks with some missing feature values. The ENPC score balances both the positive and negative impacts of social circles on the probabilities of forming an ego network. Experimental results using Facebook, Twitter, and Google+ datasets indicate that the ENPGM and community learning algorithms can predict social circles with similar quality to the ground-truth communities.
Similar content being viewed by others
References
Airoldi EM, Blei DM, Fienberg SE, Xing EP (2008) Mixed membership stochastic blockmodels. J Mach Learn Res 9:1981–2014
Arnaboldi V, Conti M, Passarella A, Dunbar RI (2017) Online social networks and information diffusion: The role of ego networks. Online Soc Netw Media 1:44–55
Bastian M, Heymann S, Jacomy M et al (2009) Gephi: an open source software for exploring and manipulating networks. ICWSM 8:361–362
Bezdek JC (2013) Pattern recognition with fuzzy objective function algorithms. Springer Science & Business Media, Berlin
Biswas A, Biswas B (2015) Investigating community structure in perspective of ego network. Expert Syst Appl 42(20):6913–6934
Buglass SL, Binder JF, Betts LR, Underwood JD (2017) Looking for trouble: A multilevel analysis of disagreeable contacts in online social networks. Comput Hum Behav 70:234–243
Burton SH, Giraud-Carrier CG (2014) Discovering social circles in directed graphs. ACM Trans Knowl Discov Data 8(4):21
Cheraghchi HS, Zakerolhosseini A (2017) Toward a novel art inspired incremental community mining algorithm in dynamic social network. Appl Intell 46(2):409–426
Cruz-Ramírez N, Acosta-Mesa HG, Barrientos-Martínez RE, Nava-Fernández LA (2006) How good are the bayesian information criterion and the minimum description length principle for model selection? a bayesian network analysis. In: Mexican International Conference on Artificial Intelligence, pp. 494–504. Springer
Daly R, Shen Q, Aitken S (2011) Learning bayesian networks: approaches and issues. Knowl Eng Rev 26(02):99–157. https://doi.org/10.1017/S0269888910000251
De Laat M, Lally V, Lipponen L, Simons RJ (2007) Investigating patterns of interaction in networked learning and computer-supported collaborative learning: A role for social network analysis. Int J Comput-Support Collab Learn 2(1):87–103
Dinakar S, Andhale P, Rege M (2015) Sentiment analysis of social network content. In: 2015 IEEE International Conference on Information Reuse and Integration (IRI), pp. 189–192. IEEE
Dotsika F, Watkins A (2017) Identifying potentially disruptive trends by means of keyword network analysis. Technological Forecasting and Social Change
Fagnan J, Zaïane O, Barbosa D (2014) Using triads to identify local community structure in social networks. In: 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 108–112. IEEE
Frey BJ, Dueck D (2007) Clustering by passing messages between data points. Science 315(5814):972–976
Gibbons JD, Chakraborti S (2011) Nonparametric statistical inference. Springer, Berlin
Girvan M, Newman ME (2002) Community structure in social and biological networks. Proc Natl Acad Sci 99(12):7821–7826
Gonzalez-Pardo A, Jung JJ, Camacho D (2017) ACO-based clustering for ego network analysis. Futur Gener Comput Syst 66:160–170
Handcock MS, Raftery AE, Tantrum JM (2007) Model-based clustering for social networks. J R Stat Soc A Stat Soc 170(2):301–354
Heer J, Perer A (2014) Orion: A system for modeling, transformation and visualization of multidimensional heterogeneous networks. Inf Vis 13(2):111–133
Jianqiang Z, Xiaolin G, Feng T (2017) A new method of identifying influential users in the micro-blog networks. IEEE Access 5:3008–3015
Kamal A, Elahi ML, Poon B, Amin MA (2015) Fusion based approach to discovering social circles in EGO networks. In: 2015 International Conference on Machine Learning and Cybernetics (ICMLC), vol. 2, pp. 511–517. IEEE
Khan GF, Wood J (2015) Information technology management domain: Emerging themes and keyword analysis. Scientometrics 105(2):959–972
Lam W, Segre AM (2002) A distributed learning algorithm for bayesian inference networks. IEEE Trans Knowl Data Eng 14(1):93–105
Lauritzen SL, Spiegelhalter DJ (1988) Local computations with probabilities on graphical structures and their application to expert systems. J R Stat Soc. Series B (Methodological) 157–224
Leskovec J, Krevl A (2014) SNAP Datasets: Stanford large network dataset collection. http://snap.stanford.edu/data. Accessed: 2017-05-02
Leskovec J, Mcauley J (2012) Learning to discover social circles in ego networks. In: Advances in Neural Information Processing Systems, pp 539–547. Curran Associates, Inc
Li Z, He L, Li Y (2016) A novel multiobjective particle swarm optimization algorithm for signed network community detection. Appl Intell 44(3):621–633
Lim KH, Datta A (2013) A seed-centric community detection algorithm based on an expanding ring search. In: Proceedings of the First Australasian Web Conference-Volume 144, pp 21–25. Australian Computer Society, Inc
MacQueen J et al (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol 1, pp 281–297. Oakland, CA, USA
Mcauley J, Leskovec J (2014) Discovering social circles in ego networks. ACM Trans Knowl Discov Data 8(1):4
Newman ME (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066,133
Nguyen TH, Shirai K, Velcin J (2015) Sentiment analysis on social media for stock movement prediction. Expert Syst Appl 42(24):9603–9611
Obar JA, Wildman S (2015) Social media definition and the governance challenge: An introduction to the special issue. Telecommun Policy 9(39):745–750
Petkos G, Papadopoulos S, Kompatsiaris Y (2015) Social circle discovery in ego-networks by mining the latent structure of user connections and profile attributes. In: 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 880–887. IEEE
Rendón E, Abundez I, Arizmendi A, Quiroz EM (2011) Internal versus external cluster validation indexes. Int J Comput Commun 5(1):27–34
Rissanen J (1986) Stochastic complexity and modeling. Ann Stat, pp. 1080–1100
Rokach L, Maimon O (2005) Clustering methods. Springer, Berlin
Rosa JLG, Piazentin DR (2016) A new cognitive filtering approach based on freeman k3 neural networks. Appl Intell 2(45):363–382
Shannon CE, Weaver W (1948) A mathematical theory of communication. Bell system technical journal 27(379-423):623–656
Stark TH, Krosnick JA (2017) Gensi: A new graphical tool to collect ego-centered network data. Soc Networks 48: 36–45
Strehl A, Ghosh J (2002) Cluster ensembles—a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3(Dec):583–617
Tabarzad MA, Hamzeh A (2017) A heuristic local community detection method (hlcd). Appl Intell 46 (1):62–78
Tsamardinos I, Brown LE, Aliferis CF (2006) The max-min hill-climbing bayesian network structure learning algorithm. Mach Learn 65(1):31–78
Verma A, Bharadwaj KK (2017) Identifying community structure in a multi-relational network employing non-negative tensor factorization and ga k-means clustering. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 7(1)
Wang M, Zuo W, Wang Y (2016) An improved density peaks-based clustering method for social circle discovery in social networks. Neurocomputing 179:219–227
Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3):645–678
Yakoubi Z, Kanawati R (2014) Licod: A leader-driven algorithm for community detection in complex networks. Vietnam J Comput Sci 1(4):241–256
Acknowledgements
This work was supported by the National Natural Science Foundation of China (General Program) under Grant No. 61572253, the 13th Five-Year Plan Equipment Pre-Research Projects Fund under Grant No. 61402420101HK02001, and the Aviation Science Fund under Grant No. 2016ZC52030.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ding, F., Zhuang, Y. Ego-network probabilistic graphical model for discovering on-line communities. Appl Intell 48, 3038–3052 (2018). https://doi.org/10.1007/s10489-018-1137-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-018-1137-y