Scalable Detection of Overlapping Communities and Role Assignments in Networks via Bayesian Probabilistic Generative Affiliation Modeling

Costa, Gianni; Ortale, Riccardo

doi:10.1007/978-3-319-48472-3_6

Gianni Costa²⁰ &
Riccardo Ortale²⁰

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 10033))

Included in the following conference series:

OTM Confederated International Conferences "On the Move to Meaningful Internet Systems"

1478 Accesses
4 Citations

Abstract

A new generative model of directed networks is developed to explain link formation from a Bayesian probabilistic perspective. Essentially, nodes can be affiliated to multiple (or, even, all) communities as well as roles. Affiliations are dichotomized to account for link direction. The unknown strength of node affiliations to communities and roles is captured through latent nonnegative random variables, that are ruled by Gamma priors for better model interpretability. Overall, such random variables are meant to generalize both mixed-membership and directed affiliation modeling, which allows for a differentiated connectivity structure inside communities. The probability of a link between two nodes is governed by a Poisson distribution, whose rate increases with the number of shared community affiliations as well as the strength of their affiliations to the common communities and respective roles. The properties of the Poisson distribution are especially beneficial on sparse networks for faster posterior inference. The latter is implemented by a coordinate-ascent variational algorithm enabling affiliation exploration and link prediction. The results of a comparative evaluation carried out on several real-world networks show the overcoming performance of the devised approach in community compactness, link prediction and scalability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Airoldi, E.M., Blei, D.M., Fienberg, S.E., Xing, E.P.: Mixed membership stochastic blockmodels. J. Mach. Learn. Res. 9, 1981–2014 (2008)
MATH Google Scholar
Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)
MATH Google Scholar
Blei, D.M., Kucukelbir, A., McAuliffe, J.D.: Variational Inference: A Review for Statisticians. arXiv:1601.00670 (2016)
Chatterjee, N., Sinha, S.: Understanding the mind of a worm: hierarchical network structure underlying nervous system function in c. elegans. In: Banerjee, R., Chakrabarti, B.K. (eds.) Progress in Brain Research, pp. 145–153. Elsevier B.V. (2008)
Google Scholar
Chou, B.-H., Suzuki, E.: Discovering community-oriented roles of nodes in a social network. In: Bach Pedersen, T., Mohania, M.K., Tjoa, A.M. (eds.) DaWaK 2010. LNCS, vol. 6263, pp. 52–64. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15105-7_5
Chapter Google Scholar
Costa, G., Ortale, R.: A bayesian hierarchical approach for exploratory analysis of communities and roles in social networks. In: Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 194–201 (2012)
Google Scholar
Costa, G., Ortale, R.: Probabilistic analysis of communities and inner roles in networks: Bayesian generative models and approximate inference. Soc. Netw. Anal. Min. 3(4), 1015–1038 (2013)
Article Google Scholar
Costa, G., Ortale, R.: A unified generative bayesian model for community discovery and role assignment based upon latent interaction factors. In: Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 93–100 (2014)
Google Scholar
Costa, G., Ortale, R.: Model-based collaborative personalized recommendation on signed social rating networks. ACM Trans. Internet Technol. 16(3), 20:1–20:21 (2016)
Article Google Scholar
Creamer, G., Rowe, R., Hershkop, S., Stolfo, S.J.: Segmentation and automated social hierarchy detection through email network analysis. In: Zhang, H., Spiliopoulou, M., Mobasher, B., Giles, C.L., McCallum, A., Nasraoui, O., Srivastava, J., Yen, J. (eds.) WebKDD/SNA-KDD 2007. LNCS, vol. 5439, pp. 40–58. Springer, Heidelberg (2009). doi:10.1007/978-3-642-00528-2_3
Chapter Google Scholar
Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3–5), 75–174 (2010)
Article MathSciNet Google Scholar
Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. Proc. Nat. Acad. Sci. 99(12), 7821–7826 (2002)
Article MathSciNet MATH Google Scholar
Gopalan, P., Hofman, J., Blei, D.: Scalable recommendation with hierarchical poisson factorization. In: Proceedings of Uncertainty in Artificial Intelligence, pp. 326–335 (2015)
Google Scholar
Gopalan, P., Wang, C., Blei, D.M.: Modeling overlapping communities with node popularities. In: Proceedings of Advances in Neural Information Processing Systems, pp. 2850–2858 (2013)
Google Scholar
Gopalan, P.K., Blei, D.M.: Efficient discovery of overlapping communities in massive networks. Proc. Natl. Acad. Sci. 110(36), 14534–14539 (2013)
Article MathSciNet MATH Google Scholar
Henderson, K., Eliassi-Rad, T., Papadimitriou, S., Faloutsos, C.: Hcdf: a hybrid community discovery framework. In: SIAM SDM, pp. 754–765 (2010)
Google Scholar
Henderson, K., Eliassi Rad, T.: Applying latent dirichlet allocation to group discovery in large graphs. In: ACM SAC, pp. 1456–1461 (2009)
Google Scholar
Kernighan, B.W., Lin, S.: An efficient heuristic procedure for partitioning graphs. Bell Syst. Tech. J. 49(1), 291–307 (1970)
Article MATH Google Scholar
Koller, D., Friedman, N.: Probabilistic Graphical Models. Principles and Techniques. The MIT Press, Cambridge (2009)
MATH Google Scholar
Liben-Nowell, D., Kleinberg, J.: The link-prediction problem for social networks. J. Am. Soc. Inform. Sci. Technol. 58(7), 1019–1031 (2007)
Article Google Scholar
Lorrain, F., White, H.C.: The structural equivalence of individuals in social networks. J. Math. Sociol. 1, 49–80 (1971)
Article Google Scholar
McAuley, J., Leskovec, J.: Learning to discover social circles in ego networks. In: Proceedings of Advances in Neural Information Processing Systems, pp. 548–556 (2012)
Google Scholar
McCallum, A., Wang, X., Corrada-Emmanuel, A.: Topic and role discovery in social networks with experiments on enron and academic email. J. Artif. Intell. Res. 30(1), 249–272 (2007)
Google Scholar
Newman, M.E.J.: Detecting community structure in networks. Eur. Phys. J. B 38(2), 321–330 (2004)
Article Google Scholar
Newman, M.E.J.: Fast algorithm for detecting community structure in networks. Phys. Rev. E 69, 066133 (2004)
Article Google Scholar
Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 026113 (2004)
Article Google Scholar
Pathak, N., Delong, C., Banerjee, A., Erickson, K.: Social topic models for community extraction. In: Proceedings of KDD Workshop on Social Network Mining and Analysis (2008)
Google Scholar
Pothen, A., Simon, H.D., Liou, K.-P.: Partitioning sparse matrices with eigenvectors of graphs. SIAM J. Matrix Anal. Appl. 11(3), 430–452 (1990)
Article MathSciNet MATH Google Scholar
Radicchi, F., Castellano, C., Cecconi, F., Loreto, V., Parisi, D.: Defining and identifying communities in networks. In: Proceedings of the National Academy of Sciences of the United States of America, vol. 101(9), pp. 2658–2663 (2004)
Google Scholar
Scripps, J., Tan, P.-N., Esfahanian, A.-H.: Exploration of link structure and community-based node roles in network analysis. In: Proceeding of International Conference on Data Mining, pp. 649–654 (2007)
Google Scholar
Scripps, J., Tan, P.-N., Esfahanian, A.-H.: Node roles and community structure in networks. In: Proceedings of Workshop on Web Mining and Social Network Analysis (WebKDD and SNA-KDD), pp. 26–35 (2007)
Google Scholar
Sohn, Y., Choi, M.-K., Ahn, Y.-Y., Lee, J., Jeong, J.: Topological cluster analysis reveals the systemic organization of the caenorhabditis elegans connectome. PLoS Comput. Biol. 7(5), e1001139 (2011)
Article MathSciNet Google Scholar
Wasserman, S., Faust, K.: Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge (1994)
Book MATH Google Scholar
Watts, D.J., Strogatz, S.H.: Collective dynamics of small-world networks. Nature 393(6684), 440–442 (1998)
Article Google Scholar
White, J.G., Southgate, E., Thompson, J.N., Brenner, S.: The structure of the nervous system of the nematode caenorhabditis elegans. Philos. Trans. R. Soc. B: Biol. Sci. 314(1165), 1–340 (1986)
Article Google Scholar
Xie, J., Kelley, S., Szymanski, B.K.: Overlapping community detection in networks: the state of the art and comparative study. ACM Comput. Surv. 45(4), 43:1–43:35 (2013)
Article MATH Google Scholar
Yang, J., Leskovec, J.: Structure and overlaps of ground-truth communities in networks. ACM Trans. Intell. Syst. Technol. 5(2), 26:1–26:35 (2014)
Article Google Scholar
Yang, J., McAuley, J., Leskovec, J.: Community detection in networks with node attributes. In: Proceedings of International Conference on Data Mining, pp. 1151–1156 (2013)
Google Scholar
Yang, J., McAuley, J., Leskovec, J.: Detecting cohesive and 2-mode communities in directed and undirected networks. In: Proceedings of ACM International Conference on Web Search and Data Mining, pp. 323–332 (2014)
Google Scholar
Zhang, H., Qiu, B., Giles, C.L., Foley, H.C., Yen, J.: An lda-based community structure discovery approach for large-scale social networks. In: Proceedings of IEEE International Conference on Intelligence and Security Informatics, pp. 200–207 (2007)
Google Scholar
Zhou, D., Manavoglu, E., Li, J., Giles, C.L., Zha, H.: Probabilistic models for discovering e-communities. In: Proceedings of International Conference on World Wide Web, pp. 173–182 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

ICAR-CNR, Via Bucci 41c, 87036, Rende (CS), Italy
Gianni Costa & Riccardo Ortale

Authors

Gianni Costa
View author publications
You can also search for this author in PubMed Google Scholar
Riccardo Ortale
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Riccardo Ortale .

Editor information

Editors and Affiliations

ADAPT Centre, Trinity College Dublin, Dublin 2, Ireland
Christophe Debruyne
University of Lorraine, Vandoeuvre-les-Nancy, France
Hervé Panetto
TU Graz, Graz, Austria
Robert Meersman
La Trobe University, Melbourne, Australia
Tharam Dillon
Institute of Computer Languages, TU Wien, Vienna, Austria
eva Kühn
ADAPT Centre, Trinity College Dublin, Dublin 2, Ireland
Declan O'Sullivan
Università degli Studi di Milano Crema, Crema, Italy
Claudio Agostino Ardagna

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Costa, G., Ortale, R. (2016). Scalable Detection of Overlapping Communities and Role Assignments in Networks via Bayesian Probabilistic Generative Affiliation Modeling. In: Debruyne, C., et al. On the Move to Meaningful Internet Systems: OTM 2016 Conferences. OTM 2016. Lecture Notes in Computer Science(), vol 10033. Springer, Cham. https://doi.org/10.1007/978-3-319-48472-3_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-48472-3_6
Published: 18 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48471-6
Online ISBN: 978-3-319-48472-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics