Abstract
Analysts interested in understanding the community structure of a particular real-world network will often simply choose a popular community detection algorithm and trust the generated results without further investigation, but algorithm performance can vary depending on the network characteristics. We demonstrate that by running experiments on benchmark graphs tailored to match characteristics of a real-world network of interest, a better understanding can be obtained on how community detection algorithms will perform on the real-world network. We show that the correlation between the performance of the community detection methods on a publicly available dataset to the average performance of the same methods on the corresponding tailored benchmark graphs is high whereas the correlation with LFR benchmark graphs is negative. This means the methods that performed well on the tailored graphs also performed well on the real-world network but methods that perform well on LFR graphs did not perform well on the real-world network, demonstrating that the proposed methodology has merit.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barabási, A.: Network science. Phil. Trans. R. Soc. A 371(1987) (2013)
Berry, J.W., Hendrickson, B., LaViolette, R.A., Phillips, C.A.: Tolerating the community detection resolution limit with edge weighting. Phys. Rev. E 83(5) (2011)
Blondel, V.D., Guillaume, J., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech.: Theory Exp. 2008(10) (2008)
Bollobás, B.: A probabilistic proof of an asymptotic formula for the number of labelled regular graphs. Eur. J. Comb. 1(4) (1980)
Clauset, A., Newman, M., Moore, C.: Finding community structure in very large networks. Phys. Rev. E 70(6) (2004)
Faloutsos, M., Faloutsos, P., Faloutsos, C.: On power-law relationships of the internet topology. ACM SIGCOMM CCR 29(4) (1999)
Fortunato, S., Hric, D.: Community detection in networks. Phys. Rep. 659 (2016)
Girvan, M., Newman, M.: Community structure in social and biological networks. PNAS 99(12) (2002)
Khadivi, A., Hasler, M.: A weighting scheme for enhancing community detection in networks. In: 2010 ICC. IEEE (2010)
Khadivi, A., Rad, A.A., Hasler, M.: Network community-detection enhancement by proper weighting. Phys. Rev. E 83(4) (2011)
Lancichinetti, A., Fortunato, S., Radicchi, F.: Benchmark graphs for testing community detection algorithms. Phys. Rev. E 78(4) (2008)
Leskovec, J., Krevl, A.: SNAP Datasets. http://snap.stanford.edu/data (2014)
Lyzinski, V., Tang, M., Athreya, A., Park, Y., Priebe, C.E.: Community detection and classification in hierarchical stochastic blockmodels. IEEE Trans. Netw. Sci. Eng. 4(1) (2016)
Moradi-Jamei, B., Shakeri, H., Poggi-Corradini, P., Higgins, M.J.: A new method for quantifying network cyclic structure to improve community detection. Stat, Phys. A (2020)
Muscoloni, A., Cannistraci, C.V.: Rich-clubness test (2017). arXiv:1704.03526
Muscoloni, A., Cannistraci, C.V.: Leveraging the nonuniform pso network model as a benchmark for performance evaluation in community detection and link prediction. New J. Phys. 20(6) (2018)
Muscoloni, A., Cannistraci, C.V.: A nonuniform popularity-similarity optimization (npso) model to efficiently generate realistic complex networks with communities. New J. Phys. 20(5) (2018)
Muscoloni, A., Thomas, J.M., Ciucci, S., Bianconi, G., Cannistraci, C.V.: Machine learning meets complex networks via coalescent embedding in the hyperbolic space. Nat. Commun. 8(1) (2017)
Orman, G.K., Labatut, V., Cherifi, H.: Comparative evaluation of community detection algorithms. J. Stat. Mech. Theory Exp. 2012(08) (2012)
Papadopoulos, F., Kitsak, M., Serrano, M.Á., Boguná, M., Krioukov, D.: Popularity versus similarity in growing networks. Nature 489(7417) (2012)
Peel, L., Larremore, D.B., Clauset, A.: The ground truth about metadata and community detection in networks. Sci. Adv. 3(5) (2017)
Pons, P., Latapy, M.: Computing communities in large networks using random walks. In: Int. Symp. Comput. Inf. Sci. Springer (2005)
Radicchi, F., Castellano, C., Cecconi, F., Loreto, V., Parisi, D.: Defining and identifying communities in networks. PNAS 101(9) (2004)
Raghavan, U.N., Albert, R., Kumara, S.: Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76(3) (2007)
Ravasz, E., Barabási, A.: Hierarchical organization in complex networks. Phys. Rev. E 67(2) (2003)
Rosvall, M., Bergstrom, C.T.: Maps of random walks on complex networks reveal community structure. PNAS 105(4) (2008)
Savkli, C., Schwartz, C., Galante, A., Cohen, J.: Novel edge and density metrics for link cohesion. In: Int. Conf. on Comp. Netw. Springer (2019)
Schaub, M.T., Delvenne, J., Rosvall, M., Lambiotte, R.: The many facets of community detection in complex networks. Appl. Netw. Sci. 2(1) (2017)
Schwartz, C.: Analyzing semi-local link cohesion to detect communities and anomalies in complex networks. Ph.D. thesis (2021)
Yang, Z., Algesheimer, R., Tessone, C.J.: A comparative analysis of community detection algorithms on artificial networks. Sci. Rep. 6(1) (2016)
Zhang, H., Zhou, C., Liang, X., Zhao, X., Li, Y.: A novel edge weighting method to enhance network community detection. IEEE Trans. Man. Cybern. Syst., p. 2015 (2015)
Acknowledgments
This paper was written under a Janney Grant from Johns Hopkins University Applied Physics Laboratory. We also thank Carlo V. Cannistraci, who introduced the authors to the nPSO benchmark at the 8th International Conference on Complex Networks and their Applications in 2019.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Schwartz, C., Savkli, C., Galante, A., Czaja, W. (2024). Tailoring Benchmark Graphs to Real-World Networks for Improved Prediction of Community Detection Performance. In: Cherifi, H., Rocha, L.M., Cherifi, C., Donduran, M. (eds) Complex Networks & Their Applications XII. COMPLEX NETWORKS 2023. Studies in Computational Intelligence, vol 1142. Springer, Cham. https://doi.org/10.1007/978-3-031-53499-7_9
Download citation
DOI: https://doi.org/10.1007/978-3-031-53499-7_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53498-0
Online ISBN: 978-3-031-53499-7
eBook Packages: EngineeringEngineering (R0)