Abstract
Detecting overlapping groups is a specific challenge that offers relevant solutions for many application domains that require organizing data into non-disjoint clusters. Recently, several methods have been proposed in the literature, giving different layouts for the overlapping boundaries between clusters. However, assessing the performance of these methods is still a challenging issue. In fact, existing evaluation measures for overlapping clustering do not take into account the overlap error local to each data object; they only calculate the overall overlap size relative to all clusters. Therefore, we propose in this work a new external evaluation measure, referred to as Micro-Overlap, designed to perform efficient and robust evaluation of overlapping clustering. Experiments on synthetic and real datasets show the performance of the proposed measure compared to existing ones.
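The distinction between a global overlap size and a per-object overlap error can be illustrated with a small sketch. It is not the definition of Micro-Overlap, which the abstract does not give. The function names `overlap_size` and `per_object_overlap_errors` and the toy memberships are hypothetical and chosen only for illustration. Global overlap size is assumed here to mean the average number of clusters assigned per object.

```python
def overlap_size(memberships):
    """Global overlap size: average number of clusters assigned per object."""
    return sum(len(m) for m in memberships) / len(memberships)


def per_object_overlap_errors(predicted, truth):
    """Per-object error: absolute difference in the number of assigned clusters."""
    return [abs(len(p) - len(t)) for p, t in zip(predicted, truth)]


# Toy example (assumed data): each set lists the clusters one object belongs to.
truth = [{0}, {0, 1}, {1}, {1, 2}]
predicted = [{0, 1}, {0}, {1, 2}, {1}]

global_truth = overlap_size(truth)
global_pred = overlap_size(predicted)
errors = per_object_overlap_errors(predicted, truth)
mean_error = sum(errors) / len(errors)

print(global_truth, global_pred)  # identical global overlap sizes
print(errors, mean_error)         # yet every object has the wrong number of memberships
```

In this toy case both clusterings have the same global overlap size, so a measure based only on that quantity cannot tell them apart, while the per-object view shows that every object is assigned the wrong number of clusters.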
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Ben N’Cir, CE., Essoussi, N. (2017). New Overlap Measure for the Validation of Non-disjoint Partitioning. In: Jallouli, R., Zaïane, O., Bach Tobji, M., Srarfi Tabbane, R., Nijholt, A. (eds) Digital Economy. Emerging Technologies and Business Innovation. ICDEc 2017. Lecture Notes in Business Information Processing, vol 290. Springer, Cham. https://doi.org/10.1007/978-3-319-62737-3_13
DOI: https://doi.org/10.1007/978-3-319-62737-3_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62736-6
Online ISBN: 978-3-319-62737-3
eBook Packages: Computer Science, Computer Science (R0)