Abstract
Comparing partitions is an important issue in classification and clustering when comparing results from different methods, parameters, or initializations. A well–established method for comparing partitions is the Rand index but this index is suitable for crisp partitions only. Recently, the Hüllermeier–Rifqi index was introduced which is a generalization of the Rand index to fuzzy partitions. In this paper we introduce a new approach to comparing partitions based on the similarities of their clusters in the sense of set similarity. All three indices, Rand, Hüllermeier–Rifqi, and subset similarity, are reflexive, invariant against row permutations, and invariant against additional empty subsets. The subset similarity index is not a generalization of the Rand index, but produces similar values. Subset similarity yields more intuitive similarities than Hüllermeier–Rifqi when comparing crisp and fuzzy partitions, and yields smoother nonlinear transitions. Finally, the subset similarity index has a lower computational complexity than the Hüllermeier–Rifqi index for large numbers of objects.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zadeh, L.A.: Fuzzy sets. Information and Control 8, 338–353 (1965)
Bezdek, J.C.: Numerical taxonomy with fuzzy sets. Journal of Mathematical Biology 1(1), 57–71 (1974)
Schürmann, J.: Pattern Classification — A Unified View of Statistical and Neural Approaches. Wiley, New York (1996)
Höppner, F., Klawonn, F., Kruse, R., Runkler, T.A.: Fuzzy Cluster Analysis — Methods for Image Recognition, Classification, and Data Analysis. Wiley, Chichester (1999)
Windham, M.P.: Cluster validity for the fuzzy c–means clustering algorithm. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI-4(4), 357–363 (1982)
Xie, X.L., Beni, G.: A validity measure for fuzzy clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 13(8), 841–847 (1991)
Rand, W.M.: Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association 66(336), 846–850 (1971)
Campello, R.J.G.B.: A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment. Pattern Recognition Letters 28(7), 833–841 (2007)
Hüllermeier, E., Rifqi, M.: A fuzzy variant of the Rand index for comparing clustering structures. In: Joint IFSA World Congress and EUSFLAT Conference, Lisbon, Portugal, July 2009, pp. 1294–1298 (2009)
Schweizer, B., Sklar, A.: Associative functions and statistical triangle inequalities. Publicationes Mathematicae 8, 169–186 (1961)
Beringer, J., Hüllermeier, E.: Fuzzy clustering of parallel data streams. In: de Oliveira, J.V., Pedrycz, W. (eds.) Advances in Fuzzy Clustering and its Applications, pp. 333–352. Wiley, Chichester (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Runkler, T.A. (2010). Comparing Partitions by Subset Similarities. In: Hüllermeier, E., Kruse, R., Hoffmann, F. (eds) Computational Intelligence for Knowledge-Based Systems Design. IPMU 2010. Lecture Notes in Computer Science(), vol 6178. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14049-5_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-14049-5_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14048-8
Online ISBN: 978-3-642-14049-5
eBook Packages: Computer ScienceComputer Science (R0)