Skip to main content

Comparing Partitions by Subset Similarities

  • Conference paper
Computational Intelligence for Knowledge-Based Systems Design (IPMU 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6178))

Abstract

Comparing partitions is an important issue in classification and clustering when comparing results from different methods, parameters, or initializations. A well–established method for comparing partitions is the Rand index but this index is suitable for crisp partitions only. Recently, the Hüllermeier–Rifqi index was introduced which is a generalization of the Rand index to fuzzy partitions. In this paper we introduce a new approach to comparing partitions based on the similarities of their clusters in the sense of set similarity. All three indices, Rand, Hüllermeier–Rifqi, and subset similarity, are reflexive, invariant against row permutations, and invariant against additional empty subsets. The subset similarity index is not a generalization of the Rand index, but produces similar values. Subset similarity yields more intuitive similarities than Hüllermeier–Rifqi when comparing crisp and fuzzy partitions, and yields smoother nonlinear transitions. Finally, the subset similarity index has a lower computational complexity than the Hüllermeier–Rifqi index for large numbers of objects.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zadeh, L.A.: Fuzzy sets. Information and Control 8, 338–353 (1965)

    Article  MATH  MathSciNet  Google Scholar 

  2. Bezdek, J.C.: Numerical taxonomy with fuzzy sets. Journal of Mathematical Biology 1(1), 57–71 (1974)

    Article  MATH  MathSciNet  Google Scholar 

  3. Schürmann, J.: Pattern Classification — A Unified View of Statistical and Neural Approaches. Wiley, New York (1996)

    Google Scholar 

  4. Höppner, F., Klawonn, F., Kruse, R., Runkler, T.A.: Fuzzy Cluster Analysis — Methods for Image Recognition, Classification, and Data Analysis. Wiley, Chichester (1999)

    Google Scholar 

  5. Windham, M.P.: Cluster validity for the fuzzy c–means clustering algorithm. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI-4(4), 357–363 (1982)

    Article  Google Scholar 

  6. Xie, X.L., Beni, G.: A validity measure for fuzzy clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 13(8), 841–847 (1991)

    Article  Google Scholar 

  7. Rand, W.M.: Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association 66(336), 846–850 (1971)

    Article  Google Scholar 

  8. Campello, R.J.G.B.: A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment. Pattern Recognition Letters 28(7), 833–841 (2007)

    Article  Google Scholar 

  9. Hüllermeier, E., Rifqi, M.: A fuzzy variant of the Rand index for comparing clustering structures. In: Joint IFSA World Congress and EUSFLAT Conference, Lisbon, Portugal, July 2009, pp. 1294–1298 (2009)

    Google Scholar 

  10. Schweizer, B., Sklar, A.: Associative functions and statistical triangle inequalities. Publicationes Mathematicae 8, 169–186 (1961)

    MATH  MathSciNet  Google Scholar 

  11. Beringer, J., Hüllermeier, E.: Fuzzy clustering of parallel data streams. In: de Oliveira, J.V., Pedrycz, W. (eds.) Advances in Fuzzy Clustering and its Applications, pp. 333–352. Wiley, Chichester (2007)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Runkler, T.A. (2010). Comparing Partitions by Subset Similarities. In: Hüllermeier, E., Kruse, R., Hoffmann, F. (eds) Computational Intelligence for Knowledge-Based Systems Design. IPMU 2010. Lecture Notes in Computer Science(), vol 6178. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14049-5_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-14049-5_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14048-8

  • Online ISBN: 978-3-642-14049-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics