Abstract
With the wide use of data mining and cluster analysis, cluster validation is attracting increasing attention. This paper introduces the concept and development of cluster validation and then proposes a classification of cluster validity indexes based on membership degree: indexes suited to crisp clustering and indexes suited to fuzzy clustering. Building on this classification and using the Cluster Validity Analysis Platform (CVAP), the paper describes the two most important uses of cluster validation: finding the optimal number of clusters and finding an appropriate clustering algorithm for a particular data set. Experiments visualize the cluster validation process.
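To make the crisp/fuzzy distinction concrete, the following minimal sketch computes one index of each kind and uses the crisp one to compare candidate cluster counts. It is an illustration only: the choice of the Dunn index and the partition coefficient, the function names, and the toy data are assumptions made here, not the paper's method and not the CVAP interface.

```python
from itertools import combinations
from math import dist  # Euclidean distance, Python 3.8+


def dunn_index(points, labels):
    """Crisp index: smallest between-cluster distance divided by the
    largest within-cluster diameter. Higher values indicate compact,
    well-separated clusters."""
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    groups = list(clusters.values())
    if len(groups) < 2:
        raise ValueError("the Dunn index needs at least two clusters")
    # Largest distance between two points that share a cluster.
    max_diameter = max(
        (dist(a, b) for g in groups for a, b in combinations(g, 2)),
        default=0.0,
    )
    # Smallest distance between two points in different clusters.
    min_separation = min(
        dist(a, b)
        for g1, g2 in combinations(groups, 2)
        for a in g1
        for b in g2
    )
    if max_diameter == 0.0:
        return float("inf")  # every cluster is a single point
    return min_separation / max_diameter


def partition_coefficient(memberships):
    """Fuzzy index: mean of the squared membership degrees. Each row
    holds one point's memberships and is assumed to sum to 1; the value
    reaches 1.0 only when the partition is crisp."""
    n = len(memberships)
    return sum(u * u for row in memberships for u in row) / n


# Hypothetical toy data: two visually obvious groups of three points.
points = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
candidates = {
    2: [0, 0, 0, 1, 1, 1],  # labels for a 2-cluster partition
    3: [0, 0, 1, 2, 2, 2],  # labels for a 3-cluster partition
}
# Use 1: pick the number of clusters whose partition scores best.
best_k = max(candidates, key=lambda k: dunn_index(points, candidates[k]))

# Hypothetical fuzzy membership matrix (3 points, 2 clusters).
fuzzy_u = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]]
pc = partition_coefficient(fuzzy_u)
```

The same pattern covers the second use named in the abstract: hold the data set fixed, produce one partition per clustering algorithm, and compare the index values instead of comparing across cluster counts.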
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xie, N., Hu, L., Luktarhan, N., Zhao, K. (2011). A Classification of Cluster Validity Indexes Based on Membership Degree and Applications. In: Gong, Z., Luo, X., Chen, J., Lei, J., Wang, F.L. (eds) Web Information Systems and Mining. WISM 2011. Lecture Notes in Computer Science, vol 6987. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23971-7_6
DOI: https://doi.org/10.1007/978-3-642-23971-7_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23970-0
Online ISBN: 978-3-642-23971-7
eBook Packages: Computer Science, Computer Science (R0)