Skip to main content

Clustering Validity

  • Reference work entry
Encyclopedia of Database Systems

Synonyms

Cluster validation; Cluster stability; Quality assessment; Stability-based validation of clustering

Definition

A problem one faces in clustering is to decide the optimal partitioning of the data into clusters. In this context visualization of the data set is a crucial verification of the clustering results. In the case of large multidimensional data sets (e.g., more than three dimensions) effective visualization of the data set is cumbersome. Moreover the perception of clusters using available visualization tools is a difficult task for humans that are not accustomed to higher dimensional spaces. The procedure of evaluating the results of a clustering algorithm is known under the term cluster validity. Cluster validity consists of a set of techniques for finding a set of clusters that best fits natural partitions (of given datasets) without any a priori class information. The outcome of the clustering process is validated by a cluster validity index.

Historical Background

Clust...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 2,500.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  1. Bezdek J.C. and Pal N.R. Some new indexes of cluster validity, IEEE Trans., Systems, Man, and Cybernetics, Part B. 28(3):301–315, 1998.

    Article  Google Scholar 

  2. Datta S. and Datta S. Comparisons and validation of statistical clustering techniques for microarray gene expression data. Bioinformatics, 19(4):459–466, 2003.

    Article  Google Scholar 

  3. El-Melegy M.T., E.A., Zanaty Abd-Elhafiez W.M., and Farag A.A. 2007.On cluster validity indexes in fuzzy and hard clustering algorithms for image segmentation. In Proc. Int. Conf. Image Processing, pp. 5–8.

    Google Scholar 

  4. Halkidi M., Batistakis Y., and Vazirgiannis M. On clustering validation techniques. J. Intell. Inf. Syst., 17(2–3):107–145, 2001.

    Article  MATH  Google Scholar 

  5. Halkidi M., Gunopulos D., Vazirgiannis M., Kumar N., and Domeniconi C. A clustering framework based on subjective and objective validity criteria. ACM Trans. Knowl. Discov. Data, 1(4), 2008.

    Google Scholar 

  6. Jiang D., Tang C., and Zhang A. Cluster Analysis for Gene Expression Data: A Survey. IEEE Trans. Knowl. Data Eng., 16(11):1370–1386, 2004.

    Article  Google Scholar 

  7. Kim M. and Ramakrishna R.S. New indices for cluster validity assessment. Pattern Recogn. Lett., 26(15):2353–2363, 2005.

    Article  Google Scholar 

  8. Maulik U. and Bandyopadhyay S. Performance evaluation of some clustering algorithms and validity indices. IEEE Trans. Pattern Anal. Mach. Intell., 24(12):1650–1654, 2002.

    Article  Google Scholar 

  9. NIPS 2005 workshop on theoretical foundations of clustering, Saturday, December 10th, 2005. Available at: (http://www.kyb.tuebingen.mpg.de/bs/people/ule/clustering_workshop_nips05/clustering_workshop_nips05.htm_).

  10. Pal N.R. and Bezdek J.C. On cluster validity for the fuzzy c-means model, IEEE Trans. Fuzzy Systems., 3(3):370–379, 1995.

    Article  Google Scholar 

  11. Rand W.M. Objective criteria for the evaluation of clustering methods. J. Am. Stat. Assoc., 66(336):846–850, 1971.

    Article  Google Scholar 

  12. Wang J.-S. and Chiang J.-C. A cluster validity measure with a hybrid parameter search method for the support vector clustering algorithm. Pattern Recognit., 41(2):506–520, 2008.

    Article  MATH  Google Scholar 

  13. Zhang J. and Modestino J.W. A model-fitting approach to cluster validation with application to stochastic model-based image segmentation. IEEE Trans. Pattern Anal. Mach. Intell., 12(10):1009–1017, 1990.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media, LLC

About this entry

Cite this entry

Vazirgiannis, M. (2009). Clustering Validity. In: LIU, L., ÖZSU, M.T. (eds) Encyclopedia of Database Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-39940-9_616

Download citation

Publish with us

Policies and ethics