Skip to main content
Log in

Is VAT really single linkage in disguise?

  • Published:
Annals of Mathematics and Artificial Intelligence Aims and scope Submit manuscript

Abstract

This paper addresses the relationship between the Visual Assessment of cluster Tendency (VAT) algorithm and single linkage hierarchical clustering. We present an analytical comparison of the two algorithms in conjunction with numerical examples to show that VAT reordering of dissimilarity data is directly related to the clusters produced by single linkage hierarchical clustering. This analysis is important to understanding the underlying theory of VAT and, more generally, other algorithms that are based on VAT-ordered dissimilarity data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bezdek, J., Hathaway, R.: VAT: a tool for visual assessment of (cluster) tendency. In: Proc. IJCNN 2002, pp. 2225–30. Piscataway (2002)

  2. Bezdek, J., Hathaway, R., Huband, J.: Visual assessment of clustering tendency for rectangular dissimilarity matrices. IEEE Trans. Fuzzy Syst. 15(5), 890–903 (2007)

    Article  Google Scholar 

  3. Bezdek, J., Keller, J., Krishnapuram, R., Pal, N.: Fuzzy Models and Algorithms for Pattern Recognition and Image Processing. Kluwer, Norwell (1999)

    MATH  Google Scholar 

  4. Ding, Y., Harrison, R.: Relational visual cluster validity (RVCV). Pattern Recogn. Lett. 28, 2071–2079 (2007)

    Article  Google Scholar 

  5. Duda, R., Hart, P., Stork, D.: Pattern Classification, 2nd edn. Wiley, New York (2000)

    Google Scholar 

  6. Dunn, J.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3(3), 32–57 (1974)

    Article  MathSciNet  Google Scholar 

  7. Gower, J., Ross, G.: Minimum spanning trees and single linkage cluster analysis. Appl. Stat. 18, 54–64 (1969)

    Article  MathSciNet  Google Scholar 

  8. Harary, F.: Graph Theory. Addison-Wesley, Reading (2004)

    Google Scholar 

  9. Hartigan, J.: Clustering Algorithms. Wiley, New York (1975)

    MATH  Google Scholar 

  10. Hathaway, R., Bezdek, J.: Visual cluster validity for prototype generator clustering models. Pattern Recogn. Lett. 24, 1563–1569 (2003)

    Article  MATH  Google Scholar 

  11. Hathaway, R., Bezdek, J., Huband, J.: Scalable visual asseessment of cluster tendency for large data sets. Pattern Recogn. 39(7), 1315–1324 (2006)

    Article  MATH  Google Scholar 

  12. Havens, T., Bezdek, J., Keller, J., Popescu, M.: Clustering in ordered dissimilarity data. Int. J. Intell. Syst. 24(5), 504–528 (2009)

    Article  MATH  Google Scholar 

  13. Huband, J., Bezdek, J.: Computational Intelligence: Research Frontiers, chap. VCV2—Visual Cluster Validity, pp. 293–308. Springer, Berlin (2008)

    Google Scholar 

  14. Huband, J., Bezdek, J., Hathaway, R.: Revised visual assessment of (cluster) tendency (reVAT). In: Proc. NAFIPS, pp. 101–104. IEEE Press, Banff (2004)

    Google Scholar 

  15. Huband, J., Bezdek, J., Hathaway, R.: bigVAT: visual assessment of cluster tendency for large data sets. Pattern Recogn. 38(11), 1875–1886 (2005)

    Article  Google Scholar 

  16. Jain, A., Dubes, R.: Algorithms for Clustering Data. Prentice-Hall, Englewood Cliffs (1988)

    MATH  Google Scholar 

  17. Kruskal, J.: On the shortest spanning subtree of a graph and the traveling salesman problem. In: Proc. of the Am. Math. Soc., vol. 7, pp. 48–50 (1956)

  18. Lynch, N.: Distributed Algorithms, 4th edn. Morgan Kaufmann, San Fransisco (1996)

    MATH  Google Scholar 

  19. Myllyharju, J., Kivirikko, K.: Collagens, modifying enzymes, and their mutation in humans, flies, and worms. Trends Genet. 20(1), 33–43 (2004)

    Article  Google Scholar 

  20. Notsu, A., Ichihashi, H., Honda, K., Katai, O.: Visualization of balancing systems based on naive psychological approaches. AI Soc. 23(2), 281–296 (2007)

    Article  Google Scholar 

  21. Popescu, M., Bezdek, J., Keller, J., Havens, T., Huband, J.: A new cluster validity measure for bioinformatics relational datasets. In: Proc. FUZZ-IEEE, pp. 726–731. Hong Kong, China (2008)

  22. Popescu, M., Keller, J., Mitchell, J., Bezdek, J.: Functional summarization of gene product clusters using Gene Ontology similarity measures. In: Proc. 2004 ISSNIP, pp. 553–559. IEEE, Piscataway (2004)

    Google Scholar 

  23. Prim, R.: Shortest connection networks and some generalisations. Bell Syst. Tech. J. 36, 1389–1401 (1957)

    Google Scholar 

  24. Sledge, I., Havens, T., Huband, J., Bezdek, J., Keller, J.: Finding the number of clusters in ordered dissimilarities. Soft Comput. 13, 1125–1142 (2009)

    Article  Google Scholar 

  25. The Gene Ontology Consortium: The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 32, D258–D261 (2004)

    Article  Google Scholar 

  26. Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 3rd edn. Academic, San Diego (2006)

    MATH  Google Scholar 

  27. Hubbard, T.J.P., et al.: Ensembl 2009. Nucleic Acids Res. 37, D690–D697 (2009)

    Article  Google Scholar 

  28. Wang, L., Leckie, C., Rao, K., Bezdek, J.: Automatically determining the number of clusters from unlabeled data sets. IEEE Trans. Knowl. Data Eng. 21(3), 335–350 (2009)

    Article  Google Scholar 

  29. Yang, S., Luo, S., Li, J.: Advanced data mining and applications. In: Lecture Notes in Computer Science, vol. 4093, chap. A novel visual clustering algorithm for finding community in complex systems, pp. 396–403. Springer, Berlin (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Timothy C. Havens.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Havens, T.C., Bezdek, J.C., Keller, J.M. et al. Is VAT really single linkage in disguise?. Ann Math Artif Intell 55, 237 (2009). https://doi.org/10.1007/s10472-009-9157-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10472-009-9157-2

Keywords

Mathematics Subject Classifications (2000)

Navigation