Skip to main content

A New Implementation of the co-VAT Algorithm for Visual Assessment of Clusters in Rectangular Relational Data

  • Conference paper
Book cover Artificial Intelligence and Soft Computing (ICAISC 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6113))

Included in the following conference series:

Abstract

This paper presents a new implementation of the co-VAT algorithm. We assume we have an m×n matrix D, where the elements of D are pair-wise dissimilarities between m row objects O r and n column objects O c . The union of these disjoint sets are (N = m + n) objects O. Clustering tendency assessment is the process by which a data set is analyzed to determine the number(s) of clusters present. In 2007, the co-Visual Assessment of Tendency (co-VAT) algorithm was proposed for rectangular data such as these. co-VAT is a visual approach that addresses four clustering tendency questions: i) How many clusters are in the row objects O r ? ii) How many clusters are in the column objects O c ? iii) How many clusters are in the union of the row and column objects O r  ∪ O c ? And, iv) How many (co)-clusters are there that contain at least one of each type? co-VAT first imputes pair-wise dissimilarity values among the row objects, the square relational matrix D r , and the column objects, the square relational matrix D c , and then builds a larger square dissimilarity matrix D r ∪  c . The clustering questions can then be addressed by using the VAT algorithm on D r , D c , and D r ∪  c ; D is reordered by shuffling the reordering indices of D r ∪  c . Subsequently, the co-VAT image of D may show tendency for co-clusters (problem iv). We first discuss a different way to construct this image, and then we also extend a path-based distance transform, which is used in the iVAT algorithm, to co-VAT. The new algorithm, co-iVAT, shows dramatic improvement in the ability of co-VAT to show cluster tendency in rectangular dissimilarity data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Dhillon, I.: Co-clustering documents and words using bipartite spectral graph partitioning. In: Proc. 7th ACM SIGKDD Int. Conf. on Knowledge Discovery Data Mining, San Francisco, CA, pp. 269–274 (2001)

    Google Scholar 

  2. Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 3rd edn. Academic Press, San Diego (2006)

    MATH  Google Scholar 

  3. Bezdek, J., Hathaway, R.: VAT: A tool for visual assessment of (cluster) tendency. In: Proc. IJCNN 2002, Honolulu, HI, pp. 2225–2230 (2002)

    Google Scholar 

  4. Bezdek, J., Hathaway, R., Huband, J.: Visual assessment of clustering tendency for rectangular dissimilarity matrices. IEEE Trans. Fuzzy Systems 15(5), 890–903 (2007)

    Article  Google Scholar 

  5. Prim, R.: Shortest connection networks and some generalisations. Bell System Tech. J. 36, 1389–1401 (1957)

    Google Scholar 

  6. Wang, L., Nguyen, T., Bezdek, J., Leckie, C., Ramamohanarao, K.: iVAT and aVAT: enhanced visual analysis for cluster tendency assessment (2009) (in review)

    Google Scholar 

  7. Fisher, B., Zoller, T., Buhmann, J.: Path based pairwise data clustering with application to texture segmentation. In: Figueiredo, M., Zerubia, J., Jain, A.K. (eds.) EMMCVPR 2001. LNCS, vol. 2134, pp. 235–250. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  8. Park, L., Bezdek, J., Leckie, C.: Visualization of clusters in very large rectangular dissimilarity data. In: Gupta, G.S., Mukhopadhyay, S. (eds.) Proc. 4th Int. Conf. Autonomous Robots and Agents, February 2009, pp. 251–256 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Havens, T.C., Bezdek, J.C., Keller, J.M. (2010). A New Implementation of the co-VAT Algorithm for Visual Assessment of Clusters in Rectangular Relational Data. In: Rutkowski, L., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2010. Lecture Notes in Computer Science(), vol 6113. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13208-7_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-13208-7_46

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-13207-0

  • Online ISBN: 978-3-642-13208-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics