Abstract
This paper presents a new implementation of the co-VAT algorithm. We assume we have an m×n matrix D, where the elements of D are pair-wise dissimilarities between m row objects O r and n column objects O c . The union of these disjoint sets are (N = m + n) objects O. Clustering tendency assessment is the process by which a data set is analyzed to determine the number(s) of clusters present. In 2007, the co-Visual Assessment of Tendency (co-VAT) algorithm was proposed for rectangular data such as these. co-VAT is a visual approach that addresses four clustering tendency questions: i) How many clusters are in the row objects O r ? ii) How many clusters are in the column objects O c ? iii) How many clusters are in the union of the row and column objects O r ∪ O c ? And, iv) How many (co)-clusters are there that contain at least one of each type? co-VAT first imputes pair-wise dissimilarity values among the row objects, the square relational matrix D r , and the column objects, the square relational matrix D c , and then builds a larger square dissimilarity matrix D r ∪ c . The clustering questions can then be addressed by using the VAT algorithm on D r , D c , and D r ∪ c ; D is reordered by shuffling the reordering indices of D r ∪ c . Subsequently, the co-VAT image of D may show tendency for co-clusters (problem iv). We first discuss a different way to construct this image, and then we also extend a path-based distance transform, which is used in the iVAT algorithm, to co-VAT. The new algorithm, co-iVAT, shows dramatic improvement in the ability of co-VAT to show cluster tendency in rectangular dissimilarity data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Dhillon, I.: Co-clustering documents and words using bipartite spectral graph partitioning. In: Proc. 7th ACM SIGKDD Int. Conf. on Knowledge Discovery Data Mining, San Francisco, CA, pp. 269–274 (2001)
Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 3rd edn. Academic Press, San Diego (2006)
Bezdek, J., Hathaway, R.: VAT: A tool for visual assessment of (cluster) tendency. In: Proc. IJCNN 2002, Honolulu, HI, pp. 2225–2230 (2002)
Bezdek, J., Hathaway, R., Huband, J.: Visual assessment of clustering tendency for rectangular dissimilarity matrices. IEEE Trans. Fuzzy Systems 15(5), 890–903 (2007)
Prim, R.: Shortest connection networks and some generalisations. Bell System Tech. J. 36, 1389–1401 (1957)
Wang, L., Nguyen, T., Bezdek, J., Leckie, C., Ramamohanarao, K.: iVAT and aVAT: enhanced visual analysis for cluster tendency assessment (2009) (in review)
Fisher, B., Zoller, T., Buhmann, J.: Path based pairwise data clustering with application to texture segmentation. In: Figueiredo, M., Zerubia, J., Jain, A.K. (eds.) EMMCVPR 2001. LNCS, vol. 2134, pp. 235–250. Springer, Heidelberg (2001)
Park, L., Bezdek, J., Leckie, C.: Visualization of clusters in very large rectangular dissimilarity data. In: Gupta, G.S., Mukhopadhyay, S. (eds.) Proc. 4th Int. Conf. Autonomous Robots and Agents, February 2009, pp. 251–256 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Havens, T.C., Bezdek, J.C., Keller, J.M. (2010). A New Implementation of the co-VAT Algorithm for Visual Assessment of Clusters in Rectangular Relational Data. In: Rutkowski, L., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2010. Lecture Notes in Computer Science(), vol 6113. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13208-7_46
Download citation
DOI: https://doi.org/10.1007/978-3-642-13208-7_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13207-0
Online ISBN: 978-3-642-13208-7
eBook Packages: Computer ScienceComputer Science (R0)