Abstract
Classical clustering methods, such as partitioning and hierarchical clustering algorithms, often fail to deliver satisfactory results, given clusters of arbitrary shapes. Motivated by a clustering validity index based on inter-cluster and intra-cluster density, we propose that the clustering validity index be used not only globally to find optimal partitions of input data, but also locally to determine which two neighboring clusters are to be merged in a hierarchical clustering of Self-Organizing Map (SOM). A new two-level SOM-based clustering algorithm using the clustering validity index is also proposed. Experimental results on synthetic and real data sets demonstrate that the proposed clustering algorithm is able to cluster data in a better way than classical clustering algorithms on an SOM.
Similar content being viewed by others
References
Pal, N. R., Bezdek, J. C. and Tsao, E. C.-K.: Generalized clustering networks and Kohonen's self-organizing scheme, IEEE Trans. Neural Networks, 4(4) (1993), 549–557.
Jain, A.K., Murty, M. N. and Flyn, P. J.: Data clustering: a review, ACM Computing Surveys, 31(3) (1999), 264–323.
Han, J. and Kamber, M.: Data Mining: Concepts and Techniques, Morgan-Kaufman, San Francisco, 2000.
Kohonen, T.: Self-Organizing Maps, Springer-Verlag, Berlin, Germany, 1997.
Huntsberger, T. and Ajjimarangsee, P.: Parallel self-organizing feature maps for unsupervised pattern recognition, Int. J. General systems, 16 (1989), 357–372.
Mao, J. and Jain, A. K.: A self-organizing network for hyperellipsoidal clustering (HEC), IEEE Trans. onNeural Networks, 7(1) (1996), 16–29.
Lampinen, J. and Oja, E.: Clustering properties of hierarchical self-organizing maps, J. Math. Imag. Vis., 2(2-3) (1992), 261–272.
Murtagh, F.: Interpreting the Kohonen self-organizing feature map using contiguity-constrained clustering, Pattern Recognition Letters, 16 (1995), 399–408.
Kiang, M. Y.: Extending the Kohonen self-organizing map networks for clustering analysis, Computational Statistics & Data Analysis, 38 (2001), 161–180.
Cottrell, M., Gaubert, P., Letremy P. and Rousset, P.: Analyzing and representing multidimensional quantitative and qualitative data: Demographic study of the Rhône valley. The domestic consumption of the Canadian families, In: Erkki Oja and Samuel Kaski (eds.), Kohonenmaps, pp. 1–14, Elsevier, 1999.
Vesanto, J. and Alhonierni, E.: Clustering of the self-organizing map, IEEETrans.Neural Networks, 11(3) (2000), 586–600.
Lopson, H. and Siegelmann, H. T.: Clustering irregular shapes using high-order neurons, Neural Computation, 12(10) (2000), 2331–2353.
Halkidi, M. and Vazirgiannis, M.: Clustering validity assessment using multi representatives, In: Proceedings of SETN Conference, Thessaloniki, Greece, April 2002.
Guha, S., Rastogi, R. and Shim, K.: CURE: An efficient clustering algorithm for large databases, In: Proceedings of ACM SIGMOD International Conference on Management of Data, New York, pp. 73–84, 1998.
Duda, O., Hart, P. E. and Stork, D. G.: Pattern Classi¢cation, John Wiley & Sons, USA, 2001.
Fisher, R. A.: The use of multiple measure in taxonomic problems, Ann. Eugenics, 7(Part II) (1936), 179–188.
Bezdek, J. C. and Pal, N. R.: Some new indexes of cluster validity, IEEETrans.onSystem, Man, andCybernetics, 28(3) (1998), 301–315.
Alon, U., Barkai, N., Notterman, D. A., Gish, G., Ybarra, S., Mack, D. and Levine, A. J.: Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by olignucleotide arrays, Proc.Natl.Acad.Sci., 96 (1999), 6745–6750.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Wu, S., Chow, T.W.S. Self-Organizing-Map Based Clustering Using a Local Clustering Validity Index. Neural Processing Letters 17, 253–271 (2003). https://doi.org/10.1023/A:1026083612746
Issue Date:
DOI: https://doi.org/10.1023/A:1026083612746