Abstract
The problem of selection of variables seems to be the key issue in classification of multi-dimensional objects. An optimal set of features should be made of only those variables, which are essential for the differentiation of studied objects. This selection may be made easier if a graphic analysis of an U-matrix is carried out. It allows to easily identify variables, which do not differentiate the studied objects. A graphic analysis may, however, not suffice to analyse data when an object is described with hundreds of variables. The authors of the paper propose a procedure which allows to eliminate variables with the smallest discriminating potential based on the measurement of concentration of objects on the Kohonen self organising map networks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
DEBOECK G., KOHONEN T. (1998), Visual explorations in finance with Self-Organizing Maps, Springer-Verlag, London.
GNANADESIKAN R., KETTENRING J.R., TSAO S.L. (1995), Weighting and selection of variables for cluster analysis, Journal of Classification, vol. 12, p. 113-136.
GORDON A.D. (1999), Classification , Chapman and Hall / CRC, London, p.3 KOHONEN T. (1997), Self-Organizing Maps, Springer Series in Information Sciences, Springer-Verlag, Berlin Heidelberg.
MILLIGAN G.W., COOPER M.C. (1985), An examination of procedures for determining the number of clusters in data set. Psychometrika, 50(2), p. 159-179.
MILLIGAN G.W. (1994), Issues in Applied Classification: Selection of Variables to Cluster, Classification Society of North America News Letter, November Issue 37.
MILLIGAN G.W. (1996), Clustering validation: Results and implications for applied analy-ses. In Phipps Arabie, Lawrence Hubert & G. DeSoete (Eds.), Clustering and classifica-tion, River Edge, NJ: World Scientific, p. 341-375.
MIGDAĐ NAJMAN K., NAJMAN K. (2003), Zastosowanie sieci neuronowej typu SOM w badaniu przestrzennego zróŻnicowania powiatów, Wiadomosci Statystyczne, 4/2003, p. 72-85.
ROUSSEEUW P.J. (1987), Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, p. 53-65.
VESANTO J. (1997), Data Mining Techniques Based on the Self Organizing Map, Thesis for the degree of Master of Science in Engineering, Helsinki University of Technology.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Najman, K.M., Najman, K. (2008). Applying the Kohonen Self-Organizing Map Networks to Select Variables. In: Preisach, C., Burkhardt, H., Schmidt-Thieme, L., Decker, R. (eds) Data Analysis, Machine Learning and Applications. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78246-9_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-78246-9_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78239-1
Online ISBN: 978-3-540-78246-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)