Abstract
Here we introduce VSOM, an efficient implementation of stochastic training for self-organizing maps. We derive VSOM from the standard stochastic training algorithm as published by Kohonen by replacing all iterative constructs in the algorithm with vector and matrix operations. Our novel implementation based on these vector and matrix operations provides substantial performance increases over Kohonen’s iterative algorithm as well as batchSOM, currently the fastest implementation of Self-organizing maps (SOM) training without resorting to multi-processing. The quality of the maps produced by VSOM matches the quality of the maps produced by the original iterative algorithm and outperforms the quality of the maps produced by batchSOM. In its current incarnation VSOM is single threaded and therefore well suited as a replacement for iterative stochastic training of self-organizing maps in R since R does not support multi-threading well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Miller, A., Coe, M.: Star/galaxy classification using kohonen self-organizing maps. Mon. Not. R. Astron. Soc. 279(1), 293–300 (1996)
Lee, C.-H.: Mining text documents for thematic hierarchies using self-organizing maps. In: Opportunities and Challenges, Data Mining (2003)
Hadzic, F., Dillon, T.S., Tan, H.: Outlier detection strategy using the self-organizing map. In: Knowledge Discovery and Data Mining: Challenges and Realities, pp. 224–243 (2007)
Kihara, D.: Characterization and classification of local protein surfaces using self-organizing map. In: Computational Knowledge Discovery for Bioinformatics Research (2012)
Alakhdar, Y., MartĂnez-MartĂnez, J.M., GuimerĂ -Tomás, J., Escandell-Montero, P., Benitez, J., Soria-Olivas, E.: Visual data mining in physiotherapy using self-organizing maps: a new approximation to the data analysis. In: Data Mining: Concepts, Methodologies, Tools, and Applications, pp. 650–657. IGI Global (2013)
Kohonen, T.: Self-organizing Maps. Springer, Berlin (2001)
Cottrell, M., Olteanu, M., Rossi, F., Villa-Vialaneix, N.: Theoretical and applied aspects of the self-organizing maps. In: Advances in Self-Organizing Maps and Learning Vector Quantization, pp. 3–26. Springer (2016)
Fort, J.-C., Letremy, P., Cottrell, M.: Advantages and drawbacks of the batch kohonen algorithm. ESANN 2, 223–230 (2002)
Hamel, L., Ott, B., Breard, G.: popsom: Functions for Constructing and Evaluating Self-Organizing Maps. R package version 4.0.1 (2016). https://CRAN.R-project.org/package=popsom
Hamel, L., Brown, C.W.: Improved interpretability of the unified distance matrix with connected components. In: 7th International Conference on Data Mining (DMIN’11), pp. 338–343 (2011)
A. Ultsch, “Clustering wih som: U * c. In: Proceedings of the 5th Workshop on Self-Organizing Maps, vol. 2, pp. 75–82 (2005)
Hamel, L.: SOM quality measures: an efficient statistical approach. In: Advances in Self-organizing Maps and Learning Vector Quantization, pp. 49–59. Springer (2016)
Yin, H., Allinson, N.M.: On the distribution and convergence of feature space in self-organizing maps. Neural Comput. 7(6), 1178–1187 (1995)
Yan, J.: SOM: Self-Organizing Map. R package version 0.3-5 (2010). https://CRAN.R-project.org/package=som
Venables, W.N., Ripley, B.D.: Modern Applied Statistics with S, 4th edn. Springer, New York (2002). iSBN 0-387-95457-0. http://www.stats.ox.ac.uk/pub/MASS4
Fisher, R.A.: The use of multiple measurements in taxonomic problems. Ann. Eugenics 7(2), 179–188 (1936)
Thall, P.F., Vail, S.C.: Some covariance models for longitudinal count data with overdispersion. Biometrics 46, 657–671 (1990)
Street, W.N., Wolberg, W.H., Mangasarian, O.L.: Nuclear feature extraction for breast tumor diagnosis. In: IS&T/SPIE’s Symposium on Electronic Imaging: Science and Technology, pp. 861–870. International Society for Optics and Photonics (1993)
Lawrence, R.D., Almasi, G.S., Rushmeier, H.E.: A scalable parallel algorithm for self-organizing maps with applications to sparse data mining problems. Data Mining Knowl. Discov. 3(2), 171–195 (1999)
Wittek, P., Gao, S.C., Lim, I.S., Zhao, L.: Somoclu: an efficient parallel library for self-organizing maps (2013). arXiv preprint arXiv:1305.1422
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Hamel, L. (2019). VSOM: Efficient, Stochastic Self-organizing Map Training. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Systems and Applications. IntelliSys 2018. Advances in Intelligent Systems and Computing, vol 869. Springer, Cham. https://doi.org/10.1007/978-3-030-01057-7_60
Download citation
DOI: https://doi.org/10.1007/978-3-030-01057-7_60
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01056-0
Online ISBN: 978-3-030-01057-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)