Skip to main content

Clustering Massive High Dimensional Data with Dynamic Feature Maps

  • Conference paper
Neural Information Processing (ICONIP 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4233))

Included in the following conference series:

  • 1011 Accesses

Abstract

This paper presents an algorithm based on the Growing Self Organizing Map (GSOM) called the High Dimensional Growing Self Organizing Map with Randomness (HDGSOMr) that can cluster massive high dimensional data efficiently. The original GSOM algorithm is altered to accommodate for the issues related to massive high dimensional data. These modifications are presented in detail with experimental results of a massive real-world dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Kohonen, T.: Self Organized formation of Topological Correct Feature Maps. Biological Cybernetics 43, 59–69 (1982)

    Article  MATH  MathSciNet  Google Scholar 

  2. Kohonen, T.: Self Organizing Maps, 3rd edn. Springer, Heidelberg (2001)

    MATH  Google Scholar 

  3. Su, M.-C., Liu, T.-K., Chang, H.-T.: Improving the Self-Organizing Feature Map Algorithm Using an Efficient Initialization Scheme. Tamkang Journal of Science and Engineering 5(1), 35–48 (2002)

    Google Scholar 

  4. Alahakoon, D., Halgamuge, S.K., Sirinivasan, B.: Dynamic Self Organizing Maps With Controlled Growth for Knowledge Discovery. IEEE Transactions on Neural Networks, Special Issue on Knowledge Discovery and Data Mining 11(3), 601–614 (2000)

    Google Scholar 

  5. Fritzke, B.: Growing cell structures – a self-organizing network for unsupervised and supervised learning. Neural Networks 7(9), 1441–1460 (1994)

    Article  Google Scholar 

  6. Fritzke, B.: Growing grid-a self-organizing network with constant neighborhood range and adaptation strength. Neural Processing Letters 2(5), 9–13 (1995)

    Article  Google Scholar 

  7. Fritzke, B.: A growing neural gas network learns topologies. In: Tesauro, G., Touretzky, D.S., Leen, T.K. (eds.) Advances in Neural Information Processing Systems 7, pp. 625–632. MIT Press, Cambridge (1995)

    Google Scholar 

  8. Rauber, A., Merkl, D., Dittenbach, M.: The Growing Hierarchical Self-Organizing Map: Exploratory Analysis of High Dimensional Data. IEEE Transactions on Neural Networks 13(6), 1331–1341 (2002)

    Article  Google Scholar 

  9. Kaski, S.: Fast winner search for SOM-based monitoring and retrieval of high-dimensional data. In: Artificial Neural Networks, 1999. ICANN 1999. Ninth International Conference on (Conf. Publ. No. 470), IEE (1999)

    Google Scholar 

  10. Kohonen, T.: Fast Evolutionary Learning with Batch-Type Self-Organizing Maps. Neural Processing Letters 9(2), 153–162 (1999)

    Article  Google Scholar 

  11. Kaski, S., et al.: WEBSOM- Self Organizing maps of document collections. Neurocomputing 21, 101–117 (1998)

    Article  MATH  Google Scholar 

  12. Amarasiri, R., et al.: Enhancing Clustering Performance of Feature Maps Using Randomness. In: Workshop on Self Organizing Maps (WSOM), Paris, France (2005)

    Google Scholar 

  13. Amarasiri, R., et al.: HDGSOMr: A High Dimensional Growing Self Organizing Map Using Randomness for Efficient Web and Text Mining. In: IEEE/ACM/WIC Conference on Web Intelligence (WI), Paris, France (2005)

    Google Scholar 

  14. Wikipedia, Randomness (2005)

    Google Scholar 

  15. Holland, J.: Genetic algorithms and the optimal allocations of trials. SIAM Journal of Computing 2(2), 88–105 (1973)

    Article  MATH  MathSciNet  Google Scholar 

  16. Kirkpatrick, S., Gelatt Jr., C.D., Vecchi, M.P.: Optimization by Simulated Annealing. Science 220(4598), 671–680 (1983)

    Article  MathSciNet  Google Scholar 

  17. Amarasiri, R., Ceddia, J., Alahakoon, D.: Exploratory Data Mining Lead by Text Mining Using a Novel High Dimensional Clustering Algorithm. In: International Conference on Machine Learning and Applications, IEEE, LA, USA (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Amarasiri, R., Alahakoon, D., Smith-Miles, K. (2006). Clustering Massive High Dimensional Data with Dynamic Feature Maps. In: King, I., Wang, J., Chan, LW., Wang, D. (eds) Neural Information Processing. ICONIP 2006. Lecture Notes in Computer Science, vol 4233. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11893257_90

Download citation

  • DOI: https://doi.org/10.1007/11893257_90

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-46481-5

  • Online ISBN: 978-3-540-46482-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics