Skip to main content

Input Noise Robustness and Sensitivity Analysis to Improve Large Datasets Clustering by Using the GRID

  • Conference paper
Discovery Science (DS 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5255))

Included in the following conference series:

Abstract

In this paper we investigate the performance of a refined version of the Kohonen self organizing feature maps algorithm in terms of classification correctness when we inject in a sparse input matrix different kinds of noise and compared these classification results with the one without noise. The analysis not only gives indications on the classification errors due to noisy data, but also let a methodology to emerge in order to identify the portion of the input matrix that must be controlled with great care for avoiding classification errors. The methodology also suggests a suitable data partitioning approach for a GRID implementation of the described algorithm. The methodological indications were successfully verified by a case study belonging to the bioinformatics field.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhao, Y., Karypis, G.: Data clustering in life science. Molecular Biotechnology 31(1), 55–80 (2005)

    Article  Google Scholar 

  2. Xu, R., Wunsch, D.: Survey of Clustering Algorithms. IEEE Transactions on Neural Networks 16(3) (2005)

    Google Scholar 

  3. Tirozzi, B., Bianchi, D., Ferraro, E.: Introduction to computational neurobiology and clustering. World Scientific, Singapore (2007)

    MATH  Google Scholar 

  4. Kohonen, T.: Self Organizing Maps. Springer, Heidelberg (1995)

    Book  Google Scholar 

  5. Kaski, S., Kangas, J., Kohonen, T.: Bibliography of self organizing map (SOM) papers: 1981 – 1997. Neural Computing Survey 1(3), 102–350 (1998)

    Google Scholar 

  6. Oja, M., Kaski, S., Kohonen, T.: Bibliography of self organizing map (SOM) papers: 1998 – 2001 Addendum. Neural Computing Survey 3(1), 1–156 (2003)

    Google Scholar 

  7. Cottrel, M., Fort, J.C., Letremy, P.: Advantages and drawbacks of the batch Kohonen Algorithm. In: 10th European Symposium On Artificial Neural Network, Bruges, Belgium, pp. 223–230 (2005)

    Google Scholar 

  8. Faro, A., Giordano, D., Maiorana, F.: Discovering complex regularities by adaptive Self Organizing classification. Enformatika I, 27–30 (2005)

    Google Scholar 

  9. Faro, A., Giordano, D., Maiorana, F.: Discovering complex regularities from tree to semi – lattice classifications. International Journal of Computational Intelligence 2(1), 34–39 (2005)

    Google Scholar 

  10. Beck, S., Ghosh, J.: Noise sensitivity of static neural network classifiers. In: Rogers, S.K. (ed.) Proceding of SPIE Conference on applications of Artificial Neural Networks III, vol. 1709, pp. 770–779 (1992)

    Google Scholar 

  11. Derks, E.P.P.A., Pastor, M.S.S., Buydens, L.M.C.: Robustness analysis of radial base function and multi-layered feed forward neural network models. Chemometrics and Intelligent Lab. System 28(1), 46–60 (1995)

    Google Scholar 

  12. Liu, Y., Weisberg, R.H., Mooers, C.N.K.: Performance evaluation of the self organizing map for feature extraction. Journal of geophysical Research 111 (2006)

    Google Scholar 

  13. Chen, G., Jaradat, S.A., Banerjee, N.: Evaluation and comparison of clustering algorithms in analyzing cell gene expression data. Statistica Sinica 12, 241–262 (2002)

    MathSciNet  MATH  Google Scholar 

  14. Bittner, M., et al.: Molecular classification of cutaneous malignant melanoma by gene expression profiling. Nature 406, 536–540 (2000)

    Article  Google Scholar 

  15. Mangiameli, P., Chen, S.K., West, D.: A comparison of SOM neural network and hierarchical clustering methods. Eur. Journal of Operational Research 93(2) (1996)

    Google Scholar 

  16. Faro, A., Giordano, D., Maiorana, F., Spampinato, C.: Discovering Genes–Diseases Associations from Specialized Literature using the GRID. IEEE Transactions on Information Technology in Biomedicine 18(5) (2008)

    Google Scholar 

  17. Faro, A., Giordano, D., Maiorana, F.: Optimizing the execution of parallel clustering algorithms over the GRID (2007), http://www.ing.campusone.it/FGM01.pdf

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer Berlin Heidelberg

About this paper

Cite this paper

Faro, A., Giordano, D., Maiorana, F. (2008). Input Noise Robustness and Sensitivity Analysis to Improve Large Datasets Clustering by Using the GRID. In: Jean-Fran, JF., Berthold, M.R., Horváth, T. (eds) Discovery Science. DS 2008. Lecture Notes in Computer Science(), vol 5255. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88411-8_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-88411-8_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-88410-1

  • Online ISBN: 978-3-540-88411-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics