Abstract
In this paper we investigate the performance of a refined version of the Kohonen self-organizing feature map algorithm in terms of classification correctness when different kinds of noise are injected into a sparse input matrix, and we compare these classification results with those obtained without noise. The analysis not only gives indications of the classification errors due to noisy data, but also lets a methodology emerge for identifying the portion of the input matrix that must be controlled with great care to avoid classification errors. The methodology also suggests a suitable data partitioning approach for a GRID implementation of the described algorithm. The methodological indications were successfully verified in a case study from the bioinformatics field.
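The kind of experiment summarized above can be illustrated with a minimal sketch. It does not reproduce the refined algorithm studied in the paper: it assumes a plain one-dimensional online Kohonen map, a binary sparse input matrix, and two illustrative noise models (Gaussian perturbation of the non-zero entries and random bit flips). All function names, parameters, and data below are hypothetical. The sketch trains the map on clean data, classifies clean and noisy inputs with the same map, and reports the fraction of rows whose assigned unit changes; a per-column variant hints at how sensitive portions of the input matrix might be located.

```python
import numpy as np

def train_som(data, n_units=4, epochs=30, lr0=0.5, sigma0=1.0, seed=0):
    """Train a tiny 1-D SOM with the plain online Kohonen rule (an assumption)."""
    data = np.asarray(data, dtype=float)
    rng = np.random.default_rng(seed)
    # Initialise unit weights from randomly chosen input rows.
    weights = data[rng.choice(len(data), size=n_units, replace=False)].copy()
    positions = np.arange(n_units)
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # linearly decaying learning rate
        sigma = max(sigma0 * (1.0 - frac), 0.1)  # shrinking neighbourhood width
        for i in rng.permutation(len(data)):
            x = data[i]
            bmu = np.argmin(np.linalg.norm(weights - x, axis=1))
            h = np.exp(-((positions - bmu) ** 2) / (2.0 * sigma ** 2))
            weights += lr * h[:, None] * (x - weights)
    return weights

def classify(weights, data):
    """Return the index of the best-matching unit for every row."""
    data = np.asarray(data, dtype=float)
    dists = np.linalg.norm(data[:, None, :] - weights[None, :, :], axis=2)
    return np.argmin(dists, axis=1)

def inject_noise(data, kind, level, rng):
    """Return a noisy copy of a binary matrix (both noise models are illustrative)."""
    noisy = np.asarray(data, dtype=float).copy()
    if kind == "gaussian":
        mask = noisy != 0                        # perturb only the non-zero entries
        noisy[mask] += rng.normal(0.0, level, size=int(mask.sum()))
    elif kind == "flip":
        flips = rng.random(noisy.shape) < level  # flip each entry with prob. `level`
        noisy[flips] = 1.0 - noisy[flips]
    else:
        raise ValueError(f"unknown noise kind: {kind}")
    return noisy

def disagreement(weights, clean, noisy):
    """Fraction of rows assigned to a different unit once noise is added."""
    return float(np.mean(classify(weights, clean) != classify(weights, noisy)))

def column_sensitivity(weights, clean):
    """Disagreement caused by flipping one whole column at a time."""
    clean = np.asarray(clean, dtype=float)
    scores = np.zeros(clean.shape[1])
    for j in range(clean.shape[1]):
        perturbed = clean.copy()
        perturbed[:, j] = 1.0 - perturbed[:, j]
        scores[j] = disagreement(weights, clean, perturbed)
    return scores

if __name__ == "__main__":
    rng = np.random.default_rng(42)
    clean = (rng.random((60, 20)) < 0.1).astype(float)  # synthetic sparse matrix
    som = train_som(clean)
    for kind, level in [("gaussian", 0.2), ("flip", 0.05)]:
        noisy = inject_noise(clean, kind, level, rng)
        print(kind, level, disagreement(som, clean, noisy))
    print("most sensitive columns:", np.argsort(column_sensitivity(som, clean))[-3:])
```

Using one map for both the clean and the noisy inputs keeps the unit labels comparable. Retraining a separate map on the noisy data would require matching clusters across maps before any comparison.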
Copyright information
© 2008 Springer Berlin Heidelberg
About this paper
Cite this paper
Faro, A., Giordano, D., Maiorana, F. (2008). Input Noise Robustness and Sensitivity Analysis to Improve Large Datasets Clustering by Using the GRID. In: Boulicaut, JF., Berthold, M.R., Horváth, T. (eds) Discovery Science. DS 2008. Lecture Notes in Computer Science, vol 5255. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88411-8_23
DOI: https://doi.org/10.1007/978-3-540-88411-8_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88410-1
Online ISBN: 978-3-540-88411-8
eBook Packages: Computer Science, Computer Science (R0)