Abstract
Clustering, which has been widely used as a forecasting tool for gene expression data, remains problematic at a very deep level: different initial points of clustering lead to different processes of convergence. However, the setting of initial points is mainly dependent on the judgments of experimenters. This subjectivity brings problems, including local minima and an extra computing consumption when bad initial points are selected. Hence, spatial contiguity analysis has been implemented to reduce the subjectivity of clustering. Data points near the cluster centroids are selected as initial points in this paper. This accelerates the process of convergence, and avoids the local minima. The proposed approach has been validated on benchmark datasets, and satisfactory results have been obtained.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Yeung, K.Y., Haynor, D.R., Ruzzo, W.L.: Validating clustering for gene expression data. Bioinformatics 17(4), 309–318 (2001)
Cho, R.J., Campbell, M.J., Winzeler, E.A., et al.: A genome-wide transcriptional analysis of the mitotic cell cycle. Molecular Cell 2, 65–73 (1998)
Bolshakova, N., Azuaje, F.: Cluster validation techniques for genome expression data. Signal Processing 83, 825–833 (2003)
Handl, J., Knowles, J., Kell, D.B.: Computation cluster validation in post-genomic data analysis. Bioinformatics 21(15), 3201–3212 (2005)
Dougherty, E.R., Barrera, J., Brun, M., et al.: Inference from clustering with application to gene-expression microarray. J. Comput. Biol. 9(1), 105–126 (2002)
Xu, R., Wunsch II, D.: Survey of Clustering Algorithms. IEEE Trans. on Neural Networks 16(3), 645–678 (2003)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surveys 31(3), 264–323 (1999)
D’haeseleer, P.: How does gene expression clustering work? Nature Biotechnology 23(12), 1499–1501 (2005)
Dougherty, E.R., Brun, M.: A probabilistic theory of clustering. Pattern Recognition 37, 917–925 (2004)
Bezdek, J.: Cluster validity with fuzzy sets. J. Cybernt. 3(3), 58–72 (1974)
Bezdek, J., Hathaway, R., et al.: Convergence theory for fuzzy c-means: counterexamples and repairs. IEEE Transactions on Systems, Man and Cybernetics 17(15), 873–877 (1987)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yi, H., Song, X., Jiang, B., Liu, Y. (2011). Reducing the Subjectivity of Gene Expression Data Clustering Based on Spatial Contiguity Analysis. In: Kim, Th., et al. Database Theory and Application, Bio-Science and Bio-Technology. BSBT DTA 2011 2011. Communications in Computer and Information Science, vol 258. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27157-1_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-27157-1_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27156-4
Online ISBN: 978-3-642-27157-1
eBook Packages: Computer ScienceComputer Science (R0)