Abstract
Clustering techniques such as hierarchical clustering, the k-means algorithm, and self-organizing maps are widely used to analyze gene expression data. The results of these algorithms depend on several parameters, e.g., the number of clusters. However, there is no theoretical criterion for determining such parameters. To overcome this problem, we propose a method that uses a mixture of PCA models trained by variational Bayes (VB) estimation. In our method, good clustering results are selected based on the free energy obtained within the VB estimation. Furthermore, by taking an ensemble of estimation results, robust clustering is achieved without any biological knowledge. We apply our method to the clustering of gene expression data during sporulation of Bacillus subtilis, and it captures characteristics of the sigma cascade.
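The selection and ensemble steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the ensemble is formed, so a co-association (consensus) matrix is used here as one common choice. The sketch also assumes the free energy is a lower bound on the log evidence, so that higher values are better. The names `select_runs` and `co_association` and the toy values are hypothetical.

```python
from typing import List, Sequence, Tuple

# One clustering run: (free energy, cluster label assigned to each gene).
Run = Tuple[float, Sequence[int]]


def select_runs(runs: List[Run], top_k: int) -> List[Run]:
    """Keep the top_k runs with the highest free energy.

    Assumption: the free energy is a lower bound on the log evidence,
    so a higher value indicates a better model.
    """
    return sorted(runs, key=lambda run: run[0], reverse=True)[:top_k]


def co_association(runs: List[Run]) -> List[List[float]]:
    """Fraction of runs in which each pair of genes shares a cluster.

    Cluster labels are compared only within a run, so the result does
    not depend on how each run happens to number its clusters.
    """
    n_genes = len(runs[0][1])
    counts = [[0] * n_genes for _ in range(n_genes)]
    for _, labels in runs:
        for i in range(n_genes):
            for j in range(n_genes):
                if labels[i] == labels[j]:
                    counts[i][j] += 1
    return [[c / len(runs) for c in row] for row in counts]


# Hypothetical toy input: four runs over four genes.
runs = [
    (-120.5, [0, 0, 1, 1]),
    (-118.2, [1, 1, 0, 0]),
    (-119.0, [0, 0, 0, 1]),
    (-150.3, [0, 1, 0, 1]),
]
best = select_runs(runs, top_k=2)
consensus = co_association(best)
```

Gene pairs with a consensus value near 1 are grouped together by almost every selected run, while values near 0 mark pairs that are consistently separated. A final clustering could then be read off this matrix, for example by hierarchical clustering on one minus the consensus value.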
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yoshioka, T., Morioka, R., Kobayashi, K., Oba, S., Ogasawara, N., Ishii, S. (2002). Clustering of Gene Expression Data by Mixture of PCA Models. In: Dorronsoro, J.R. (eds) Artificial Neural Networks — ICANN 2002. ICANN 2002. Lecture Notes in Computer Science, vol 2415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46084-5_85
DOI: https://doi.org/10.1007/3-540-46084-5_85
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44074-1
Online ISBN: 978-3-540-46084-8
eBook Packages: Springer Book Archive