Abstract
We present a multivariate method to find genes with correlated expressions across the samples. Our contributions in this study are three-fold: firstly, we develop a difference vector-based technique which unfolds hidden correlations over a subset of genes, secondly, we present a similarity measure which enables grouping of gene expressions based on local similarity regardless of global distance, and thirdly, we devise visualization tools that are useful for conducting an ‘explainable’ analysis. Integrating these techniques with the spectral clustering algorithm, biomarker genes can be effectively identified. We have evaluated our method on six microarray datasets that are widely used as a testbed. When we apply our method in the sample classification problem as well as gene selection, we can successfully explain the source of misclassification by showing the correlation patterns for a subset of genes with the aid of the visualization tools.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abeel, T., Helleputte, T., Van de Peer, Y., Dupont, P., Saeys, Y.: Robust biomarker identification for cancer diagnosis with ensemble feature selection methods. Bioinformatics 26, 392–398 (2010)
Alon, U., Barkai, N., Notterman, D.A., Gish, K., Ybarra, S., Mack, D., Levine, A.J.: Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Natl. Acad. Sci. USA 96, 6745–6750 (1999)
Brunet, J.P., Tamayo, P., Golub, T.R., Mesirov, J.P.: Metagenes and molecular pattern discovery using matrix factorization. Proc. Natl. Acad. Sci. USA 101, 4164–4169 (2004)
Chu, W., Ghahramani, Z., Falciani, F., Wild, D.L.: Biomarker discovery in microarray gene expression data with Gaussian processes. Bioinformatics 21, 3385–3393 (2005)
Guyon, I., Elisseeff, A.: An introduction to variables and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
Ng, A., Jordan, M., Weiss, Y.: On spectral clustering: analysis and an algorithm. In: Diettrich, T., Baker, S., Ghahramani, Z. (eds.) Advances in Neural Information Processing Systems 14. MIT Press (2002)
Singh, D., Febbo, P.G., Ross, K., Jackson, D.G., Manola, J., Ladd, C., Tamayo, P., Renshaw, A.A., D’Amico, A.V., Rich, J.P.: Gene expression correlates of clinical prostate cancer behavior. Cancer Cell 1, 203–209 (2002)
Statnikov, A., Aliferis, C.F., Tsamardinos, I., Hardin, D., Levy, S.: A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnois. Bioinformatics 21, 631–643 (2005)
Stekel, D.: Analysis of relations between genes, tissues or treatments. In: Microarray Bioinformatics, ch. 8. Cambridge University Press, Cambridge (2004)
Xu, L., Tan, A.C., Naiman, D.Q., Geman, D., Winslow, R.L.: Robust prostate cancer marker genes emerge from direct integration of inter-study microarray data. Bioinformatics 21, 3905–3911 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pok, G., Jin, C., Namsrai, OE., Ryu, K.H. (2012). Detection of Correlated Microarray Expressions Using Difference Values. In: Lee, G., Howard, D., Ślęzak, D., Hong, Y.S. (eds) Convergence and Hybrid Information Technology. ICHIT 2012. Communications in Computer and Information Science, vol 310. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32692-9_65
Download citation
DOI: https://doi.org/10.1007/978-3-642-32692-9_65
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32691-2
Online ISBN: 978-3-642-32692-9
eBook Packages: Computer ScienceComputer Science (R0)