Abstract
Molecular predictor is a new tool for disease diagnosis, which uses gene expression to classify the diagnostic category of a patient. The statistical challenge for constructing such a predictor is that there are thousands of genes to predict for disease category, but only a small number of samples are available. Here we proposed a gene network modules-based linear discriminant analysis (MLDA) approach by integrating ‘essential’ correlation structure among genes into the predictor in order that the module or cluster structure of genes, which is related to diagnostic classes we look for, can have potential biological interpretation. We evaluated performance of the new method with other established classification methods using three real data sets. Our results show that the new approach has the advantage of computational simplicity and efficiency with lower classification error rates than the compared methods in most cases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–536 (1999)
Radmacher, M.D., McShane, L.M., Simon, R.: A paradigm for class prediction using gene expression profiles. J. Comput. Biol. 9, 505–512 (2002)
Dudoit, S., Fridlyand, J., Speed, T.P.: Comparison of discrimination methods for the classification of tumors using gene expression data. Journal of the American Statistical Association 97, 77–87 (2002)
Guo, Y., Hastie, T., Tibshirani, R.: Regularized linear discriminant analysis and its application in microarrays. Biostatistics 8, 86–100 (2007)
Li, H., Hong, F.: Cluster-Rasch models for microarray gene expression data. Genome Biol. 2, 0031.1–0031.13 (2001)
Hastie, T., Tibshirani, R., Botstein, D., Brown, P.: Supervised harvesting of expression trees. Genome Biol. 2, 0003.1–0003.12 (2001)
Dettling, D., Bühlmann, P.: Supervised Clustering of Genes. Genome Biol. 3, 0069.1–0069.15 (2002)
Yu, X.: Regression methods for microarray data. Ph.D. thesis, Stanford University (2005)
Elo, L., Jarvenpaa, H., Oresic, M., Lahesmaa, R., Aittokallio, T.: Systematic construction of gene coexpression networks with applications to human T helper cell differentiation process. Bioinformatics 23, 2096–2103 (2007)
Presson, A., Sobel, E., Papp, J., Suarez, C., Whistler, T., Rajeevan, M., Vernon, S., Horvath, S.: Integrated weighted gene co-expression network analysis with an application to chronic fatigue syndrome. BMC Syst. Biol. 2, 95 (2008)
Horvath, S., Dong, J.: Geometric interpretation of gene coexpression network analysis. PLoS Comput. Biol. 4, e1000117 (2008)
Taylor, I.W., Linding, R., Warde-Farley, D., Liu, Y., Pesquita, C., Faria, D., Bull, S., Pawson, T., Morris, Q., Wrana, J.L.: Dynamic modularity in protein interaction networks predicts breast cancer outcome. Nat Biotechnol. 27, 199–204 (2009)
Jolliffe, I.T.: Principal component analysis. Springer, New York (2002)
Tibshirani, R., Wasserman, L.: Correlation-sharing for detection of differential gene expression. arXiv, math. ST, math/0608061 (2006)
Tusher, V., Tibshirani, R., Chu, G.: Significance analysis of microarrays applied to the ionizing radiation response. Proc. Natl. Acad. Sci. USA 98, 5116–5121 (2001)
Irizarry, R.A., Bolstad, B.M., Collin, F., Cope, L.M., Hobbs, B., Speed, T.P.: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Research 31, E15 (2003)
Alon, U., Barkai, N., Notterman, D.A., Gish, K., Ybarra, S., Mack, D., Levine, A.J.: Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Natl. Acad. Sci. USA 96, 6745–6750 (1999)
Stuart, R.O., Wachsman, W., Berry, C.C., Wang-Rodriguez, J., Wasserman, L., Klacansky, I., Masys, D., Arden, K., Goodison, S., McClelland, M., Wang, Y., Sawyers, A., Kalcheva, I., Tarin, D., Mercola, D.: In silico dissection of cell-type-associated patterns of gene expression in prostate cancer. Proc. Natl. Acad. Sci. USA 101, 615–620 (2004)
Spira, A., Beane, J.E., Shah, V., Steiling, K., Liu, G., Schembri, F., Gilman, S., Dumas, Y.M., Calner, P., Sebastiani, P., Sridhar, S., Beamis, J., Lamb, C., Anderson, T., Gerry, N., Keane, J., Lenburg, M.E., Brody, J.S.: Airway epithelial gene expression in the diagnostic evaluation of smokers with suspect lung cancer. Nat. Med. 13, 361–366 (2007)
Antoniadis, A., Lambert-Lacroix, S., Leblanc, F.: Effective dimension reduction methods for tumor classification using gene expression data. Bioinformatics 19, 563–570 (2003)
Shen, R., Ghosh, D., Chinnaiyan, A.M., Meng, Z.: Eigengene based linear discriminant model for gene expression data analysis. Bioinformatics 22, 2635–2642 (2006)
Pang, H., Tong, T., Zhao, H.: Shrinkage-based diagonal discriminant analysis and its applications in high-dimensional data. Biometrics 65, 1021–1029 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hu, P., Bull, S., Jiang, H. (2011). Gene Network Modules-Based Liner Discriminant Analysis of Microarray Gene Expression Data. In: Chen, J., Wang, J., Zelikovsky, A. (eds) Bioinformatics Research and Applications. ISBRA 2011. Lecture Notes in Computer Science(), vol 6674. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21260-4_28
Download citation
DOI: https://doi.org/10.1007/978-3-642-21260-4_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21259-8
Online ISBN: 978-3-642-21260-4
eBook Packages: Computer ScienceComputer Science (R0)