Abstract
The development of novel high-throughput experimental techniques makes it possible to comprehensively analyze biological data in health and disease. However, a large amount of data generated results in dramatic data-analytic challenges in discovery of ‘signature’ molecules, which are specific to different biological conditions (e.g. normal vs. disease, treated vs. untreated). Current statistical methods are effective only in the case their hypothesis can be matched. In this paper, we apply an ensemble statistical method to infer significant molecules. In our approach, four well-done and well-understanding statistical techniques had been used for the analysis to the experimental data, and then the results will be collected into an ensemble framework to find the high confident “significant” molecules which can distinguish the different experimental conditions. We evaluate the performance of our approach on a test dataset which deposited on GEO database with an access number of GSE45114.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baumgartner, C., Osl, M., Netzer, M., Baumgartner, D.: Bioinformatic-driven search for metabolic biomarkers in disease. J. Clin Bioinformatics 1(2) (2011), doi:10.1186/2043-9113-1181-1182
Wang, B., Chen, P., Wang, P., Zhao, G., Zhang, X.: Radial basis function neural network ensemble for predicting protein-protein interaction sites in heterocomplexes. Protein Pept. Lett. 17(9), 1111–1116 (2010)
Huang, D.S., Zheng, C.H.: Independent component analysis-based penalized discriminant method for tumor classification using gene expression data. Bioinformatics 22(15), 1855–1862 (2006)
Wang, B., Chen, P., Zhang, J., Zhao, G., Zhang, X.: Inferring protein-protein interactions using a hybrid genetic algorithm/support vector machine method. Protein Pept. Lett. 17(9), 1079–1084 (2010)
Zheng, C.H., Huang, D.S., Zhang, L., Kong, X.Z.: Tumor clustering using nonnegative matrix factorization with gene selection. IEEE Trans. Inf. Technol. Biomed. 13(4), 599–607 (2009)
Wang, B., Wong, H.S., Huang, D.S.: Inferring protein-protein interacting sites using residue conservation and evolutionary information. Protein Pept. Lett. 13(10), 999–1005 (2006)
Zhang, F., Chen, J.Y.: Data mining methods in Omics-based biomarker discovery. Methods in Molecular Biology 719, 511–526 (2011)
Kwon, S., Cui, J., Rhodes, S.L., Tsiang, D., Rotter, J.I., Guo, X.: Application of Bayesian classification with singular value decomposition method in genome-wide association studies. BMC Proceedings 3(suppl. 7), S9 (2009)
Deng, X., Geng, H., Ali, H.H.: Cross-platform analysis of cancer biomarkers: a Bayesian network approach to incorporating mass spectrometry and microarray data. Cancer Informatics 3, 183–202 (2007)
Su, Y.H., Shen, J., Qian, H.G., Ma, H.C., Ji, J.F., Ma, H., Ma, L.H., Zhang, W.H., Meng, L., Li, Z.F., Wu, J., Jin, G.L., Zhang, J.Z., Shou, C.C.: Diagnosis of gastric cancer using decision tree classification of mass spectral data. Cancer Sci. 98(1), 37–43 (2007)
Wang, H.Q., Wong, H.S., Zhu, H., Yip, T.T.: A neural network-based biomarker association information extraction approach for cancer classification. Journal of Biomedical Informatics 42(4), 654–666 (2009)
Chi, C.L., Street, W.N., Wolberg, W.H.: Application of artificial neural network-based survival analysis on two breast cancer datasets. In: AMIA.. Annual Symposium Proceedings/AMIA Symposium, pp. 130–134 (2007)
Amiri, Z., Mohammad, K., Mahmoudi, M., Zeraati, H., Fotouhi, A.: Assessment of gastric cancer survival: using an artificial hierarchical neural network. Pak J. Biol. Sci. 11(8), 1076–1084 (2008)
Brunner, E., Munzel, U.: The nonparametric Behrens-Fisher problem: Asymptotic theory and a small-sample approximation. Biometrical Journal 42(1), 17–25 (2000)
Baumgartner, W., Weiss, P., Schindler, H.: A nonparametric test for the general two-sample problem. Biometrics 54(3), 1129–1135 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, B., Ji, Z. (2013). Disease-Related Gene Expression Analysis Using an Ensemble Statistical Test Method. In: Huang, DS., Jo, KH., Zhou, YQ., Han, K. (eds) Intelligent Computing Theories and Technology. ICIC 2013. Lecture Notes in Computer Science(), vol 7996. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39482-9_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-39482-9_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39481-2
Online ISBN: 978-3-642-39482-9
eBook Packages: Computer ScienceComputer Science (R0)