Abstract
In this study we perform wavelet transform on the analysis of DNA microarray data. A set of wavelet features is used to measure the change of gene expression profile. Then wavelet features are input to support vector machine (SVM) to classify DNA microarray data into different diagnostic classes. Experiments are carried out on six datasets of microarray data. On a wide range of data sets, our method displays a highly competitive accuracy in comparison to the best performance of other kinds of classification models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Roberts, C.J., Nelson, B., Marton, M.J., Stoughton, R., Meyer, M.R., Bennett, H.A., He, Y.D., Dai, H., Walker, W.L., Hughes, T.R., Tyers, M., Boone, C., Friend, S.H.: Signaling and circuitry of multiple mapk pathways revealed by a matrix of global gene expression profiles. Science 287, 873–880 (2000)
Zhang, L., Zhou, W., Velculescu, V.E., Kern, S.E., Hruban, R.H., Hamilton, S.R., Vogelstein, B., Kinzler, K.W.: Gene expression profiles in normal and cancer cells. Science 276, 1268–1272 (1997)
Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeeck, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)
Xiong, M., Jin, L., Li, W., Boerwinkle, E.: Computational methods for gene expressionbased tumor classification. Biotechniques 29, 1264–1270 (2000)
Baldi, P., Long, A.D.: A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes. Bionformatics 17, 509–519 (2001)
Huang, D.S., Zheng, C.H.: Independent component analysis-based penalized discriminant method for tumor classification using gene expression data. Bioinformatics 22, 1855–1862 (2006)
Li, S., Liao, C., Kwok, J.T.: Wavelet-based feature extraction for microarray data classification. In: Proceedings of International Joint Conference on Neural Networks, pp. 5028–5033 (2006)
Grossmann, A., Morlet, J.: Decomposition of Hardy functions into square integrable wavelets of constant shape SIAM. Journal of Mathematical Analysis 15, 723–736 (1984)
Daubechies, I.: Orthonormal bases of compactly supported wavelets. Communications on Pure and Applied Mathematics 41, 909–996 (1988)
Vapnik, V.N.: Statistical learning theory. Wiley, New York (1998)
Burges, C.: A Tutorial on Support Vector Machines for Pattern Recognition. Kluwer Academic Publishers, Dordrecht (1998)
Kyeong, E.L., Sha, N., Dougherty, E.R., Vannucci, M., Mallick, B.K.: Gene selection: a bayesian variable selection approach. Bioinformatics 19, 90–97 (2003)
Danh, V.N., David, M.R.: Tumor classification by partial least squares using microarray gene expression data. Bioinformatics 18, 39–50 (2002)
Bicciato, S., Luchini, A., Bello, C.D.: PCA disjoint models for multiclass cancer analysis using gene expression data. Bioinformatics 19, 571–578 (2003)
Aedin, C.C., Guy, P., Elizabeth, C.C., Thomas, G.C., Desmond, G.H.: Between group analysis of microarray data. Bioinformatics 18, 1600–1608 (2002)
Armstrong, S.A., Staunton, J.E., Silverman, L.B., Pieters, R., den Boer, M.L., Minden, M.D., Sallan, S.E., Lander, E.S., Golub, T.R., Korsmeyer, S.J.: MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia. Nat. Genet. 30, 41–47 (2002)
van ’t Veer, L.J., Dai, H., van de Vijver, M.J., He, Y.D., Hart, A.A., Mao, M., Peterse, H.L., van der Kooy, K., Marton, M.J., Witteveen, A.T., Schreiber, G.J., Kerkhoven, R.M., Roberts, C., Linsley, P.S., Bernards, R., Friend, S.H.: Gene expression profiling predicts clinical outcome of breast cancer. Nature 415, 530–536 (2002)
Yeung, K.Y., Bumgarner, R.E.: Multiclass classification of microarray data with repeated measurements: application to cancer. Genome. Biol 4, R83 (2003)
Singh, D., Febbo1, P.G., Ross, K., Jackson, D.G., Manola, J., Ladd, C., Tamayo, P., Renshaw, A.A., D’Amico, A.V., Richie, J.P., Lander, E.S., Loda, M., Kantoff, P.W., Golub, T.R., Sellers, W.R.: Gene expression correlates of clinical prostate cancer behavior. Cancer Cell 1, 203–209 (2002)
Dettling, M.: BagBoosting for tumor classification with gene expression data. Bioinformatics 20, 3583–3593 (2004)
Gordon, G.J., Jensen, R.V., Hsiao, L.L., Gullans, S.R., Blumenstock, J.E., Ramaswamy, S., Richards, W.G., Sugarbaker, D.J., Bueno, R.: Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. Cancer research 62, 4963–4967 (2002)
Yeoh, E.-J., Ross, M.E., Shurtleff, S.A., Williams, W.K., Patel, D., Mahfouz, R., Behm, F.G., Raimondi, S.C., Relling, M.V., Patel, A., et al.: Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. Cancer Cell 1, 133–143 (2002)
Jolliffe, I.T.: Principal Component Analysis, 2nd edn. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, Y. (2007). Cancer Identification Based on DNA Microarray Data. In: Washio, T., et al. Emerging Technologies in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4819. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77018-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-77018-3_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77016-9
Online ISBN: 978-3-540-77018-3
eBook Packages: Computer ScienceComputer Science (R0)