Abstract
Feature extraction plays a key role in tumor classification based on gene expression profiles because it can improve classifier performance. We design two novel feature extraction methods to extract tumor-related features. The first combines gene ranking and the discrete cosine transform (DCT) with principal component analysis (PCA); the second combines gene ranking and the discrete Fourier transform (DFT) with PCA. Both methods prove effective for classifying tumor datasets. Experiments on two well-known tumor datasets, evaluated with support vector machine (SVM) and K-nearest neighbor (K-NN) classifiers, show that the classification performance is very steady. The 4-fold cross-validated accuracy reaches 100% on the leukemia dataset and 96.77% on the colon tumor dataset. Compared with related work, the proposed methods not only achieve higher classification accuracy but are also steadier in classification performance.
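The pipeline named in the abstract (gene ranking, then DCT or DFT, then PCA) can be illustrated with a minimal NumPy sketch. The abstract does not give the details, so several choices here are assumptions: the t-like ranking score, feeding the selected genes to the transform in rank order, using the DFT magnitude, and the hypothetical function names and the parameters `top_k` and `n_components`. It shows the general idea and does not reproduce the authors' implementation.

```python
import numpy as np

def rank_genes(X, y, top_k):
    """Return indices of the top_k genes by an absolute two-class t-like score.
    Assumption: the abstract does not say which ranking criterion is used."""
    a, b = X[y == 0], X[y == 1]
    se = np.sqrt(a.var(0, ddof=1) / len(a) + b.var(0, ddof=1) / len(b) + 1e-12)
    score = np.abs(a.mean(0) - b.mean(0)) / se
    return np.argsort(score)[::-1][:top_k]

def dct2(X):
    """Orthonormal DCT-II applied to each row of X."""
    n = X.shape[1]
    k = np.arange(n)[:, None]   # frequency index
    i = np.arange(n)[None, :]   # position index
    C = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    C[0] /= np.sqrt(2.0)        # scale the DC row so that C is orthogonal
    return X @ C.T

def dft_mag(X):
    """Magnitude of the real-input DFT of each row (assumption: magnitude only)."""
    return np.abs(np.fft.rfft(X, axis=1))

def fit_pca(X, n_components):
    """Return the training mean and the leading principal directions via SVD."""
    mean = X.mean(0)
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:n_components]

def extract(X_train, y_train, X_test, top_k=50, n_components=5, transform=dct2):
    """Rank genes on the training data, transform, then project with PCA."""
    idx = rank_genes(X_train, y_train, top_k)
    T_train = transform(X_train[:, idx])
    T_test = transform(X_test[:, idx])
    mean, W = fit_pca(T_train, n_components)
    return (T_train - mean) @ W.T, (T_test - mean) @ W.T
```

In a cross-validation setting such as the 4-fold evaluation mentioned above, `extract` would be called once per fold, with ranking and PCA fitted on the training folds only, and the returned features passed to an SVM or K-NN classifier. Passing `transform=dft_mag` gives the DFT variant.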
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, S., Chen, H., Li, S., Zhang, D. (2007). Feature Extraction from Tumor Gene Expression Profiles Using DCT and DFT. In: Neves, J., Santos, M.F., Machado, J.M. (eds) Progress in Artificial Intelligence. EPIA 2007. Lecture Notes in Computer Science, vol 4874. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77002-2_41
DOI: https://doi.org/10.1007/978-3-540-77002-2_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77000-8
Online ISBN: 978-3-540-77002-2
eBook Packages: Computer Science, Computer Science (R0)