Skip to main content

Feature Extraction from Tumor Gene Expression Profiles Using DCT and DFT

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4874))

Abstract

Feature extraction plays a key role in tumor classification based on gene expression profiles, which can improve the performance of classifier. We design two novel feature extraction methods to extract tumor-related features. One is combining gene ranking and discrete cosine transform (DCT) with principal component analysis (PCA), and another is combining gene ranking and discrete Fourier transform (DFT) with PCA. The proposed feature extraction methods are proved successfully and effectively to classify tumor dataset. Experiments show that the obtained classification performance are very steady, which are evaluated by support vector machines (SVM) and K-nearest neighbor (K-NN) classifier on two well-known tumor datasets. Experiment results also show that the 4-fold cross-validated accuracy rate of 100% is obtained for the leukemia dataset and 96.77% for the colon tumor dataset. Compared with other related works, the proposed method not only has higher classification accuracy rate but also is steadier in classification performance.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhang, X., Yap, Y.L., Wei, D., Chen, F., Danchin, A.: Molecular diagnosis of human cancer type by gene expression profiles and independent component analysis. European Journal of Human Genetics 05(9), 1018–4813 (2005)

    Google Scholar 

  2. Wang, S., Wang, J., Chen, H., Tang, W.: The classification of tumor using gene expression profile based on support vector machines and factor analysis. In: Intelligent Systems Design and Applications, Jinan, pp. 471–476. IEEE Computer Society Press, Los Alamitos (2006)

    Chapter  Google Scholar 

  3. Li, S., Liao, C., Kwok, J.T.: Gene feature extraction using T-test statistics and kernel partial least squares. In: King, I., Wang, J., Chan, L., Wang, D. (eds.) ICONIP 2006. LNCS, vol. 4234, pp. 11–20. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  4. Nguyen, D.V., Rocke, D.M.: Tumor classification by partial least squares using microarray gene expression data. Bioinformatics 18(1), 39–45 (2002)

    Article  Google Scholar 

  5. Kira, K., Rendell, L.A.: The feature selection problem: traditional methods and a new algorithm. In: Swartout, W. (ed.) Proceedings of the 10th National Conference on Aritficial Inteligence, pp. 129–134. AAAI Press/The MIT Press, Cambridge, MA (1992)

    Google Scholar 

  6. Ahmed, N., Natarajan, T., Rao, K.R.: Discrete Cosine Transform. IEEE Trans. Computers C-23, 90–94 (1974)

    Article  MathSciNet  Google Scholar 

  7. Theodoridis, S., Koutroumbas, K.: Pattern recognition, pp. 341–342. Academic Press, London (1999)

    Google Scholar 

  8. Vapnik, V.N.: Statistical learning theory. Springer, New York (1998)

    MATH  Google Scholar 

  9. Dasarathy, B.: Nearest Neighbor Norms: NN Pattern Classification Techniques. IEEE Computer Society Press, Los Alamitos (1991)

    Google Scholar 

  10. Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)

    Article  Google Scholar 

  11. Alon, U., Barkai, N., Notterman, D.A., Gish, K., Ybarra, S., Mack, D., Levine, A.: Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues by oligonucleotide arrays. Proc. Nat. Acad. Sci. USA 96, 6745–6750 (1999)

    Article  Google Scholar 

  12. Chang, C.-C., Lin, C.-J.: LIBSVM: A library for support vector machines (2001), Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm

  13. Krishnapuram, B., Carin, L., Hartemink, A.: Gene expression analysis: Joint feature selection and classifier design. In: Scholkopf, B., Tsuda, K., Vert, J.-P. (eds.) Kernel Methods in Computational Biology, pp. 299–318. MIT, Cambridge (2004)

    Google Scholar 

  14. Ben-Dor, A., Bruhn, L., Friedman, N., Nachman, I., Schummer, M., Yakhini, Z.: Tissue classification with gene expression profiles. Journal of Computional Biology 7, 559–584 (2000)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

José Neves Manuel Filipe Santos José Manuel Machado

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, S., Chen, H., Li, S., Zhang, D. (2007). Feature Extraction from Tumor Gene Expression Profiles Using DCT and DFT. In: Neves, J., Santos, M.F., Machado, J.M. (eds) Progress in Artificial Intelligence. EPIA 2007. Lecture Notes in Computer Science(), vol 4874. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77002-2_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77002-2_41

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77000-8

  • Online ISBN: 978-3-540-77002-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics