Abstract
Recently, demand on the tools to efficiently analyze biological genomic information has been on the rise. In this paper, we attempt to explore the optimal features and classifiers through a comparative study with the most promising feature selection methods and machine learning classifiers. In order to predict the cancer class, the gene information from patient’s marrow expressed by DNA microarray, who has either the acute myeloid leukemia or acute lymphoblastic leukemia. Pearson and Spearman’s correlation, Euclidean distance, cosine coefficient, information gain, mutual information and signal to noise ratio have been used for feature selection. Backpropagation neural network, self-organizing map, structure adaptive self-organizing map, support vector machine, inductive decision tree and k-nearest neighbor have been used for classification. Experimental results indicate that backpropagation neural network with Pearson’s correlation coefficients is the best method, obtaining 97.1% of recognition rate on the test data.
Keywords
- Support Vector Machine
- Acute Myeloid Leukemia
- Feature Selection
- Acute Lymphoblastic Leukemia
- Mutual Information
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This paper was supported by Brain Science and Engineering Research Program sponsored by Korean Ministry of Science and Technology.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Golub, T.R. et al.: Molecular classification of cancer: Class discovery and class prediction by gene-expression monitoring. Science, Vol. 286. (1999) 531–537
Tamayo, P.: Interpreting patterns of gene expression with self-organizing map: Methods and application to hematopoietic differentiation. Proceedings of the National Academy of Sciences of the United States of America. Vol. 96. (1999) 2907–2912
Kim, H.D. and Cho, S.B.: Genetic optimization of structure-adaptive self-organizing map for efficient classification. Proceedings of International Conference on Soft Computing, pp. 34–39, (October 2000)
Quinlan, J.R.: The effect of noise on concept learning. Machine Learning: an Artificial Intelligence Approach. R. S. Michalski, J. G. Carbonell and T. M. Mitchell. Eds. San Mateo, CA: Morgan Kauffmann, Vol. 2. (1986) 149–166
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ryu, J., Cho, SB. (2002). Towards Optimal Feature and Classifier for Gene Expression Classification of Cancer. In: Pal, N.R., Sugeno, M. (eds) Advances in Soft Computing — AFSS 2002. AFSS 2002. Lecture Notes in Computer Science(), vol 2275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45631-7_41
Download citation
DOI: https://doi.org/10.1007/3-540-45631-7_41
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43150-3
Online ISBN: 978-3-540-45631-5
eBook Packages: Springer Book Archive