Abstract
This article proposes a Multiple-Filter (MF) using a genetic algorithm (GA) and Tabu Search (TS) combined with a Support Vector Machine (SVM) for gene selection and classification of DNA microarray data. The proposed method is designed to select a subset of relevant genes that classify the DNA-microarray data more accurately. First, five traditional statistical methods are used for preliminary gene selection (Multiple Filter). Then different relevant gene subsets are selected by using a Wrapper (GA/TS/SVM). A gene subset, consisting of relevant genes, is obtained from each statistical method, by analyzing the frequency of each gene in the different gene subsets. Finally, the most frequent genes are evaluated by the Multiple Wrapper approach to obtain a final relevant gene subset. The proposed method is tested in four DNA-microarray datasets. In the experimental results it is observed that our model work very well than other methods reported in the literature.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Golub, T., Slonim, D., Tamayo, P., et al.: Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science, 531–537 (1999)
Alon, U., Barkai, N., Notterman, D., et al.: Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Nat. Acad. Sci. USA, 6745–6750 (1999)
Gordon, G.J., et al.: Translation of Microarray Data into Clinically Relevant Cancer Diagnostic Tests Using gene expression ratios in lung cancer and mesothelioma. Cancer Res. (2002)
Pomeroy, S.L., Tamayo, P., Gaasenbeek, M., Sturla, L.M., Golub, T.R.: Prediction of Central Nervous System Embryonaltumour Outcome Based on Gene Expression. Nature, 436–442 (2002)
Alizadeh, A.A., Eisen, B.M., Davis, R.E., et al.: Distinct Types of Diffuse Large (b)–Cell Lymphoma Identified by Gene Expression Profiling. Nature, 503–511 (2000)
Hernandez, J.C., Duval, B., Hao, J.K.: SVM-based Local Search For Gene Selection and Classification of Microarray Data. Communications in Computer and Information Science 13, 499–508 (2008)
Mohamad, M.S., et al.: A Hybrid of Genetic Algorithm and Support Vector Machine for Features Selection and Classification of Gene Expression Microarray. International Journal of Computational Intelligence and Applications, 91–107 (2005)
Dudoit, S., Fridlyand, J., Speed, T.P.: Comparison of Discrimination Methods for The Classification of Tmors Using Gene Expression Data. Journal of the American Statistical Association, 77–87 (2002)
Deng, L., Pei, J., Ma, J., Lee, D.L.: A Rank Sum Test Method for Informative Gene Discovery. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), pp. 410–419. ACM Press, Seattle (2004)
Mishra, D., Sahu, B.: Feature Selection for Cancer Classification: A Signal-to-noise Ratio Approach. International Journal of Scientific & Engineering Research 2 (2011)
Hernández Montiel, L.A., Bonilla Huerta, E., Morales Caporal, R.: A multiple-filter-GA-SVM Method for Dimension Reduction and Classification of DNA-microarray data. Revista Mexicana de Ingenieria Biomedical XXXII, 32–39 (2011)
Li, L., Darden, T.A., Weinberg, C.R., Levine, A.J., Pedersen, L.G.: Gene Assessment and Sample Classification for Gene Expression Data Using a Genetic Algorithm/K-Nearest Neighbor Method. Combinatorial Chemistry & High Throughput Screening, 727–739 (2001)
Luo, L.K., Huang, D.F., Ye, L.J., Zhou, Q.F., Shao, G.F., Peng, H.: Improving the Computational Efficiency of Recursive Cluster Elimination for Gene Selection. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 122–129 (2011)
Yu, L., Han, Y., Berens, M.E.: Stable Gene Selection from Microarray Data via Sample Weighting. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 262–272 (2012)
Yu, G., Feng, Y., Miller, D.J., Xuan, J., Hoffman, E.P., Clarke, R., Davidson, B., Shih, I.M., Wang, Y.: Matched Gene Selection and Committee Classifier for Molecular Classification of Heterogeneous Diseases. Journal of Machine Learning Research, 2141–2167 (2010)
Leung, Y., Hung, Y.: A Multiple-filter-multiple-wrapper Approach to Gene Selection and Microarray Data Classification. IEEE/ACM Trans. Comput. Biol. Bioinformatics 7(1), 108–117 (2010), doi:10.1109/TCBB.2008.46
Glover, F., Melián, B.: Tabu Search. Revista Iberoamericana de Inteligencia Artificial (2003)
Vélez, M.C., Motoya, J.A.: Metaheurísticos: Una alternativa para la solución de problemas combinatorios en Administración de Operaciones. EIA 8, 99–115 (2007)
Dudoit, S., Fridlyand, J., Speed, T.: Comparison of Discrimination Methods for The Classification of Tumors Using Gene Expression Data. Journal of the American Statistical Association, 77–87 (2002)
Fu, X., Tan, F., Wang, H., Zhang, Y.Q., Harrison, R.: Feature Similarity Based Redundancy Reduction for Gene Selection. In: Proceedings of 2006 International Conference on Data Mining (DMIN 2006), Las Vegas, June 26-29 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Luis, HM.A., Edmundo, BH., Roberto, MC., José, GG.A. (2014). Selection and Classification of Gene Expression Data Using a MF-GA-TS-SVM Approach. In: Huang, DS., Han, K., Gromiha, M. (eds) Intelligent Computing in Bioinformatics. ICIC 2014. Lecture Notes in Computer Science(), vol 8590. Springer, Cham. https://doi.org/10.1007/978-3-319-09330-7_36
Download citation
DOI: https://doi.org/10.1007/978-3-319-09330-7_36
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09329-1
Online ISBN: 978-3-319-09330-7
eBook Packages: Computer ScienceComputer Science (R0)