Abstract
Biomarker discovery becomes the bottle-neck of personalized medicine and has gained increasing interest from various research fields recently. Nevertheless, producing robust and accurate signatures is a crucial problem in biomarker discovery and relies heavily on the used feature selection algorithms. Feature selection is a preprocessing step which plays a crucial role in omics data analysis to improve learning. The accumulating evidence suggests that ensemble methods and swarm intelligence are two growing solutions for improving feature selection algorithms. In this paper, we propose a two stages approach to identify a predefined number of biomarkers from gene expression data. It is designed as a wrapper-based ensemble method; each part of the ensemble is performed through cooperative parallel meta-heuristics and a filter-based mechanism. Experiments from twelve DNA microarray datasets have shown that our approach competes with and even outperforms recent state-of-the-art methods in terms of accuracy and robustness. Also, biological interpretation shows that our approach selects highly informative genes for cancer diagnosis.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bauer, D.C., et al.: Genomics and personalised whole-of-life healthcare. Trends in Molecular Medicine (2014)
Zhang, X., et al.: Integrative Omics Technologies in Cancer Biomarker Discovery. Omics Technologies in Cancer Biomarker Discovery 129 (2011)
Lundblad, R.L.: Development and Application of Biomarkers. CRC Press (2010)
Osl, M., et al.: Applied Data Mining: From Biomarker Discovery to Decision Support Systems. In: Trajanoski, Z. (ed.) Computational Medicine, pp. 173–184 (2012)
Fortino, V., et al.: A Robust and Accurate Method for Feature Selection and Prioritization from Multi-Class OMICs Data. PloS One 9 (9) (2014)
Somorjai, R.L., et al.: Class Prediction and Discovery Using Gene Microarray and Proteomics Mass Spectroscopy Data: Curses, Caveats, Cautions. Bioinformatics 19, 1484–1491 (2003)
Saeys, Y., Inza, I., Larrañaga, P.: A Review of Feature Selection Techniques in Bioinformatics. Bioinformatics 23, 2507–2517 (2007)
Wu, M.-Y., et al.: Biomarker Identification and Cancer Classification Based on Microarray Data Using Laplace Naive Bayes Model with Mean Shrinkage. IEEE/ACM Transactions on Computational Biology and Bioinformatics 9, 1649–1662 (2012)
Bolón-Canedo, V., et al.: A review of microarray datasets and applied feature selection methods. Information Sciences 282, 111–135 (2014)
Sudha George, G.V., et al.: Review on feature selection techniques and the impact of SVM for cancer classification using gene expression profile. International Journal of Computer Science & Engineering Survey 2, 16–27 (2011)
Cosmin, L., et al.: A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis. IEEE/ACM Transactions on Computational Biology and Bioinformatics 9, 1106–1119 (2012)
Martinez, E., et al.: Compact cancer biomarkers discovery using a swarm intelligence feature selection algorithm. Computational Biology and Chemistry 34, 244–250 (2010)
Saeys, Y., Abeel, T., Van de Peer, Y.: Robust Feature Selection Using Ensemble Feature Selection Techniques. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol. 5212, pp. 313–325. Springer, Heidelberg (2008)
Cadenas, J.M., et al.: Feature subset selection Filter–Wrapper based on low quality data. Expert Systems with Applications 40, 6241–6252 (2013)
Zhang, S., et al.: A new unsupervised feature ranking method for gene expression data based on consensus affinity. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB) 9(4), 1257–1263 (2012)
Boucheham, A., Batouche, M.: Robust biomarker discovery for cancer diagnosis based on meta-ensemble feature selection. In: Science and Information Conference (SAI), pp. 452–560. IEEE (2014)
Abeel, T., et al.: Robust biomarker identification for cancer diagnosis with ensemble feature selection methods. Bioinformatics 26, 392–398 (2010)
Ghorai, S., et al.: Cancer classification from gene expression data by NPPC ensemble. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB) 8(3), 659–671 (2011)
Izzo, D., Ruciński, M., Biscani, F.: The Generalized Island Model. In: Fernandez de Vega, F., Hidalgo Pérez, J.I., Lanchares, J. (eds.) Parallel Architectures & Bioinspired Algorithms. SCI, vol. 415, pp. 151–170. Springer, Heidelberg (2012)
Boucheham, A., Batouche, M.: Robust Hybrid wrapper/filter Biomarker Discovery based on Generalized Island Model from Gene Expression Data. International Journal of Computational Biology and Drug Design (in press)
García-Nieto, J., et al.: Parallel multi-swarm optimizer for gene selection in DNA microarrays. Appl. Intell. 37, 255–266 (2012)
Gutiérrez, A.L., et al.: Comparison of different PSO initialization techniques for high dimensional search space problems: A test with FSS and antenna arrays. Antennas and Propagation (EUCAP). In: Proceedings of the 5th European Conference on IEEE, pp. 965–969 (2011)
J.: R, Quinlan.: C4.5: programs for machine learning. Morgan Kaufmann Publishers 1 (1993)
Alba, E., et al.: Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms. In: IEEE Congress on Evolutionary Computation, CEC 2007, pp. 284–290. IEEE (2007)
Huang, C.-L.: ACO-based hybrid classification system with feature subset selection and model parameters optimization. Neurocomputing 73, 438–448 (2009)
Yazdani, S., et al.: Feature subset selection using constrained binary/integer biogeography- based optimization. ISA Transactions 52, 383–390 (2013)
Kuncheva, L.I.: A stability index for feature selection. International Multi- Conference. In: Artificial Intelligence and Applications, pp. 390–395. ACTA Press Anaheim, CA (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Boucheham, A., Batouche, M., Meshoul, S. (2015). An Ensemble of Cooperative Parallel Metaheuristics for Gene Selection in Cancer Classification. In: Ortuño, F., Rojas, I. (eds) Bioinformatics and Biomedical Engineering. IWBBIO 2015. Lecture Notes in Computer Science(), vol 9044. Springer, Cham. https://doi.org/10.1007/978-3-319-16480-9_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-16480-9_30
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16479-3
Online ISBN: 978-3-319-16480-9
eBook Packages: Computer ScienceComputer Science (R0)