Skip to main content

An Ensemble of Cooperative Parallel Metaheuristics for Gene Selection in Cancer Classification

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 9044))

Abstract

Biomarker discovery becomes the bottle-neck of personalized medicine and has gained increasing interest from various research fields recently. Nevertheless, producing robust and accurate signatures is a crucial problem in biomarker discovery and relies heavily on the used feature selection algorithms. Feature selection is a preprocessing step which plays a crucial role in omics data analysis to improve learning. The accumulating evidence suggests that ensemble methods and swarm intelligence are two growing solutions for improving feature selection algorithms. In this paper, we propose a two stages approach to identify a predefined number of biomarkers from gene expression data. It is designed as a wrapper-based ensemble method; each part of the ensemble is performed through cooperative parallel meta-heuristics and a filter-based mechanism. Experiments from twelve DNA microarray datasets have shown that our approach competes with and even outperforms recent state-of-the-art methods in terms of accuracy and robustness. Also, biological interpretation shows that our approach selects highly informative genes for cancer diagnosis.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bauer, D.C., et al.: Genomics and personalised whole-of-life healthcare. Trends in Molecular Medicine (2014)

    Google Scholar 

  2. Zhang, X., et al.: Integrative Omics Technologies in Cancer Biomarker Discovery. Omics Technologies in Cancer Biomarker Discovery 129 (2011)

    Google Scholar 

  3. Lundblad, R.L.: Development and Application of Biomarkers. CRC Press (2010)

    Google Scholar 

  4. Osl, M., et al.: Applied Data Mining: From Biomarker Discovery to Decision Support Systems. In: Trajanoski, Z. (ed.) Computational Medicine, pp. 173–184 (2012)

    Google Scholar 

  5. Fortino, V., et al.: A Robust and Accurate Method for Feature Selection and Prioritization from Multi-Class OMICs Data. PloS One 9 (9) (2014)

    Google Scholar 

  6. Somorjai, R.L., et al.: Class Prediction and Discovery Using Gene Microarray and Proteomics Mass Spectroscopy Data: Curses, Caveats, Cautions. Bioinformatics 19, 1484–1491 (2003)

    Article  Google Scholar 

  7. Saeys, Y., Inza, I., Larrañaga, P.: A Review of Feature Selection Techniques in Bioinformatics. Bioinformatics 23, 2507–2517 (2007)

    Article  Google Scholar 

  8. Wu, M.-Y., et al.: Biomarker Identification and Cancer Classification Based on Microarray Data Using Laplace Naive Bayes Model with Mean Shrinkage. IEEE/ACM Transactions on Computational Biology and Bioinformatics 9, 1649–1662 (2012)

    Article  Google Scholar 

  9. Bolón-Canedo, V., et al.: A review of microarray datasets and applied feature selection methods. Information Sciences 282, 111–135 (2014)

    Article  Google Scholar 

  10. Sudha George, G.V., et al.: Review on feature selection techniques and the impact of SVM for cancer classification using gene expression profile. International Journal of Computer Science & Engineering Survey 2, 16–27 (2011)

    Article  Google Scholar 

  11. Cosmin, L., et al.: A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis. IEEE/ACM Transactions on Computational Biology and Bioinformatics 9, 1106–1119 (2012)

    Article  Google Scholar 

  12. Martinez, E., et al.: Compact cancer biomarkers discovery using a swarm intelligence feature selection algorithm. Computational Biology and Chemistry 34, 244–250 (2010)

    Article  Google Scholar 

  13. Saeys, Y., Abeel, T., Van de Peer, Y.: Robust Feature Selection Using Ensemble Feature Selection Techniques. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol. 5212, pp. 313–325. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  14. Cadenas, J.M., et al.: Feature subset selection Filter–Wrapper based on low quality data. Expert Systems with Applications 40, 6241–6252 (2013)

    Article  Google Scholar 

  15. Zhang, S., et al.: A new unsupervised feature ranking method for gene expression data based on consensus affinity. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB) 9(4), 1257–1263 (2012)

    Article  Google Scholar 

  16. Boucheham, A., Batouche, M.: Robust biomarker discovery for cancer diagnosis based on meta-ensemble feature selection. In: Science and Information Conference (SAI), pp. 452–560. IEEE (2014)

    Google Scholar 

  17. Abeel, T., et al.: Robust biomarker identification for cancer diagnosis with ensemble feature selection methods. Bioinformatics 26, 392–398 (2010)

    Article  Google Scholar 

  18. Ghorai, S., et al.: Cancer classification from gene expression data by NPPC ensemble. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB) 8(3), 659–671 (2011)

    Article  Google Scholar 

  19. Izzo, D., Ruciński, M., Biscani, F.: The Generalized Island Model. In: Fernandez de Vega, F., Hidalgo Pérez, J.I., Lanchares, J. (eds.) Parallel Architectures & Bioinspired Algorithms. SCI, vol. 415, pp. 151–170. Springer, Heidelberg (2012)

    Google Scholar 

  20. Boucheham, A., Batouche, M.: Robust Hybrid wrapper/filter Biomarker Discovery based on Generalized Island Model from Gene Expression Data. International Journal of Computational Biology and Drug Design (in press)

    Google Scholar 

  21. García-Nieto, J., et al.: Parallel multi-swarm optimizer for gene selection in DNA microarrays. Appl. Intell. 37, 255–266 (2012)

    Article  Google Scholar 

  22. Gutiérrez, A.L., et al.: Comparison of different PSO initialization techniques for high dimensional search space problems: A test with FSS and antenna arrays. Antennas and Propagation (EUCAP). In: Proceedings of the 5th European Conference on IEEE, pp. 965–969 (2011)

    Google Scholar 

  23. J.: R, Quinlan.: C4.5: programs for machine learning. Morgan Kaufmann Publishers 1 (1993)

    Google Scholar 

  24. Alba, E., et al.: Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms. In: IEEE Congress on Evolutionary Computation, CEC 2007, pp. 284–290. IEEE (2007)

    Google Scholar 

  25. Huang, C.-L.: ACO-based hybrid classification system with feature subset selection and model parameters optimization. Neurocomputing 73, 438–448 (2009)

    Article  Google Scholar 

  26. Yazdani, S., et al.: Feature subset selection using constrained binary/integer biogeography- based optimization. ISA Transactions 52, 383–390 (2013)

    Article  Google Scholar 

  27. Kuncheva, L.I.: A stability index for feature selection. International Multi- Conference. In: Artificial Intelligence and Applications, pp. 390–395. ACTA Press Anaheim, CA (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Boucheham, A., Batouche, M., Meshoul, S. (2015). An Ensemble of Cooperative Parallel Metaheuristics for Gene Selection in Cancer Classification. In: Ortuño, F., Rojas, I. (eds) Bioinformatics and Biomedical Engineering. IWBBIO 2015. Lecture Notes in Computer Science(), vol 9044. Springer, Cham. https://doi.org/10.1007/978-3-319-16480-9_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-16480-9_30

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16479-3

  • Online ISBN: 978-3-319-16480-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics