Abstract
Microarray data analysis is attracting increasing attention in computer science because of the many applications of machine learning methods in prediction problems. The process typically involves a feature selection step, important in order to increase the accuracy and speed of the classifiers. This work analyzes the characteristics of the features selected by two wrapper methods, the first one based on artificial neural networks (ANN) and the second in a novel constructive neural network (CNN) algorithm, to later propose a hybrid model that combines the advantages of wrapper and filter methods. The results obtained in terms of the computational costs involved and the prediction accuracy reached show the feasibility of the hybrid model proposed here and indicate an interesting research line for the near future.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Huda, S., Yearwood, J., Strainieri, A.: Hybrid Wrapper-Filter Approaches for Input Feature Selection Using Maximum Relevance and Artificial Neural Network Input Gain Measurement Approximation (ANNIGMA). In: 4th International Conference on Network and System Security, pp. 442–449 (2010)
Inza, I., Larrañaga, P., Blanco, R., Cerrolaza, A.J.: Filter versus wrapper gene selection approaches in DNA microarray domains. Artificial Intelligence in Medicine 31, 91–103 (2004)
Kohavi, R., John, G.: The Wrapper approach. In: Feature Extraction, Construction and Selection: a data mining perspective, pp. 33–51 (1998)
Lancashire, L.J., Rees, R.C., Ball, G.R.: Identification of gene transcript signatures predictive for er and lymph node status using a stepwise forward selection ann modelling approach. Artif. Intell. Med. 43, 99–111 (2008)
Pirooznia, M., Yang, J., Yang, M.Q., Deng, Y.: A comparative study of different machine learning methods on microarray gene expression data. BMC Genomics 9, S13 (2008)
Sebban, M., Nock, R.: A hybrid filter/wrapper approach of feature selection using information theory. Pattern Recognition 35, 835–846 (2002)
Subirats, J.L., Jerez, J.M., Gómez, I., Franco, L.: Multiclass Pattern Recognition Extension for the New C-Mantec Constructive Neural Network Algorithm. Cognitive Computation 2, 285–290 (2010)
Urda, D., Subirats, J.L., Franco, L., Jerez, J.M.: Constructive neural networks to predict breast cancer outcome by using gene expression profiles. In: García-Pedrajas, N., Herrera, F., Fyfe, C., Benítez, J.M., Ali, M. (eds.) IEA/AIE 2010. LNCS, vol. 6096, pp. 317–326. Springer, Heidelberg (2010)
West, M.: Bayesian factor regression models in the “large p, small n” paradigm. Bayesian statistics 7, 723–732 (2003)
West, M., Blanchette, C., Dressman, H., Huang, E., Ishida, S., Spang, R., Zuzan, H., Olson, J.A., Marks, J.R., Nevins, J.R.: Predicting the clinical status of human breast cancer by using gene expression profiles. PNAS 98, 11462–11467 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Couce, Y., Franco, L., Urda, D., Subirats, J.L., Jerez, J.M. (2011). Hybrid (Generalization-Correlation) Method for Feature Selection in High Dimensional DNA Microarray Prediction Problems. In: Cabestany, J., Rojas, I., Joya, G. (eds) Advances in Computational Intelligence. IWANN 2011. Lecture Notes in Computer Science, vol 6692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21498-1_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-21498-1_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21497-4
Online ISBN: 978-3-642-21498-1
eBook Packages: Computer ScienceComputer Science (R0)