Abstract
The increase of dimensionality of data is a target for many existing feature selection methods with respect to efficiency and effectiveness. In this paper, the all relevant feature selection method based on information gathered using generational feature selection is described. The successive generations of feature subset were defined using Rule Quality Importance algorithm and next the subset of most important features was eliminated from the primary dataset. This process was executed until the most relevant feature has got importance value on the level equal to importance of the random, shadow feature. The proposed approach was also tested on well-known artificial Madelon dataset and the results confirm its efficiency. Thus, the conclusion is that the identified features are relevant but not all weakly relevant features were discovered.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kuhn, M., Johnson, K.: Applied Predictive Modeling. Springer, New York (2013). https://doi.org/10.1007/978-1-4614-6849-3
Liu, H., et al.: Feature selection aspects. In: Liu, H., Motoda, H. (eds.) Feature Selection for Knowledge Discovery and Data Mining. The Springer International Series in Engineering and Computer Science, vol. 454, pp. 43–72. Springer, Boston (1998). https://doi.org/10.1007/978-1-4615-5689-3_3
Nilsson, R., Peña, J.M., Björkegren, J., Tegnér, J.: Detecting multivariate differentially expressed genes. BMC Bioinform. 8, 150 (2007)
Paja, W., Wrzesień, M., Niemiec, R., Rudnicki, W.R.: Application of all-relevant feature selection for the failure analysis of parameter-induced simulation crashes in climate models. Geosci. Model Dev. 9, 1065–1072 (2016)
Rudnicki, W.R., Wrzesień, M., Paja, W.: All relevant feature selection methods and applications. In: Stańczyk, U., Jain, L.C. (eds.) Feature Selection for Data and Pattern Recognition. SCI, vol. 584, pp. 11–28. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-45620-0_2
Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Mach. Learn. 46, 389–422 (2002)
Johannes, M., Brase, J.C., Frohlich, H., Gade, S., Gehrmann, M., Falth, M., Sultmann, H., Beiflbarth, T.: Integration of pathway knowledge into a reweighted recursive feature elimination approach for risk stratification of cancer patients. Bioinformatics 26(17), 2136–2144 (2010)
Stoppiglia, H., Dreyfus, G., Dubois, R., Oussar, Y.: Ranking a random feature for variable and feature selection. J. Mach. Learn. Res. 3, 1399–1414 (2003)
Tuv, E., Borisov, A., Torkkola, K.: Feature selection using ensemble based ranking against artificial contrasts. In: International Symposium on Neural Networks, pp. 2181–2186 (2006)
Paja, W.: Feature selection methods based on decision rule and tree models. In: Czarnowski, I., Caballero, A.M., Howlett, R.J., Jain, L.C. (eds.) Intelligent Decision Technologies 2016. SIST, vol. 57, pp. 63–70. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-39627-9_6
Paja, W.: Generational feature elimination to find all relevant feature subset. In: Czarnowski, I., Howlett, R.J., Jain, L.C. (eds.) IDT 2017. SIST, vol. 72, pp. 140–148. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-59421-7_13
Guyon, I., Gunn, S., Ben-Hur, A., Dror, G.: Result analysis of the NIPS 2003: feature selection challenge. In: Advances in Neural Information Processing Systems, vol. 17, pp. 545–552 (2013)
Paja, W., Pancerz, K., Grochowalski, P.: Generational feature elimination and some other ranking feature selection methods. In: Stańczyk, U., Zielosko, B., Jain, L.C. (eds.) Advances in Feature Selection for Data and Pattern Recognition. ISRL, vol. 138, pp. 97–112. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-67588-6_6
Acknowledgments
This work was supported by the Center for Innovation and Transfer of Natural Sciences and Engineering Knowledge at the University of Rzeszów.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Paja, W. (2018). A Decision Rule Based Approach to Generational Feature Selection. In: Perner, P. (eds) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2018. Lecture Notes in Computer Science(), vol 10933. Springer, Cham. https://doi.org/10.1007/978-3-319-95786-9_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-95786-9_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95785-2
Online ISBN: 978-3-319-95786-9
eBook Packages: Computer ScienceComputer Science (R0)