Abstract
In the field of data mining (DM), feature selection is one of the basic strategies handling with high-dimensionality problems. This paper makes a review of current methods of feature selection and proposes a unified strategy of feature selection, which divides overall procedures of feature selection into two stages, first to determine the FIF (Feature Important Factor) of features according to DM tasks, second to select features according to FIF. For classifying problems, we propose a new method for determining FIF based on decision trees and provide practical suggestion for feature selection. Through analysis on experiments conducted on UCI datasets, such a unified strategy of feature selection is proven to be effective and efficient.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kantardzic, M.: Data Mining Concepts, Models, Methods, and Algorithms. A John Wiley & Sons, Inc., Chichester (2003)
Tan, P.N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Pearson Education, Inc., London (2006)
Dash, M.: Feature Selection for Classification. Intelligent Data Analysis 1, 131–156 (1997)
Das, S.: Filters, Wrappers and A Boosting Based Hybrid for Feature Selection. In: Proceedings of the Eighteenth International Conference on Machine Learning, pp. 74–81 (2001)
Ratanamahatana, C.A., Gunopulos, D.: Feature Selection for the Naive Bayesian Classifier Using Decision Trees. Applied Artificial Intelligence 17(5–6), 475–487 (2003)
Liu, P.: R-C4.5: A Robust Decision Tree Improved Model. In: Proceedings of ISICA 2005 (The International Symposium on Intelligent Computation and Its Application), Progress in Intelligent Computation and Its Applications, Wuhan, China, pp. 454–459 (2005)
Merz, C.J., Murphy, P.M.: UCI Repository of Machine Learning Datasets (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, London (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, P., Wu, N., Zhu, J., Yin, J., Zhang, W. (2006). A Unified Strategy of Feature Selection. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science(), vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_50
Download citation
DOI: https://doi.org/10.1007/11811305_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer ScienceComputer Science (R0)