A Unified Strategy of Feature Selection

Liu, Peng; Wu, Naijun; Zhu, Jiaxian; Yin, Junjie; Zhang, Wei

doi:10.1007/11811305_50

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4093)

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

Abstract

In data mining (DM), feature selection is one of the basic strategies for handling high-dimensional data. This paper reviews current feature selection methods and proposes a unified strategy that divides the overall procedure into two stages: first, determine the FIF (Feature Important Factor) of each feature according to the DM task; second, select features according to their FIF. For classification problems, we propose a new method for determining FIF based on decision trees and provide practical suggestions for feature selection. Analysis of experiments conducted on UCI datasets shows that this unified strategy is effective and efficient.
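
The two-stage idea described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's FIF formula, so the score used here is an assumption: the information gain of each feature, a split criterion commonly used when building decision trees. The function names (`importance_scores`, `select_features`) and the toy data are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(column, labels):
    """Entropy reduction obtained by splitting on one categorical feature."""
    groups = defaultdict(list)
    for value, label in zip(column, labels):
        groups[value].append(label)
    remainder = sum(len(g) / len(labels) * entropy(g)
                    for g in groups.values())
    return entropy(labels) - remainder


def importance_scores(rows, labels):
    """Stage 1: one score per feature (an assumed stand-in for FIF)."""
    n_features = len(rows[0])
    return [information_gain([row[j] for row in rows], labels)
            for j in range(n_features)]


def select_features(scores, k):
    """Stage 2: indices of the k highest-scoring features."""
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])


# Toy categorical data: only feature 0 determines the class label.
rows = [
    ("sunny", "hot", "a"),
    ("sunny", "cold", "b"),
    ("rainy", "hot", "a"),
    ("rainy", "cold", "b"),
]
labels = ["yes", "yes", "no", "no"]

scores = importance_scores(rows, labels)
selected = select_features(scores, k=1)
print(scores, selected)
```

Keeping the scoring step separate from the selection step mirrors the split the abstract describes: a different task-specific score could be substituted in stage 1 without changing stage 2.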

Author information

Authors and Affiliations

School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai, 200433, P.R. China
Peng Liu, Naijun Wu, Jiaxian Zhu, Junjie Yin & Wei Zhang

Editor information

Editors and Affiliations

School of Information Technology and Electronic Engineering, The University of Queensland, Queensland, Australia
Xue Li
University of Alberta, Canada
Osmar R. Zaïane
Northwest Polytechnical University, China
Zhanhuai Li

Cite this paper

Liu, P., Wu, N., Zhu, J., Yin, J., Zhang, W. (2006). A Unified Strategy of Feature Selection. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science, vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_50

DOI: https://doi.org/10.1007/11811305_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer Science, Computer Science (R0)
