Abstract
Practical machine learning algorithms are known to degrade in performance when faced with many features that are not necessary for rule discovery. To cope with this problem, many methods have been proposed for selecting a subset of features that merits focused analysis. Two typical ones are the filter approach, which selects a feature subset in a preprocessing step, and the wrapper approach, which searches the space of possible feature subsets using the induction algorithm itself as part of the evaluation function. Although the filter approach is faster, it is somewhat blind: the performance of induction is not considered. The wrapper approach, on the other hand, can find an optimal feature subset, but its time and space complexity makes it hard to use. In this paper, we propose an algorithm that uses the rough set methodology with greedy heuristics for feature selection. Our approach selects features in a manner similar to the filter approach, but the performance of induction is taken into account in the evaluation criterion; that is, we select the features that damage the performance of induction as little as possible.
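The paper's algorithm itself is not reproduced on this page, but the abstract's idea of greedy, rough-set-guided selection can be sketched. The snippet below is a minimal illustration in the spirit of QuickReduct-style methods, not the authors' exact method: it greedily adds the attribute that most increases the rough-set dependency degree (the fraction of objects whose equivalence class under the chosen attributes is pure with respect to the decision label) until the selected subset discerns as well as the full attribute set. All function names here are illustrative.

```python
from collections import defaultdict

def dependency(rows, labels, attrs):
    """Rough-set dependency degree gamma(attrs -> label): the fraction of
    rows lying in the positive region, i.e. whose equivalence class under
    the chosen attributes contains a single decision label."""
    classes = defaultdict(list)
    for row, lab in zip(rows, labels):
        classes[tuple(row[a] for a in attrs)].append(lab)
    pos = sum(len(labs) for labs in classes.values() if len(set(labs)) == 1)
    return pos / len(rows)

def greedy_reduct(rows, labels):
    """Greedily add the attribute that most increases the dependency
    degree until it matches that of the full attribute set."""
    all_attrs = list(range(len(rows[0])))
    target = dependency(rows, labels, all_attrs)
    selected = []
    while dependency(rows, labels, selected) < target:
        best = max((a for a in all_attrs if a not in selected),
                   key=lambda a: dependency(rows, labels, selected + [a]))
        selected.append(best)
    return selected
```

For example, with rows `[[0,0],[0,1],[1,0],[1,1]]` and labels `[0,0,1,1]`, the first attribute alone already separates the two decision classes, so the sketch returns `[0]`. This mirrors the filter-like character noted in the abstract: selection is driven by a data-side criterion rather than by repeatedly running the induction algorithm.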
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dong, J., Zhong, N., Ohsuga, S. (1999). Using Rough Sets with Heuristics for Feature Selection. In: Zhong, N., Skowron, A., Ohsuga, S. (eds) New Directions in Rough Sets, Data Mining, and Granular-Soft Computing. RSFDGrC 1999. Lecture Notes in Computer Science, vol 1711. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-48061-7_22
DOI: https://doi.org/10.1007/978-3-540-48061-7_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66645-5
Online ISBN: 978-3-540-48061-7