Consistency Based Attribute Reduction

Hu, Qinghua; Zhao, Hui; Xie, Zongxia; Yu, Daren

doi:10.1007/978-3-540-71701-0_12

Qinghua Hu¹,
Hui Zhao¹,
Zongxia Xie¹ &
…
Daren Yu¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4426))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

1866 Accesses
27 Citations

Abstract

Rough sets are widely used in feature subset selection and attribute reduction. In most of the existing algorithms, the dependency function is employed to evaluate the quality of a feature subset. The disadvantages of using dependency are discussed in this paper. And the problem of forward greedy search algorithm based on dependency is presented. We introduce the consistency measure to deal with the problems. The relationship between dependency and consistency is analyzed. It is shown that consistency measure can reflects not only the size of decision positive region, like dependency, but also the sample distribution in the boundary region. Therefore it can more finely describe the distinguishing power of an attribute set. Based on consistency, we redefine the redundancy and reduct of a decision system. We construct a forward greedy search algorithm to find reducts based on consistency. What’s more, we employ cross validation to test the selected features, and reduce the overfitting features in a reduct. The experimental results with UCI data show that the proposed algorithm is effective and efficient.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bhatt, R.B., Gopal, M.: On fuzzy-rough sets approach to feature selection. Pattern Recognition Letters 26, 965–975 (2005)
Article Google Scholar
Breiman, L., et al.: Classification and regression trees. Wadsworth International, Belmont (1984)
MATH Google Scholar
Dash, M., Liu, H.: Consistency-based search in feature selection. Artificial Intelligence 151, 155–176 (2003)
Article MATH MathSciNet Google Scholar
Guyon, I., Weston, J., Barnhill, S., et al.: Gene selection for cancer classification using support vector machines. Machine Learning 46, 389–422 (2002)
Article MATH Google Scholar
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. Journal of Machine Learning Research 3, 1157–1182 (2003)
Article MATH Google Scholar
Hu, Q.H., Li, X.D., Yu, D.R.: Analysis on Classification Performance of Rough Set Based Reducts. In: Yang, Q., Webb, G. (eds.) PRICAI 2006. LNCS (LNAI), vol. 4099, pp. 423–433. Springer, Heidelberg (2006)
Google Scholar
Hu, Q.H., Yu, D.R., Xie, Z.X.: Information-preserving hybrid data reduction based on fuzzy-rough techniques. Pattern Recognition Letters 27, 414–423 (2006)
Article Google Scholar
Jensen, R., Shen, Q.: Semantics-preserving dimensionality reduction: Rough and fuzzy-rough-based approaches. IEEE transactions of knowledge and data engineering 16, 1457–1471 (2004)
Article Google Scholar
Liu, H., Yu, L.: Toward integrating feature selection algorithms for classification and clustering. IEEE Transactions on knowledge and data engineering 17, 491–502 (2005)
Article Google Scholar
Quinlan, J.R.: Induction of decision trees. Machine Learning 1, 81–106 (1986)
Google Scholar
Skowron, A., Rauszer, C.: The Discernibility Matrices and Functions in Information Systems. In: Slowinski, R. (ed.) Intelligent Decision Support-Handbook of Applications and Advances of the Rough Sets Theory, pp. 331–362 (1991)
Google Scholar
Slezak, D.: Approximate decision reducts. Ph.D. Thesis, Warsaw University (2001)
Google Scholar
Ślezak, D.: Approximate Entropy Reducts. Fundamenta Informaticae 53, 365–390 (2002)
MathSciNet Google Scholar
Swiniarski, R.W., Skowron, A.: Rough set methods in feature selection and recognition. Pattern recognition letters 24, 833–849 (2003)
Article MATH Google Scholar
Xie, Z.X., Hu, Q.H., Yu, D.R.: Improved feature selection algorithm based on SVM and correlation. In: Wang, J., et al. (eds.) ISNN 2006. LNCS, vol. 3971, pp. 1373–1380. Springer, Heidelberg (2006)
Chapter Google Scholar
Zhong, N., Dong, J., Ohsuga, S.: Using rough sets with heuristics for feature selection. J. Intelligent Information Systems 16, 199–214 (2001)
Article MATH Google Scholar
Ziarko, W.: Variable precision rough sets model. Journal of Computer and System Sciences 46, 39–59 (1993)
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Harbin Institute of Technology, Harbin 150001, P.R. China
Qinghua Hu, Hui Zhao, Zongxia Xie & Daren Yu

Authors

Qinghua Hu
View author publications
You can also search for this author in PubMed Google Scholar
Hui Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Zongxia Xie
View author publications
You can also search for this author in PubMed Google Scholar
Daren Yu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Zhi-Hua Zhou Hang Li Qiang Yang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hu, Q., Zhao, H., Xie, Z., Yu, D. (2007). Consistency Based Attribute Reduction. In: Zhou, ZH., Li, H., Yang, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4426. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71701-0_12

Download citation

DOI: https://doi.org/10.1007/978-3-540-71701-0_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71700-3
Online ISBN: 978-3-540-71701-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics