ComEnVprs: A Novel Approach for Inducing Decision Tree Classifiers

Wang, Shuqin; Wei, Jinmao; You, Junping; Liu, Dayou

doi:10.1007/11811305_13

Shuqin Wang²²,
Jinmao Wei^23,24,
Junping You²³ &
…
Dayou Liu²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4093))

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

2817 Accesses

Abstract

This paper presents a new approach for inducing decision trees by combining information entropy criteria with VPRS based methods. From the angle of rough set theory, when inducing decision trees, entropy based methods emphasize the effect of class distribution. Whereas the rough set based approaches emphasize the effect of certainty. The presented approach takes the advantages of both criteria for inducing decision trees. Comparisons between the presented approach and the fundamental information entropy based method on some data sets from the UCI Machine Learning Repository are also reported.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hunt, E.B., Marin, J., Stone, P.J.: Experiments in Induction. Academic Press, New York (1966)
Google Scholar
Fayyad, U.M., Weir, N., Djiorgovski, S.: SKICAT: A machine learning system for automated cataloging of large scale sky surveys. In: Proc. the Tenth International Conference on Machine Learning, pp. 112–119. Morgan Kaufmann, Amherst, MA (1993)
Google Scholar
Michalski, R.S., Carbonell, J.G., Mitchell, T.M.: Machine Learning-An Artificial Intelligence Approach. Springer, Germany (1983)
Google Scholar
Chen, S.C., Shyu, M.L., Chen, M., Zhang, C.C.: A Decision Tree-based Multimodal Data Mining Framework for Soccer Goal Detection. In: IEEE International Conference on Multimedia and Expo, June 27-June 30, 2004, Taipei, Taiwan, ROC (2004)
Google Scholar
Quinlan, J.R.: Introduction of Decision Trees. Machine Learning 3, 81–106 (1986)
Google Scholar
Cheng, J., Bell, D.: Learning bayesian networks from data: an efficient approach based on information theory. In: Proc. of the sixth ACM International Conference on Information and Knowledge Management, pp. 325–331 (1997)
Google Scholar
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and regression trees. Technical report, Wadsworth International, Monterey, CA (1984)
Google Scholar
Pawlak, Z.: Rough sets. International Journal of Computer and Information Science 11, 341–356 (1982)
Article MATH MathSciNet Google Scholar
Jerzy, W., GrZymala-Busse, J.W., Ziarko, W.: Data mining and rough set theory. Communications of the ACM 43(4), 108–109 (2000)
Article Google Scholar
Pawlak, Z.: Rough set approach to multi-attribute decision analysis. European Journal of Operational Research 72(3), 443–459 (1994)
Article MATH Google Scholar
Pawlak, Z., Wang, S.K.M., Ziarko, W.: Rough sets: probabilistic versus deterministic approach. Int. J. Man-Machine Studies 29(1), 81–95 (1988)
Article MATH Google Scholar
Wei, J.M.: Rough Set Based Approach to Selection of Node. International Journal of Computational Cognition 1(2), 25–40 (2003)
Google Scholar
Mingers, J.: An empirical comparison of pruning methods for decision-tree induction. Machine Learning 4(2), 319–342 (1989)
Article Google Scholar
Quinlan, J.R., Rivest, R.: Inferring decision trees using the minimum description length principle. Information and Computation 80(3), 227–248 (1989)
Article MATH MathSciNet Google Scholar
Zbigniew, W.R., Zemankova, M.: Imprecise Concept Learning within a Growing Language. In: Proc. the sixth inter. workshop on Machine learning, Ithaca, New York, United States, pp. 314–319 (1989)
Google Scholar
Berzal, F., Cubero, J.C., Cuenca, F., Martín-Bautista, M.J.: On the quest for easy-to-understand splitting rules. Data & Knowledge Engineering 44(1), 31–48 (2003)
Article MATH Google Scholar
Ziarko, W.: Variable precision rough set model. Journal of Computer and System Sciences 46(1), 39–59 (1993)
Article MATH MathSciNet Google Scholar
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Gehrke, J., Ramakrishnan, R., Ganti, V.: RainForest - A Framework for Fast Decision Tree Construction of Large Datasets. Data Mining and Knowledge Discovery 4(2/3), 127–162 (2000)
Article Google Scholar
Rastogi, R., Shim, K.: PUBLIC: A Decision Tree Classifier that integrates building and pruning. Data Mining and Knowledge Discovery 4(4), 315–344 (2000)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

School of Mathematics & Statistics, Northeast Normal University, Jilin, 130024, China
Shuqin Wang
Institute of Computational Intelligence, Northeast Normal University, Jilin, 130024, China
Jinmao Wei & Junping You
Open Symbol Computation and Knowledge Engineering Laboratory of State Education, Jilin University, Jilin, 130024, China
Jinmao Wei & Dayou Liu

Authors

Shuqin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jinmao Wei
View author publications
You can also search for this author in PubMed Google Scholar
Junping You
View author publications
You can also search for this author in PubMed Google Scholar
Dayou Liu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology and Electronic Engineering, The University of Queensland, Queensland, Australia
Xue Li
University of Alberta, Canada
Osmar R. Zaïane
Northwest Polytechnical University, China
Zhanhuai Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, S., Wei, J., You, J., Liu, D. (2006). ComEnVprs: A Novel Approach for Inducing Decision Tree Classifiers. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science(), vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_13

Download citation

DOI: https://doi.org/10.1007/11811305_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics