Decision Trees Using the Minimum Entropy-of-Error Principle

de Sá, J. P. Marques; Gama, João; Sebastião, Raquel; Alexandre, Luís A.

doi:10.1007/978-3-642-03767-2_97

J. P. Marques de Sá¹⁸,
João Gama¹⁹,
Raquel Sebastião²⁰ &
…
Luís A. Alexandre²¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5702))

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

1782 Accesses
2 Citations

Abstract

Binary decision trees based on univariate splits have traditionally employed so-called impurity functions as a means of searching for the best node splits. Such functions use estimates of the class distributions. In the present paper we introduce a new concept to binary tree design: instead of working with the class distributions of the data we work directly with the distribution of the errors originated by the node splits. Concretely, we search for the best splits using a minimum entropy-of-error (MEE) strategy. This strategy has recently been applied in other areas (e.g. regression, clustering, blind source separation, neural network training) with success. We show that MEE trees are capable of producing good results with often simpler trees, have interesting generalization properties and in the many experiments we have performed they could be used without pruning.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Enhancing techniques for learning decision trees from imbalanced data

Article 02 March 2019

Ensembles of Nested Dichotomies with Multiple Subset Evaluation

Reduction Stumps for Multi-class Classification

References

Asuncion, A., Newman, D.J.: UCI Machine Learning Repository. Univ. of California, SICS, Irvine, CA (2007), http://www.ics.uci.edu/~mlearn/MLRepository.html
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Chapman & Hall/CRC, Boca Raton (1993)
Google Scholar
Devroye, L., Giörfi, L., Lugosi, G.: A Probabilistic Theory of Pattern Recognition. Springer, Heidelberg (1996)
MATH Google Scholar
Forina, M., Armanino, C.: Eigenvector Projection and Simplified Nonlinear Mapping of Fatty Acid Content of Italian Olive Oils. Ann. Chim. 72, 127–155 (1981)
Google Scholar
Loh, W.-Y., Shih, Y.-S.: Split Selection Methods for Classification Trees. Statistica Sinica 7, 815–840 (1997)
MATH MathSciNet Google Scholar
Marques de Sá, J.P.: Applied Statistics Using SPSS, STATISTICA, MATLAB and R, 2nd edn. Springer, Heidelberg (2007)
MATH Google Scholar
Quinlan, J.R.: Induction of Decision Trees. Machine Learning 1, 81–106 (1986)
Google Scholar
Quinlan, J.R.: C4.5 Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Silva, L.M., Felgueiras, C.S., Alexandre, L.A., Marques de Sá, J.: Error Entropy in Classification Problems: A Univariate Data Analysis. Neural Comp. 18, 2036–2061 (2006)
Article MATH Google Scholar
Silva, L.M., Embrechts, M.J., Santos, J.M., de Sá, J.M.: The influence of the risk functional in data classification with mLPs. In: Kůrková, V., Neruda, R., Koutník, J. (eds.) ICANN 2008, Part I. LNCS, vol. 5163, pp. 185–194. Springer, Heidelberg (2008)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

INEB-Instituto de Engenharia Biomédica, Porto, Portugal
J. P. Marques de Sá
LIAAD - INESC Porto, L.A. and Faculty of Economics, Porto, Portugal
João Gama
LIAAD - INESC Porto, L.A. and Faculty of Science, Porto, Portugal
Raquel Sebastião
Informatics Dept., Univ. Beira Interior, Networks and Multim. Group, Covilhã, Portugal
Luís A. Alexandre

Authors

J. P. Marques de Sá
View author publications
You can also search for this author in PubMed Google Scholar
João Gama
View author publications
You can also search for this author in PubMed Google Scholar
Raquel Sebastião
View author publications
You can also search for this author in PubMed Google Scholar
Luís A. Alexandre
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics and Computer Science, University of Münster, Einsteinstrasse 62, 48149, Münster, Germany
Xiaoyi Jiang
Institute of Mathematics and Computing Science, University of Groningen, Nijenborgh 9, 9747, Groningen, AG, The Netherlands
Nicolai Petkov

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

de Sá, J.P.M., Gama, J., Sebastião, R., Alexandre, L.A. (2009). Decision Trees Using the Minimum Entropy-of-Error Principle. In: Jiang, X., Petkov, N. (eds) Computer Analysis of Images and Patterns. CAIP 2009. Lecture Notes in Computer Science, vol 5702. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03767-2_97

Download citation

DOI: https://doi.org/10.1007/978-3-642-03767-2_97
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03766-5
Online ISBN: 978-3-642-03767-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics