Abstract
This paper presents a study of one particular problem of decision tree induction, namely (post-)pruning, with the aim of finding a common framework for the plethora of pruning methods that have appeared in the literature. Given a tree Tmax to prune, a state space is defined as the set of all subtrees of Tmax, to which a single operator, called the any-depth branch pruning operator, can be applied in several ways in order to move from one state to another. By introducing an evaluation function f defined on the set of subtrees, the problem of tree pruning can be cast as an optimization problem, and each post-pruning method can be classified according to both its search strategy and the kind of information exploited by f. Indeed, while some methods use only the training set to evaluate the accuracy of a decision tree, others exploit an additional pruning set, which allows them to obtain less biased estimates of the predictive accuracy of a pruned tree. The introduction of the state space shows that the post-pruning methods considered use very simple search strategies. Finally, some empirical results allow theoretical observations on the strengths and weaknesses of pruning methods to be better understood.
Copyright information
© 1993 Springer-Verlag Berlin Heidelberg
Cite this paper
Esposito, F., Malerba, D., Semeraro, G. (1993). Decision tree pruning as a search in the state space. In: Brazdil, P.B. (eds) Machine Learning: ECML-93. ECML 1993. Lecture Notes in Computer Science, vol 667. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-56602-3_135
Print ISBN: 978-3-540-56602-1
Online ISBN: 978-3-540-47597-2