Regression Trees

Torgo, Luís

doi:10.1007/978-0-387-30164-8_711

Luís Torgo

722 Accesses
4 Citations

Synonyms

Decision trees for regression; Piecewise constant models; Tree-based regression

Definition

Regression trees are supervised learning methods that address multiple regression problems. They provide a tree-based approximation \(\hat{f}\), of an unknown regression function \(Y \,=\,f(\mathbf{x}) + \epsilon\) with Y ∈ ℜ and ε ≈ N(0, σ ²), based on a given sample of data \(D = \{\langle {x}_{i,1},\ldots ,{x}_{i,p},{y}_{i}\rangle {\}}_{i=1}^{n}\). The obtained models consist of a hierarchy of logical tests on the values of any of the p predictor variables. The terminal nodes of these trees, known as the leaves, contain the numerical predictions of the model for the target variable Y .

Motivation and Background

Work on regression trees goes back to the AID system by Morgan and Sonquist Morgan and Sonquist (1963). Nonetheless, the seminal work is the book Classification and Regression Trees by Breiman and colleagues (Breiman, Friedman, Olshen, & Stone, 1984). This book has established...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Recommended Reading

Breiman, L., Friedman, J., Olshen, R., & Stone, C. (1984). Classification and regression trees. Statistics/probability series. Wadsworth & Brooks/Cole Advanced Books & Software.
Google Scholar
Breiman, L., & Meisel, W. S. (1976). General estimates of the intrinsic variability of data in nonlinear regression models. Journal of the American Statistical Association, 71, 301–307.
Article MATH Google Scholar
Buja, A., & Lee, Y.-S. (2001). Data mining criteria for tree-based regression and classification. In Proceedings of ACM SIGKDD international conference on knowledge discovery and data mining (pp. 27–36). San Francisco, California, USA.
Google Scholar
Friedman, J. H. (1979). A tree-structured approach to nonparametric multiple regression. In T. Gasser & M. Rosenblatt (Eds.), Smoothing techniques for curve estimation. Lecture notes in mathematics (Vol. 757, pp. 5–22). Berlin: Springer.
Google Scholar
Gama, J. (2004). Functional trees. Machine Learning, 55(3), 219–250.
Article MATH Google Scholar
Li, K. C., Lue, H., & Chen, C. (2000). Interactive tree-structured regression via principal Hessians direction. Journal of the American Statistical Association, 95, 547–560.
Article MATH MathSciNet Google Scholar
Loh, W. (2002). Regression trees with unbiased variable selection and interaction detection. Statistica Sinica, 12, 361–386.
MATH MathSciNet Google Scholar
Lubinsky, D. (1995). Tree structured interpretable regression. In Proceedings of the workshop on AI & statistics.
Google Scholar
Malerba, D., Esposito, F., Ceci, M., & Appice, A. (2004). Top-down induction of model trees with regression and splitting nodes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(5), 612–625.
Article Google Scholar
Morgan, J. N., & Sonquist, J. A. (1963). Problems in the analysis of survey data, and a proposal. Journal of American Statistical Association, 58(302), 415–434.
Article MATH Google Scholar
Robnik-Sikonja, M., & Kononenko, I. (1996). Context-sensitive attribute estimation in regression. In Proceedings of the ICML-96 workshop on learning in context-sensitive domains. Brighton, UK.
Google Scholar
Robnik-Sikonja, M., & Kononenko, I. (1998). Pruning regression trees with MDL. In Proceedings of ECAI-98. Brighton, UK.
Google Scholar
Torgo, L. (1998). Error estimates for pruning regression trees. In C. Nedellec & C. Rouveirol (Eds.), Proceedings of the tenth European conference on machine learning. LNAI (Vol. 1398). London, UK: Springer-Verlag.
Google Scholar
Torgo, L. (1999). Inductive learning of tree-based regression models. PhD thesis, Department of Computer Science, Faculty of Sciences, University of Porto.
Google Scholar
Torgo, L., & Ribeiro, R. (2003). Predicting outliers. In N. Lavrac, D. Gamberger, L. Todorovski, & H. Blockeel (Eds.), Proceedings of principles of data mining and knowledge discovery (PKDD’03). LNAI (Vol. 2838, pp. 447–458). Berlin/Heidelberg: Springer-Verlag.
Google Scholar

Download references

Author information

Authors and Affiliations

Authors

Luís Torgo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science and Engineering, University of New South Wales, Sydney, Australia, 2052
Claude Sammut
Faculty of Information Technology, Clayton School of Information Technology, Monash University, P.O. Box 63, Victoria, Australia, 3800
Geoffrey I. Webb

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Torgo, L. (2011). Regression Trees. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-30164-8_711

Download citation

DOI: https://doi.org/10.1007/978-0-387-30164-8_711
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-30768-8
Online ISBN: 978-0-387-30164-8
eBook Packages: Computer ScienceReference Module Computer Science and Engineering

Publish with us

Policies and ethics