Abstract
The traditional approach to regression trees involves partitioning the space of predictor variables into subsets that optimise a function of the response variable(s), and then predicting future response values by a single-valued summary statistic in each subset. We believe that a prediction interval is of greater practical use than a single predicted value, and that the criterion for the partitioning should therefore be based on such intervals rather than on single values. We define four potential criteria in the case of a single response variable, discuss computational aspects of producing the partition, evaluate the criteria on both real and simulated data, and draw some tentative conclusions about their relative efficacies. The methodology is extended to the case of multiple response variables, and its viability is demonstrated by application to some further real data. The possibility of fitting distributions to within-subset data is discussed, and some potential extensions are briefly outlined.
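To make the splitting mechanics concrete, the following is a minimal Python sketch of one plausible interval-based criterion: choosing the binary split that minimises the sample-weighted average width of empirical 95% prediction intervals in the two child nodes. The criterion, function names, and parameters here are illustrative assumptions for exposition only; they are not the four criteria defined in the paper, nor the authors' implementation.

```python
# Illustrative sketch only: pick a single binary split minimising the
# weighted average width of empirical 95% prediction intervals in the
# two child subsets. Criterion and names are hypothetical, not the paper's.
import numpy as np

def interval_width(y, lo=2.5, hi=97.5):
    """Width of the empirical [lo, hi] percentile interval of y."""
    a, b = np.percentile(y, [lo, hi])
    return b - a

def best_split(X, y, min_leaf=10):
    """Search all (variable, threshold) pairs; return the split with the
    smallest sample-weighted average child interval width."""
    n, p = X.shape
    best = (np.inf, None, None)  # (criterion value, variable, threshold)
    for j in range(p):
        for t in np.unique(X[:, j])[:-1]:  # drop max so right child is nonempty
            left = X[:, j] <= t
            nl = left.sum()
            nr = n - nl
            if nl < min_leaf or nr < min_leaf:
                continue
            crit = (nl * interval_width(y[left]) +
                    nr * interval_width(y[~left])) / n
            if crit < best[0]:
                best = (crit, j, t)
    return best

# Example: each resulting subset supplies an empirical prediction interval,
# rather than a point prediction, for future responses falling in it.
rng = np.random.default_rng(0)
X = rng.uniform(size=(200, 3))
y = np.where(X[:, 0] > 0.5, rng.normal(5, 1, 200), rng.normal(0, 3, 200))
crit, var, thr = best_split(X, y)
print(f"split on x{var} at {thr:.3f}, avg interval width {crit:.2f}")
```

Applied recursively to each child subset, a search of this kind yields a tree whose leaves carry prediction intervals; the paper's four criteria differ in how the interval quality of a candidate split is scored.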
Cite this article
Krzanowski, W.J., Hand, D.J. A recursive partitioning tool for interval prediction. ADAC 1, 241–254 (2007). https://doi.org/10.1007/s11634-007-0015-y
Keywords
- Binary splitting
- CART
- Clustering
- Criterion optimisation
- Multivariate response variable
- Prediction intervals
- Regression trees