Summary
The construction of classification trees involves three basic choices: the type of splits considered during the growing process, the criterion to be optimized at each step, and the method for obtaining right-sized trees. Most implementations produce ordinary binary trees, i.e. trees whose successive cuts are made by hyperplanes perpendicular to the coordinate axes. L. Devroye, L. Györfi and G. Lugosi (1996) define, and establish remarkable theoretical properties of, a binary tree classifier whose prominent feature is the particular type of split used in its construction: at a given node, the partition is made by hyper-rectangles rather than hyperplanes. We propose an approximation to the solution of the complex optimization problem involved, which allows insight into the practical advantages of such trees. We then compare the performance of our algorithm with that of leading algorithms for ordinary binary trees, namely CART and C4.5, as implemented in the S-Plus “tree” procedure and in SAS Enterprise Miner respectively.
References
Breiman, L., Friedman, J., Olshen, R. & Stone, C. (1984), Classification and Regression Trees, Wadsworth International, Belmont, CA.
Buntine, W. & Caruana, R. (1992), Introduction to IND Version 2.1 and Recursive Partitioning, NASA Ames Research Center, Moffett Field, CA.
Clark, L. & Pregibon, D. (1993), Tree-based models, in J. Chambers & T. Hastie, eds, ‘Statistical Models in S’, Chapman & Hall, New York, NY, pp. 377–419.
Devroye, L., Györfi, L. & Lugosi, G. (1996), A Probabilistic Theory of Pattern Recognition, Springer-Verlag, New York.
Esposito, F., Malerba, D. & Semeraro, G. (1997), ‘A comparative analysis of methods for pruning decision trees’, IEEE Transactions on Pattern Analysis and Machine Intelligence 19(5), 476–491.
Friedman, J. & Fisher, N. (1999), ‘Bump hunting in high-dimensional data’, Statistics and Computing 9(2), 123–143.
Michie, D., Spiegelhalter, D. J. & Taylor, C. C. (1994), Machine Learning, Neural and Statistical Classification, Ellis Horwood.
Muller, W. & Wysotzki, F. (1997), The decision tree algorithm CAL5 based on a statistical approach to its splitting algorithm, in ‘Machine Learning and Statistics: The Interface’, John Wiley & Sons, New York, NY, pp. 45–65.
Murphy, P. M. & Aha, D. W. (1996), UCI Repository of Machine Learning Databases, Department of Information and Computer Science, University of California, Irvine, CA.
Quinlan, J. R. (1993), C4.5: Programs for Machine Learning, Morgan Kaufmann, San Mateo, CA.
Shih, Y.-S., Lim, T.-S. & Loh, W.-Y. (2000), ‘A comparison of prediction accuracy, complexity and training time of thirty-three old and new classification algorithms’, Machine Learning 40, 203–228.
Venables, W. & Ripley, B. (1994), Modern Applied Statistics with S-Plus, Springer-Verlag, New York, NY.
Additional information
Research support from the “Projet d’Actions de Recherche Concertées” (No. 98/03-217) and from the “Interuniversity Attraction Pole”, Phase V (No. P5/24), of the Belgian Government is gratefully acknowledged.
Cite this article
De Macq, I., Simar, L. Hyper-rectangular space partitioning trees: A practical approach. Computational Statistics 20, 119–135 (2005). https://doi.org/10.1007/BF02736126