CTCHAID: Extending the Application of the Consolidation Methodology

Ibarguren, Igor; Pérez, Jesús María; Muguerza, Javier

doi:10.1007/978-3-319-23485-4_56

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9273)

Included in the following conference series:

Portuguese Conference on Artificial Intelligence

Abstract

The consolidation process, originally applied to the C4.5 tree induction algorithm, improved its discriminating capacity and stability. Consolidation creates multiple samples and builds a simple (non-multiple) classifier by applying the ensemble process during model construction. A benefit of consolidation is that the understandability of the base classifier is preserved. This work aims to show that consolidation can improve algorithms other than C4.5 by applying it to another algorithm, CHAID*. The consolidation of CHAID*, named CTCHAID, required solving the problem of consolidating the value groupings that each CHAID* tree proposes for discrete attributes. The experimentation is divided into three classification contexts, for a total of 96 datasets. Results show that consolidated algorithms perform robustly, ranking competitively in all contexts and never falling into the lower positions, unlike most of the other 23 rule induction algorithms considered in the study. In a global comparison, consolidated algorithms rank first.
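
The idea of applying the ensemble process during model construction can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each subsample proposes a split attribute for the current node and that a majority vote fixes the attribute used by the single tree. The raw Pearson chi-square statistic stands in for the significance tests and value grouping of CHAID*, which are omitted here. All function names (`chi_square`, `proposed_attribute`, `consolidated_split`) and the toy data are hypothetical.

```python
import random
from collections import Counter


def chi_square(rows, attr, target):
    """Pearson chi-square statistic of the attr-by-target contingency table."""
    n = len(rows)
    attr_counts = Counter(r[attr] for r in rows)
    class_counts = Counter(r[target] for r in rows)
    cell_counts = Counter((r[attr], r[target]) for r in rows)
    stat = 0.0
    for a, n_a in attr_counts.items():
        for c, n_c in class_counts.items():
            expected = n_a * n_c / n
            observed = cell_counts.get((a, c), 0)
            stat += (observed - expected) ** 2 / expected
    return stat


def proposed_attribute(rows, attrs, target):
    """Attribute one subsample would choose (simplified: largest statistic)."""
    return max(attrs, key=lambda a: chi_square(rows, a, target))


def consolidated_split(samples, attrs, target):
    """Assumed consolidation step: every subsample votes, the majority wins."""
    votes = Counter(proposed_attribute(s, attrs, target) for s in samples)
    return votes.most_common(1)[0][0]


# Toy data: attribute "a" determines the class "y", attribute "b" does not.
rows = [{"a": i % 2, "b": (i // 2) % 2, "y": i % 2} for i in range(40)]
rng = random.Random(0)
samples = [rng.sample(rows, 30) for _ in range(5)]  # five subsamples
chosen = consolidated_split(samples, ["a", "b"], "y")
print(chosen)
```

In this sketch all subsamples would then be partitioned by the same `chosen` attribute, so a single tree grows while every node decision reflects several samples. Consolidating the value groupings of a discrete attribute, the problem the abstract highlights, would need an additional agreement step that is not shown.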

Author information

Authors and Affiliations

Department of Computer Architecture and Technology, University of the Basque Country UPV/EHU, Manuel Lardizabal 1, 20018, Donostia, Spain
Igor Ibarguren, Jesús María Pérez & Javier Muguerza

Corresponding author

Correspondence to Igor Ibarguren.

Editor information

Editors and Affiliations

ISEC - Coimbra Institute of Engineering, Polytechnic Institute of Coimbra, Coimbra, Portugal
Francisco Pereira
CISUC, Department of Informatics Engineering, University of Coimbra, Coimbra, Portugal
Penousal Machado
CISUC, Department of Informatics Engineering, University of Coimbra, Coimbra, Portugal
Ernesto Costa
CISUC, Department of Informatics Engineering, University of Coimbra, Coimbra, Portugal
Amílcar Cardoso

About this paper

Cite this paper

Ibarguren, I., Pérez, J.M., Muguerza, J. (2015). CTCHAID: Extending the Application of the Consolidation Methodology. In: Pereira, F., Machado, P., Costa, E., Cardoso, A. (eds) Progress in Artificial Intelligence. EPIA 2015. Lecture Notes in Computer Science, vol 9273. Springer, Cham. https://doi.org/10.1007/978-3-319-23485-4_56

DOI: https://doi.org/10.1007/978-3-319-23485-4_56
Published: 25 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23484-7
Online ISBN: 978-3-319-23485-4
eBook Packages: Computer Science, Computer Science (R0)