Skip to main content

CTCHAID: Extending the Application of the Consolidation Methodology

  • Conference paper
  • First Online:
Progress in Artificial Intelligence (EPIA 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9273))

Included in the following conference series:

  • 4057 Accesses

Abstract

The consolidation process, originally applied to the C4.5 tree induction algorithm, improved its discriminating capacity and stability. Consolidation creates multiple samples and builds a simple (non-multiple) classifier by applying the ensemble process during the model construction times. A benefit of consolidation is that the understandability of the base classifier is kept. The work presented aims to show the consolidation process can improve algorithms other than C4.5 by applying the consolidation process to another algorithm, CHAID*. The consolidation of CHAID*, CTCHAID, required solving the handicap of consolidating the value groupings proposed by each CHAID* tree for discrete attributes. The experimentation is divided in three classification contexts for a total of 96 datasets. Results show that consolidated algorithms perform robustly, ranking competitively in all contexts, never falling into lower positions unlike most of the other 23 rule inducting algorithms considered in the study. When performing a global comparison consolidated algorithms rank first.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Abbasian, H., Drummond, C., Japkowicz, N., Matwin, S.: Inner ensembles: using ensemble methods inside the learning algorithm. In: Blockeel, H., Kersting, K., Nijssen, S., Železný, F. (eds.) ECML PKDD 2013, Part III. LNCS, vol. 8190, pp. 33–48. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  2. Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 16(1), 321–357 (2002)

    MATH  Google Scholar 

  3. Fernández, A., Garcia, S., Luengo, J., Bernadó-Mansilla, E., Herrera, F.: Genetics-based machine learning for rule induction: State of the art, taxonomy, and comparative study. IEEE Transactions on Evolutionary Computation 14(6), 913–941 (2010)

    Article  Google Scholar 

  4. García, S., Fernández, A., Luengo, J., Herrera, F.: Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power. Information Sciences 180(10), 2044–2064 (2010)

    Article  Google Scholar 

  5. Ibarguren, I., Lasarguren, A., Pérez, J.M., Muguerza, J., Arbelaitz, O., Gurrutxaga, I.: BFPART: Best-first PART. Submitted to Information Sciences

    Google Scholar 

  6. Ibarguren, I., Pérez, J.M., Muguerza, J., Gurrutxaga, I., Arbelaitz, O.: Coverage-based resampling: Building robust consolidated decision trees. Knowledge-Based Systems 79, 51–67 (2015)

    Article  Google Scholar 

  7. Kass, G.V.: Significance testing in automatic interaction detection (a.i.d.). Journal of the Royal Statistical Society. Series C (Applied Statistics) 24(2), 178–189 (1975)

    Google Scholar 

  8. Morgan, J.A., Sonquist, J.N.: Problems in the analysis of survey data, and a proposal. J. Amer. Statistics Ass. 58, 415–434 (1963)

    Article  MATH  Google Scholar 

  9. Pérez, J.M., Muguerza, J., Arbelaitz, O., Gurrutxaga, I., Martín, J.I.: Combining multiple class distribution modified subsamples in a single tree. Pattern Recognition Letters 28(4), 414–422 (2007)

    Article  Google Scholar 

  10. Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco (1993)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Igor Ibarguren .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Ibarguren, I., Pérez, J.M., Muguerza, J. (2015). CTCHAID: Extending the Application of the Consolidation Methodology. In: Pereira, F., Machado, P., Costa, E., Cardoso, A. (eds) Progress in Artificial Intelligence. EPIA 2015. Lecture Notes in Computer Science(), vol 9273. Springer, Cham. https://doi.org/10.1007/978-3-319-23485-4_56

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23485-4_56

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23484-7

  • Online ISBN: 978-3-319-23485-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics