Skip to main content

Energy-Based Clustering for Pruning Heterogeneous Ensembles

  • Conference paper
  • First Online:
Artificial Neural Networks and Machine Learning – ICANN 2018 (ICANN 2018)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11139))

Included in the following conference series:

Abstract

In this work, an energy-based clustering method is used to prune heterogeneous ensembles. Specifically, the classifiers are grouped according to their predictions in a set of validation instances that are independent from the ones used to build the ensemble. In the empirical evaluation carried out, the cluster that minimizes the error in the validations set, besides reducing computational costs for storage and the prediction times, is almost as accurate as the complete ensemble. Furthermore, it outperforms subensembles that summarize the complete ensemble by including representatives from each of the identified clusters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Bache, K., Lichman, M.: UCI Machine Learning Repository (2017). http://archive.ics.uci.edu/ml

  • Bakker, B., Heskes, T.: Clustering ensembles of neural network models. Neural Netw. 16, 261–269 (2003)

    Article  Google Scholar 

  • Bezdek, J., Elrich, R., Full, W.: The fuzzy C-means clustering algorithm. Comput. Geosci. 10, 191–203 (1984)

    Article  Google Scholar 

  • Breiman, L.: Bagging predictors. Mach. Learn. 24, 123–140 (1996)

    MathSciNet  MATH  Google Scholar 

  • Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)

    Article  Google Scholar 

  • Buhmann, J., Kühnel, H.: Vector quantization with complexity costs. IEEE Trans. Inf. Theory 39, 1133–1145 (1993)

    Article  Google Scholar 

  • Dietterich, T.G.: Ensemble methods in machine learning. In: Proceedings of Multiple Classifier Systems: First International Workshop, MCs 2000, Cagliari, Italy, 21–23 June 2000, pp. 1–15 (2000)

    Google Scholar 

  • Lobato, D.H., Muñoz, G.M., Suárez, A.: On the independence of the individual predictions in parallel randomized Ensembles. In: 20th European Symposium on Artificial Neural Networks, Bruges (2012)

    Google Scholar 

  • MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)

    Google Scholar 

  • Rose, K.: Statistical mechanics of phase transition in clustering. Phys. Rev. Lett. 65, 945–948 (1990)

    Article  Google Scholar 

  • Rose, K.: Deterministic annealing for clustering, compression, classification, regression and related optimization problems. In: Proceedings for the IEEE, pp. 2210–2239 (1998)

    Article  Google Scholar 

  • Suárez, A., Hernández-Lobato, D., Martínez-Muñoz, G.: An analysis of ensemble pruning techniques based on ordered aggregation. IEEE Trans. Pattern Anal. Mach. Intell. 31, 245–259 (2009)

    Article  Google Scholar 

Download references

Acknowledgements

The authors acknowledge financial support from the Spanish Ministry of Economy, Industry and Competitiveness, project TIN2016-76406-P.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Javier Cela .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Cela, J., Suárez, A. (2018). Energy-Based Clustering for Pruning Heterogeneous Ensembles. In: Kůrková, V., Manolopoulos, Y., Hammer, B., Iliadis, L., Maglogiannis, I. (eds) Artificial Neural Networks and Machine Learning – ICANN 2018. ICANN 2018. Lecture Notes in Computer Science(), vol 11139. Springer, Cham. https://doi.org/10.1007/978-3-030-01418-6_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-01418-6_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-01417-9

  • Online ISBN: 978-3-030-01418-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics