Abstract
In this work, an energy-based clustering method is used to prune heterogeneous ensembles. Specifically, the classifiers are grouped according to their predictions in a set of validation instances that are independent from the ones used to build the ensemble. In the empirical evaluation carried out, the cluster that minimizes the error in the validations set, besides reducing computational costs for storage and the prediction times, is almost as accurate as the complete ensemble. Furthermore, it outperforms subensembles that summarize the complete ensemble by including representatives from each of the identified clusters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bache, K., Lichman, M.: UCI Machine Learning Repository (2017). http://archive.ics.uci.edu/ml
Bakker, B., Heskes, T.: Clustering ensembles of neural network models. Neural Netw. 16, 261–269 (2003)
Bezdek, J., Elrich, R., Full, W.: The fuzzy C-means clustering algorithm. Comput. Geosci. 10, 191–203 (1984)
Breiman, L.: Bagging predictors. Mach. Learn. 24, 123–140 (1996)
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)
Buhmann, J., Kühnel, H.: Vector quantization with complexity costs. IEEE Trans. Inf. Theory 39, 1133–1145 (1993)
Dietterich, T.G.: Ensemble methods in machine learning. In: Proceedings of Multiple Classifier Systems: First International Workshop, MCs 2000, Cagliari, Italy, 21–23 June 2000, pp. 1–15 (2000)
Lobato, D.H., Muñoz, G.M., Suárez, A.: On the independence of the individual predictions in parallel randomized Ensembles. In: 20th European Symposium on Artificial Neural Networks, Bruges (2012)
MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)
Rose, K.: Statistical mechanics of phase transition in clustering. Phys. Rev. Lett. 65, 945–948 (1990)
Rose, K.: Deterministic annealing for clustering, compression, classification, regression and related optimization problems. In: Proceedings for the IEEE, pp. 2210–2239 (1998)
Suárez, A., Hernández-Lobato, D., Martínez-Muñoz, G.: An analysis of ensemble pruning techniques based on ordered aggregation. IEEE Trans. Pattern Anal. Mach. Intell. 31, 245–259 (2009)
Acknowledgements
The authors acknowledge financial support from the Spanish Ministry of Economy, Industry and Competitiveness, project TIN2016-76406-P.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Cela, J., Suárez, A. (2018). Energy-Based Clustering for Pruning Heterogeneous Ensembles. In: Kůrková, V., Manolopoulos, Y., Hammer, B., Iliadis, L., Maglogiannis, I. (eds) Artificial Neural Networks and Machine Learning – ICANN 2018. ICANN 2018. Lecture Notes in Computer Science(), vol 11139. Springer, Cham. https://doi.org/10.1007/978-3-030-01418-6_34
Download citation
DOI: https://doi.org/10.1007/978-3-030-01418-6_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01417-9
Online ISBN: 978-3-030-01418-6
eBook Packages: Computer ScienceComputer Science (R0)