Abstract
Bagging is an ensemble method proposed to improve the predictive performance of learning algorithms, being specially effective when applied to unstable predictors. It is based on the aggregation of a certain number of prediction models, each one generated from a bootstrap sample of the available training set. We introduce an alternative method for bagging classification models, motivated by the reduced bootstrap methodology, where the generated bootstrap samples are forced to have a number of distinct original observations between two values k 1 and k 2. Five choices for k 1 and k 2 are considered, and the five resulting models are empirically studied and compared with bagging on three real data sets, employing classification trees and neural networks as the base learners. This comparison reveals for this reduced bagging technique a trend to diminish the mean and the variance of the error rate.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Breiman, L.: Bagging Predictors. Mach. Learn. 24, 123–140 (1996)
Bühlman, P., Yu, B.: Analyzing Bagging. Ann. Stat. 30(4), 927–961 (2002)
Buja, A., Stuetzle, W.: The effect of bagging on variance, bias, and mean squared error. AT&T Labs-Research (2000) (preprint)
Rao, C.R., Pathak, P.K., Koltchinskii, V.I.: Bootstrap by sequential resampling. J. Statist. Plan. Infer. 64, 257–281 (1997)
Jiménez-Gamero, M.D., Muñoz-García, J., Pino-Mejías, R.: Reduced bootstrap for the median. Stat. Sinica (2004) (in press)
Efron, B.: Bootstrap methods: another look at the jackknife. Ann. Stat. 7, 1–26 (1979)
Muñoz-García, J., Pino-Mejías, R., Muñoz-Pichardo, J.M., Cubiles-de-la-Vega, M.D.: Identification of outlier bootstrap samples. J. Appl. Stat. 24(3), 333–342 (1997)
Hall, P.: Antithetic resampling for the bootstrap. Biometrika, 713–724 (1989)
Johns, M.V.: Importance sampling for bootstrap confidence intervals. J. Am. Stat. Assoc. 83, 709–714 (1988)
Jiménez-Gamero, M.D., Muñoz-García, J., Muñoz-Reyes, A., Pino-Mejías, R.: On Efronś method II with identification of outlier bootstrap samples. Computation. Stat. 13, 301–318 (1998)
Ihaka, R., Gentleman, R.: R: A Language for Data Analysis and Graphics. J. Comput. Graph. Stat. 5, 299–314 (1996)
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth (1984)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer, Heidelberg (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pino-Mejías, R., Cubiles-de-la-Vega, MD., López-Coello, M., Silva-Ramírez, EL., Jiménez-Gamero, MD. (2004). Bagging Classification Models with Reduced Bootstrap. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A.C., de Ridder, D. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR /SPR 2004. Lecture Notes in Computer Science, vol 3138. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27868-9_106
Download citation
DOI: https://doi.org/10.1007/978-3-540-27868-9_106
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22570-6
Online ISBN: 978-3-540-27868-9
eBook Packages: Springer Book Archive