Improving the Predictive Power of AdaBoost: A Case Study in Classifying Borrowers

Boonyanunta, Natthaphan; Zeephongsekul, Panlop

doi:10.1007/3-540-45034-3_68

Natthaphan Boonyanunta^3,4 &
Panlop Zeephongsekul³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2718))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

3719 Accesses

Abstract

Boosting is one of the recent major developments in classification methods. The technique works by creating different versions of a classifier using an adaptive resampling procedure and then combining these classifiers using weighted voting. In this paper, several modifications of the original version of boosting, the AdaBoost algorithm introduced by Y. Freund and R.E. Schapire in 1996, will be explained. These will be shown to substantially improve the predictive power of the original version. In the first modification, weighted error estimation in AdaBoost is replaced by unweighted error estimation and this is designed to reduce the impact of observations that possess large weight. In the second modification, only a selection of base classifiers, i.e. those that contribute significantly to predictive power of the boosting model, will be included in the final model. In addition to these two modifications, we will also utilise different classification techniques as base classifiers in order to product a final boosting model. Applying these proposed modifications to three data sets from the banking industry provides results which indicate a significant and substantial improvement in predictive power over the original AdaBoost algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A systematic approach for learning imbalanced data: enhancing zero-inflated models through boosting

Article Open access 08 July 2024

Boosting

Generalized Estimating Equations Boosting (GEEB) machine for correlated data

Article Open access 22 January 2024

References

Schapire, R. E.: The strength of weak learnability. Machine Learning 5 (1990) 197–227
Google Scholar
Freund, Y., Schapire, R.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. System Sci. 55 (1997) 119–139
Article MATH MathSciNet Google Scholar
Freund, Y., Schapire, R.: Experiment with a new boosting algorithm. In Machine Learning: Proceedings of the Thirteenth International Conferences (L. Saitta, ed.). Morgan Kaufmann, San Francisco (1996) pp 148–156
Google Scholar
Friedman, J., Hastie, T., Tibshirani, R.: Additive Logistic Regression: A Statistical View of Boosting. The Annals of Statistics 28(2) (2000) 337–407
Article MATH MathSciNet Google Scholar
Breiman, L.: Arcing Classifiers. The Annals of Statistics 26(3) (1998) 801–849
Article MATH MathSciNet Google Scholar
Krogh, A., Vedelsby, J.: Neural network ensembles, cross validation, and active learning. Tesauro, G., Touretzky, D., Leen, T. (Eds.), Advances in Neural Information Processing Systems, Vol. 7. MIT Press, Boston, MA (1995)
Google Scholar
Webb, G. I.: MultiBoosting: A Technique for Combining Boosting and Wagging. Machine Learning. 40 (2000) 159–196
Article MathSciNet Google Scholar
Breiman, L., Friedman, J. H., Olshen, R. A., Stone, C. J.: Classification and Regression Trees. Wadsworth Internation Group, Belmont, CA (1984)
MATH Google Scholar
Thomas, L. C.: A survey of credit and behavioural scoring: forecasting finance risk of lending to consumers. International Journal of Forecasting 16 (2000) 149–172
Article MATH Google Scholar
Schapire, R., Freund, Y., Bartlett, P., Lee, W. S.: Boosting the margin: A new explanation for effectiveness of voting methods. The Annals of Statistics. 26 (1998) 1651–1686
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Statistics, RMIT University, Melbourne, Australia
Natthaphan Boonyanunta & Panlop Zeephongsekul
Experian Asia Pacific, Melbourne, Australia
Natthaphan Boonyanunta

Authors

Natthaphan Boonyanunta
View author publications
You can also search for this author in PubMed Google Scholar
Panlop Zeephongsekul
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computer Science, Loughborough University, Loughborough, LE11 3TU, England
Paul W. H. Chung & Chris Hinde &
Dept. of Computer Science, Southwest Texas State University, 601 University Drive, San Marcos, TX, 78666, USA
Moonis Ali

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Boonyanunta, N., Zeephongsekul, P. (2003). Improving the Predictive Power of AdaBoost: A Case Study in Classifying Borrowers. In: Chung, P.W.H., Hinde, C., Ali, M. (eds) Developments in Applied Artificial Intelligence. IEA/AIE 2003. Lecture Notes in Computer Science(), vol 2718. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45034-3_68

Download citation

DOI: https://doi.org/10.1007/3-540-45034-3_68
Published: 24 June 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40455-2
Online ISBN: 978-3-540-45034-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics