Abstract
Credit scoring is very important for financial institutions. With the advent of machine learning, credit scoring problems can be considered as classification problems. In recent years, credit scoring problems have been attracted to researchers. They explored machine learning and data preprocessing methods for specific datasets. The difficulties of the credit scoring problem reside in the imbalance of datasets and the categorical features. In this paper, we consider a Taiwan credit dataset which is shared publicly. The small number of studies on this dataset motivates us to carry out the investigation. We first proposed methods to transform and balance the dataset and then explore the performance of classical classification models. Finally, we use ensemble learning, namely Voting which combines the results of some classifiers to improve the performance. The experimental results show that our approach is better than the recent publishes and the Voting approach is very promising.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13(2), 281–305 (2012)
Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
Hamori, S., Kawai, M., Kume, T., Murakami, Y., Watanabe, C.: Ensemble learning or deep learning? Application to default risk analysis. J. Risk Fin. Manage. 11(1), 12 (2018). https://doi.org/10.3390/jrfm11010012, https://www.mdpi.com/1911-8074/11/1/12
He, H., Zhang, W., Zhang, S.: A novel ensemble method for credit scoring: adaption of different imbalance ratios. Expert Syst. Appl. 98, 105–117 (2018)
Leong, O.J., Jayabalan, M.: A comparative study on credit card default risk predictive model. J. Comput. Theor. Nanosci. 16(8), 3591–3595 (2019)
Tan, P.N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Pearson Education, Noida (2016)
Xia, Y., Liu, C., Da, B., Xie, F.: A novel heterogeneous ensemble credit scoring model based on bstacking approach. Expert Syst. Appl. 93, 182–199 (2018)
Yeh, I.C., Lien, C.H.: The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients. Expert Syst. Appl. 36(2), 2473–2480 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Tran, D.Q., Nguyen, D.D., Nguyen, H.H., Nguyen, Q.T. (2022). An Ensemble Learning Approach for Credit Scoring Problem: A Case Study of Taiwan Default Credit Card Dataset. In: Le Thi, H.A., Pham Dinh, T., Le, H.M. (eds) Modelling, Computation and Optimization in Information Systems and Management Sciences. MCO 2021. Lecture Notes in Networks and Systems, vol 363. Springer, Cham. https://doi.org/10.1007/978-3-030-92666-3_24
Download citation
DOI: https://doi.org/10.1007/978-3-030-92666-3_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92665-6
Online ISBN: 978-3-030-92666-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)