An Ensemble Learning Approach for Credit Scoring Problem: A Case Study of Taiwan Default Credit Card Dataset

  • Conference paper
Modelling, Computation and Optimization in Information Systems and Management Sciences (MCO 2021)

Abstract

Credit scoring is very important for financial institutions. With the advent of machine learning, credit scoring problems can be treated as classification problems. In recent years, credit scoring problems have attracted the attention of researchers, who have explored machine learning and data preprocessing methods for specific datasets. The difficulties of the credit scoring problem reside in the imbalance of datasets and the categorical features. In this paper, we consider a publicly shared Taiwan credit dataset. The small number of studies on this dataset motivates us to carry out the investigation. We first propose methods to transform and balance the dataset and then explore the performance of classical classification models. Finally, we use ensemble learning, namely Voting, which combines the results of several classifiers, to improve the performance. The experimental results show that our approach outperforms recently published results and that the Voting approach is very promising.
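
The abstract does not say whether the Voting ensemble uses hard (majority) or soft (probability-averaged) voting. As a minimal sketch only, assuming hard voting over binary labels, the combination step could look like the following. The function name `hard_vote`, the tie-breaking rule, and the three example prediction lists are illustrative assumptions, not taken from the paper.

```python
from collections import Counter
from typing import List, Sequence


def hard_vote(predictions: Sequence[Sequence[int]]) -> List[int]:
    """Combine per-classifier label predictions by majority vote.

    predictions[i][j] is the label that classifier i assigns to sample j.
    Ties are broken in favour of the smallest label (an arbitrary choice
    made for this sketch, not something stated in the paper).
    """
    if not predictions:
        raise ValueError("need at least one classifier's predictions")
    n_samples = len(predictions[0])
    if any(len(p) != n_samples for p in predictions):
        raise ValueError("all classifiers must predict the same samples")

    combined = []
    # zip(*predictions) yields, for each sample, the votes of all classifiers.
    for sample_votes in zip(*predictions):
        counts = Counter(sample_votes)
        top = max(counts.values())
        combined.append(min(label for label, c in counts.items() if c == top))
    return combined


# Hypothetical outputs of three classifiers on five clients
# (1 = default, 0 = no default); these values are made up for illustration.
clf_a = [1, 0, 0, 1, 0]
clf_b = [1, 1, 0, 0, 0]
clf_c = [0, 1, 0, 1, 1]
ensemble = hard_vote([clf_a, clf_b, clf_c])
print(ensemble)
```

With an odd number of binary classifiers no ties can occur, so the tie-breaking rule only matters when the number of voters is even or there are more than two classes.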

Notes

  1. https://github.com/doandongnguyen/TaiwanCreditScoring

Corresponding author

Correspondence to Duc Quynh Tran.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Tran, D.Q., Nguyen, D.D., Nguyen, H.H., Nguyen, Q.T. (2022). An Ensemble Learning Approach for Credit Scoring Problem: A Case Study of Taiwan Default Credit Card Dataset. In: Le Thi, H.A., Pham Dinh, T., Le, H.M. (eds) Modelling, Computation and Optimization in Information Systems and Management Sciences. MCO 2021. Lecture Notes in Networks and Systems, vol 363. Springer, Cham. https://doi.org/10.1007/978-3-030-92666-3_24
