Skip to main content

An Adaptive and Dynamic Heterogeneous Ensemble Model for Credit Scoring

  • Conference paper
  • First Online:
Digital-for-Development: Enabling Transformation, Inclusion and Sustainability Through ICTs (IDIA 2022)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1774))

Included in the following conference series:

  • 227 Accesses

Abstract

The determination of the financial credibility of a person for a loan is a challenging task as many variables are taken into consideration. Recently, there has been a surge in the application of machine learning approaches in the design of robust and effective credit scoring models as part of the human social development agenda under the assumption that the variables will remain stable for a long time. However, in real-life, the behavior of customers changes over time and the variables used to quantify the financial credibility of a person for a loan such as past performances on debt obligations, profiling, main household, income and demographics tend to drift and evolve over time. This paper considers credit scoring as an ephemeral scenario as variables tend to drift over time and proposes the application of data stream learning techniques in credit scoring since they are tailored for incremental learning. This makes the scoring model to be able to detect and adapt to changes in the customer behavior.

We propose the Adaptive and Dynamic Heterogeneous Ensemble (ADHE) approach that is capable of learning incrementally and adapting to drifting variables and consists of models derived from different learning algorithms to exploit diversity. The prediction performance of ADHE is evaluated using datasets that are publicly available and we compared the accuracy and computational cost of ADHE with existing state of the art models. Our proposed approach performs significantly well when compared to existing state of the art benchmark models on prediction accuracy according to the non-parametric test.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Caldarelli, A., Fiondella, C., Maffei, M., Zagaria, C.: Managing risk in credit cooperative banks: lessons from a case study. Manage. Account. Res. 32, 1–15 (2016)

    Article  Google Scholar 

  2. Frame, W.S., Srinivasan, A., Woosley, L.: The effect of credit scoring on small business lending. J. Money Credit Banking 813–825 (2001)

    Google Scholar 

  3. Crook, J.N., Edelman, D.B., Thomas, L.C.: Recent developments in consumer credit risk assessment. Eur. J. Oper. Res. 183, 1447–1465 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  4. Rudin, C., Wagstaff, K.L.: Machine learning for science and society. Mach. Learn. 95(1), 1–9 (2014)

    Article  MathSciNet  Google Scholar 

  5. Wang, G., Hao, J., Ma, J., Jiang, H.: A comparative assessment of ensemble learning for credit scoring. Expert syst. Appl. 38(1), 223–230 (2011)

    Article  Google Scholar 

  6. Tsai, C.F.: Combining cluster analysis with cluster ensembles to predict financial distress. Inf. Fusion 16, 46–58 (2014)

    Article  Google Scholar 

  7. Xiao, H., Xiao, Z., Wang, Y.: Ensemble classification based on supervised clustering for credit scoring. Appl. Soft Comput. 43, 73–86 (2016)

    Article  Google Scholar 

  8. He, H., Zhang, W., Zhang, S.: A novel ensemble method for credit scoring: adaption of different imbalance ratios. Expert Syst. Appl. 98, 105–117 (2018)

    Article  Google Scholar 

  9. Yao, J., Zhongyi Wang, L., Wang, M.L., Jiang, H., Chen, Y.: A novel hybrid ensemble credit scoring model with stacking based noise detection and weight assignment. Expert Syst. Appl. 198, 15 (2022)

    Article  Google Scholar 

  10. Singh, I., Kumar, N., Srinivasa, K.G., Maini, S., Ahuja, U., Jain, S.: A multi-level classification and modified PSO clustering based ensemble approach for credit scoring. Appl. Soft Comput. 111, 107687 (2021)

    Article  Google Scholar 

  11. Hou, W.H., Wang, X.K., Zhang, H.Y., Wang, J.Q., Li, L.: A novel dynamic ensemble selection classifier for an imbalanced dataset: an application for credit risk assessment. Knowl. Based Syst. 208, 106462 (2020)

    Article  Google Scholar 

  12. Nalic, J., Martinovia, G., Zagar, D.: New hybrid data mining model for credit scoring based on feature selection algorithm and ensemble classifiers. Adv. Eng. Inform. 45, 101130 (2020)

    Article  Google Scholar 

  13. Xia, Y., Zhao, J., He, L., Li, Y., Niu, M.: A novel tree-based dynamic heterogeneous ensemble method for credit scoring. Expert Syst. Appl. 159, 113615 (2020)

    Article  Google Scholar 

  14. Barddal, J.P., Loezer, L., Enembreck, F., Lanzuolo, R.: Lessons learned from data stream classification applied to credit scoring. Expert Syst. Appl. 162, 113899 (2020)

    Article  Google Scholar 

  15. Jin Xiao, X., Zhong, Z.Y., Xie, L., Xin, G., Liu, D.: Cost-Sensitive semi-supervised selective ensemble model for customer credit scoring. Knowl. Based Syst. 189, 15 (2020)

    Google Scholar 

  16. Chen, X., Li, S., Xu, X., Meng, F., Cao, W.: A novel GSCI-based ensemble approach for credit scoring. IEEE Access 8, 222449–222465 (2020)

    Article  Google Scholar 

  17. Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794. ACM (2016)

    Google Scholar 

  18. Wang, S.X., Dong, P.F., Tian, Y.J.: A novel method of statistical line loss estimation for distribution feeders based on feeder cluster and modified XGBoost. Energies 10(12), 2067 (2017)

    Article  Google Scholar 

  19. Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)

    Article  MATH  Google Scholar 

  20. Xia, Y., Li, C., Liu, N.: A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring. Expert Syst. Appl. 78, 225–241 (2017)

    Article  Google Scholar 

  21. Chang, K.H., Chu, H.H., Tong, L.I.: Establish decision tree-based short term default credit risk assessment models. Commun. Stat. Theory Methods 45(23), 6803–6815 (2016)

    Article  MATH  Google Scholar 

  22. Engelbrecht, A.P.: Computational Intelligence: An introduction. Wiley, Chichester (2002)

    Google Scholar 

  23. Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of IEEE International Conference on Neural Networks, Perth, Australia, 27 November 27–1 December 1995, vol. 4 pp. 1942–1948 (1995)

    Google Scholar 

  24. Cruz, R.M., Sabourin, R., Cavalcanti, G.D.: META-DES oracle: meta-learning and feature selection for dynamic ensemble selection. Inf. Fusion 38, 84–103 (2017)

    Article  Google Scholar 

  25. Minku, L.L., Yao, X.: DDD: a new ensemble approach for dealing with concept drift. IEEE Trans. Knowl. Data Eng. 24(4), 619–633 (2012)

    Article  Google Scholar 

  26. Yang, L.: Classifier selection for ensemble learning based on accuracy and diversity. Proc. Eng. 15, 4266–4270 (2011)

    Article  Google Scholar 

  27. Yule, G.: On the association of attributes in statistics. Philos. Trans. Roy. Soc. London Ser. A 194, 257–319 (1900)

    MATH  Google Scholar 

  28. Avery, R.B., Calem, P.S., Canner, G.B.: Consumer credit scoring: do situational circumstances matter? J. Bank. Finance 28, 835–856 (2004)

    Article  Google Scholar 

  29. Asuncion, A., Newman, D.: UCI Machine Learning Repository. Publishing (2007)

    Google Scholar 

  30. Zhang, W., He, H., Zhang, S.: A novel multi-stage hybrid model with enhanced multi-population niche genetic algorithm: an application in credit scoring. Expert Syst. Appl. 121, 221–232 (2019)

    Article  Google Scholar 

  31. Lessmann, S., Baesens, B., Seow, H.V., Thomas, L.C.: Benchmarking state-of-the-art classification algorithms for credit scoring: an update of research. Eur. J. Oper. Res. 247, 124–136 (2015)

    Article  MATH  Google Scholar 

  32. Demsar, J.: Statistical comparison of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgement

All resources used mobilized by the author. This work was not funded by an individual or organization.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tinofirei Museba .

Editor information

Editors and Affiliations

Ethics declarations

The author declares that he has no known competing financial interest or personal relationships that could have appeared to influence the work reported in this paper.

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Museba, T. (2023). An Adaptive and Dynamic Heterogeneous Ensemble Model for Credit Scoring. In: Ndayizigamiye, P., Twinomurinzi, H., Kalema, B., Bwalya, K., Bembe, M. (eds) Digital-for-Development: Enabling Transformation, Inclusion and Sustainability Through ICTs. IDIA 2022. Communications in Computer and Information Science, vol 1774. Springer, Cham. https://doi.org/10.1007/978-3-031-28472-4_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-28472-4_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-28471-7

  • Online ISBN: 978-3-031-28472-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics