Abstract
The determination of the financial credibility of a person for a loan is a challenging task as many variables are taken into consideration. Recently, there has been a surge in the application of machine learning approaches in the design of robust and effective credit scoring models as part of the human social development agenda under the assumption that the variables will remain stable for a long time. However, in real-life, the behavior of customers changes over time and the variables used to quantify the financial credibility of a person for a loan such as past performances on debt obligations, profiling, main household, income and demographics tend to drift and evolve over time. This paper considers credit scoring as an ephemeral scenario as variables tend to drift over time and proposes the application of data stream learning techniques in credit scoring since they are tailored for incremental learning. This makes the scoring model to be able to detect and adapt to changes in the customer behavior.
We propose the Adaptive and Dynamic Heterogeneous Ensemble (ADHE) approach that is capable of learning incrementally and adapting to drifting variables and consists of models derived from different learning algorithms to exploit diversity. The prediction performance of ADHE is evaluated using datasets that are publicly available and we compared the accuracy and computational cost of ADHE with existing state of the art models. Our proposed approach performs significantly well when compared to existing state of the art benchmark models on prediction accuracy according to the non-parametric test.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Caldarelli, A., Fiondella, C., Maffei, M., Zagaria, C.: Managing risk in credit cooperative banks: lessons from a case study. Manage. Account. Res. 32, 1–15 (2016)
Frame, W.S., Srinivasan, A., Woosley, L.: The effect of credit scoring on small business lending. J. Money Credit Banking 813–825 (2001)
Crook, J.N., Edelman, D.B., Thomas, L.C.: Recent developments in consumer credit risk assessment. Eur. J. Oper. Res. 183, 1447–1465 (2007)
Rudin, C., Wagstaff, K.L.: Machine learning for science and society. Mach. Learn. 95(1), 1–9 (2014)
Wang, G., Hao, J., Ma, J., Jiang, H.: A comparative assessment of ensemble learning for credit scoring. Expert syst. Appl. 38(1), 223–230 (2011)
Tsai, C.F.: Combining cluster analysis with cluster ensembles to predict financial distress. Inf. Fusion 16, 46–58 (2014)
Xiao, H., Xiao, Z., Wang, Y.: Ensemble classification based on supervised clustering for credit scoring. Appl. Soft Comput. 43, 73–86 (2016)
He, H., Zhang, W., Zhang, S.: A novel ensemble method for credit scoring: adaption of different imbalance ratios. Expert Syst. Appl. 98, 105–117 (2018)
Yao, J., Zhongyi Wang, L., Wang, M.L., Jiang, H., Chen, Y.: A novel hybrid ensemble credit scoring model with stacking based noise detection and weight assignment. Expert Syst. Appl. 198, 15 (2022)
Singh, I., Kumar, N., Srinivasa, K.G., Maini, S., Ahuja, U., Jain, S.: A multi-level classification and modified PSO clustering based ensemble approach for credit scoring. Appl. Soft Comput. 111, 107687 (2021)
Hou, W.H., Wang, X.K., Zhang, H.Y., Wang, J.Q., Li, L.: A novel dynamic ensemble selection classifier for an imbalanced dataset: an application for credit risk assessment. Knowl. Based Syst. 208, 106462 (2020)
Nalic, J., Martinovia, G., Zagar, D.: New hybrid data mining model for credit scoring based on feature selection algorithm and ensemble classifiers. Adv. Eng. Inform. 45, 101130 (2020)
Xia, Y., Zhao, J., He, L., Li, Y., Niu, M.: A novel tree-based dynamic heterogeneous ensemble method for credit scoring. Expert Syst. Appl. 159, 113615 (2020)
Barddal, J.P., Loezer, L., Enembreck, F., Lanzuolo, R.: Lessons learned from data stream classification applied to credit scoring. Expert Syst. Appl. 162, 113899 (2020)
Jin Xiao, X., Zhong, Z.Y., Xie, L., Xin, G., Liu, D.: Cost-Sensitive semi-supervised selective ensemble model for customer credit scoring. Knowl. Based Syst. 189, 15 (2020)
Chen, X., Li, S., Xu, X., Meng, F., Cao, W.: A novel GSCI-based ensemble approach for credit scoring. IEEE Access 8, 222449–222465 (2020)
Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794. ACM (2016)
Wang, S.X., Dong, P.F., Tian, Y.J.: A novel method of statistical line loss estimation for distribution feeders based on feeder cluster and modified XGBoost. Energies 10(12), 2067 (2017)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
Xia, Y., Li, C., Liu, N.: A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring. Expert Syst. Appl. 78, 225–241 (2017)
Chang, K.H., Chu, H.H., Tong, L.I.: Establish decision tree-based short term default credit risk assessment models. Commun. Stat. Theory Methods 45(23), 6803–6815 (2016)
Engelbrecht, A.P.: Computational Intelligence: An introduction. Wiley, Chichester (2002)
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of IEEE International Conference on Neural Networks, Perth, Australia, 27 November 27–1 December 1995, vol. 4 pp. 1942–1948 (1995)
Cruz, R.M., Sabourin, R., Cavalcanti, G.D.: META-DES oracle: meta-learning and feature selection for dynamic ensemble selection. Inf. Fusion 38, 84–103 (2017)
Minku, L.L., Yao, X.: DDD: a new ensemble approach for dealing with concept drift. IEEE Trans. Knowl. Data Eng. 24(4), 619–633 (2012)
Yang, L.: Classifier selection for ensemble learning based on accuracy and diversity. Proc. Eng. 15, 4266–4270 (2011)
Yule, G.: On the association of attributes in statistics. Philos. Trans. Roy. Soc. London Ser. A 194, 257–319 (1900)
Avery, R.B., Calem, P.S., Canner, G.B.: Consumer credit scoring: do situational circumstances matter? J. Bank. Finance 28, 835–856 (2004)
Asuncion, A., Newman, D.: UCI Machine Learning Repository. Publishing (2007)
Zhang, W., He, H., Zhang, S.: A novel multi-stage hybrid model with enhanced multi-population niche genetic algorithm: an application in credit scoring. Expert Syst. Appl. 121, 221–232 (2019)
Lessmann, S., Baesens, B., Seow, H.V., Thomas, L.C.: Benchmarking state-of-the-art classification algorithms for credit scoring: an update of research. Eur. J. Oper. Res. 247, 124–136 (2015)
Demsar, J.: Statistical comparison of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)
Acknowledgement
All resources used mobilized by the author. This work was not funded by an individual or organization.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
The author declares that he has no known competing financial interest or personal relationships that could have appeared to influence the work reported in this paper.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Museba, T. (2023). An Adaptive and Dynamic Heterogeneous Ensemble Model for Credit Scoring. In: Ndayizigamiye, P., Twinomurinzi, H., Kalema, B., Bwalya, K., Bembe, M. (eds) Digital-for-Development: Enabling Transformation, Inclusion and Sustainability Through ICTs. IDIA 2022. Communications in Computer and Information Science, vol 1774. Springer, Cham. https://doi.org/10.1007/978-3-031-28472-4_19
Download citation
DOI: https://doi.org/10.1007/978-3-031-28472-4_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-28471-7
Online ISBN: 978-3-031-28472-4
eBook Packages: Computer ScienceComputer Science (R0)