A novel multi-stage hybrid model with enhanced multi-population niche genetic algorithm: An application in credit scoring

doi:10.1016/j.eswa.2018.12.020

Expert Systems with Applications

Volume 121, 1 May 2019, Pages 221-232

https://doi.org/10.1016/j.eswa.2018.12.020

Highlights

• A novel multi-stage hybrid model is proposed and applied to credit scoring.
• Multi-population niche GA (MPNGA) is proposed to improve search efficiency.
• Feature/classifier selection enables the acquisition of optimal subset.
• The stacking-based ensemble is constructed to enhance predictive effectiveness.
• The proposed model is validated on five datasets over four performance metrics.

Abstract

In recent years, artificial intelligence and machine learning have progressed considerably, and various novel models have been constructed to enhance the prediction performance of binary classification from different aspects. Credit scoring is a typical application of these technologies. In this study, we propose a novel multi-stage hybrid model that combines feature selection and classifier selection to obtain an optimal feature subset and an optimal classifier subset, and then uses a classifier ensemble built on these two subsets to improve prediction performance. We also extend the genetic algorithm by proposing an enhanced multi-population niche genetic algorithm (EMPNGA), which improves optimization ability by enhancing the selection, crossover, and mutation steps and by adding niche and migration steps. Furthermore, EMPNGA is combined with several filter methods in feature selection and with a priori knowledge in classifier selection to further increase search efficiency. The proposed model is applied to credit scoring to verify its prediction performance, using five datasets and four evaluation metrics. The experimental results confirm that the proposed model outperforms the comparative models, which supports its effectiveness.

Introduction

In recent years, artificial intelligence and machine learning have developed rapidly. In previous studies, several typical classification models have been applied to binary classification, such as linear discriminant analysis (LDA; Fisher, 1936), logistic regression (LR; Hand & Kelly, 2002), decision tree (DT; Li, Ying, Tuo, & Li, 2004), support vector machine (SVM; Huang, Chen, Hsu, Chen, & Wu, 2004), and multilayer perceptron network (MLP; West, 2000).

Datasets for machine learning are typically multidimensional. However, irrelevant and redundant features not only reduce the prediction performance of a classification model but can also increase its computational complexity. Feature selection methods are recognized as promising approaches in machine learning; they are applied to identify the key features, which reduces the computing time of classification models and improves their prediction performance. Several previous studies have explored feature selection methods (Chen & Li, 2010; Hajek & Michalak, 2013; Maldonado, Pérez, & Bravo, 2017; Oreski & Oreski, 2014; Wang, Zhang, Bai, & Mao, 2017). However, there remain new capabilities to be discovered and explored.

Ensemble models have also been widely considered as a way to improve the performance of classification models in recent years. Many ensemble models have been applied in machine learning, such as the homogeneous DT-based ensembles random forest (RF; Friedman, 2001), gradient boosting decision tree (GBDT; Friedman, 2001), and XGBoost (Chen & Guestrin, 2016). Heterogeneous ensemble models, which combine multiple base classifiers, have also garnered widespread attention (Ala'Raj & Abbod, 2016a, 2016b; Xia, Liu, Da, & Xie, 2018). Lessmann, Baesens, Seow, and Thomas (2015) showed that the performance of heterogeneous ensembles is frequently superior to that of individual classifiers. However, how to determine the most effective ensemble model for different datasets has not yet been completely solved. In addition, the problem complexity and computational time of classifier selection in the original feature space are usually large. Therefore, effective classifier selection methods should be considered to obtain a more appropriate ensemble model within a given complexity.

Credit scoring has gained considerable attention in the financial industry owing to its importance in credit risk management. Because even a small improvement in a credit scoring model can bring large profits to financial institutions, many artificial intelligence and machine learning models have been applied to credit scoring to verify their performance in binary classification. In this study, we propose a novel multi-stage hybrid model that combines feature selection and classifier selection to obtain superior prediction performance. Furthermore, an enhanced multi-population niche genetic algorithm (EMPNGA) is proposed; it is combined with several filter methods in feature selection and with a priori knowledge in classifier selection to obtain the optimal feature and classifier subsets. A classifier ensemble built on these optimal subsets is then used to improve the prediction performance of the model. The proposed model is applied to credit scoring to verify its prediction performance in binary classification. The experimental results demonstrate that each stage of the hybrid model contributes significantly to prediction performance and that the final prediction performance of the proposed model is superior to that of the comparative models. This indicates that the proposed model is effective and practical, and it provides a new direction for future machine learning research.
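To make the vocabulary of selection, crossover, mutation, niche, and migration steps concrete, the following is a minimal sketch of a generic multi-population genetic algorithm over binary feature masks. It is not the EMPNGA of this study, whose enhanced operators are not specified in this excerpt. The niche step is assumed here to be plain fitness sharing by Hamming distance, migration is assumed to follow a ring topology, and `toy_fitness`, `INFORMATIVE`, and all parameter values are hypothetical placeholders for a real classifier-based fitness.

```python
import random

INFORMATIVE = {0, 3, 5}  # hypothetical: indices of the "useful" features


def toy_fitness(mask):
    """Hypothetical fitness: reward useful features, penalise extra ones."""
    hits = sum(mask[i] for i in INFORMATIVE)
    extras = sum(mask) - hits
    return max(0.0, hits - 0.25 * extras)


def hamming(a, b):
    return sum(x != y for x, y in zip(a, b))


def shared_fitness(pop, fitness, radius):
    """Niche step (fitness sharing): divide raw fitness by the number of
    individuals within `radius`, so crowded regions are penalised."""
    scores = []
    for ind in pop:
        niche_count = sum(1 for other in pop if hamming(ind, other) < radius)
        scores.append(fitness(ind) / niche_count)  # count >= 1 (self)
    return scores


def tournament(pop, scores, rng, k=3):
    """Selection: best of k randomly drawn individuals (by shared fitness)."""
    picks = rng.sample(range(len(pop)), k)
    return pop[max(picks, key=lambda i: scores[i])]


def evolve(pop, fitness, rng, radius=2, p_mut=0.05):
    """One generation: elitism, tournament selection, one-point crossover,
    bit-flip mutation."""
    scores = shared_fitness(pop, fitness, radius)
    children = [max(pop, key=fitness)[:]]  # keep the raw-fitness elite
    while len(children) < len(pop):
        a = tournament(pop, scores, rng)
        b = tournament(pop, scores, rng)
        cut = rng.randrange(1, len(a))
        child = a[:cut] + b[cut:]
        child = [bit ^ 1 if rng.random() < p_mut else bit for bit in child]
        children.append(child)
    return children


def migrate(pops, fitness):
    """Migration (ring): each population's best replaces the worst
    individual of the next population."""
    bests = [max(p, key=fitness)[:] for p in pops]
    for i, pop in enumerate(pops):
        worst = min(range(len(pop)), key=lambda j: fitness(pop[j]))
        pop[worst] = bests[i - 1]


def multi_population_ga(n_bits, fitness, n_pops=3, pop_size=20,
                        generations=40, migrate_every=5, seed=0):
    rng = random.Random(seed)
    pops = [[[rng.randint(0, 1) for _ in range(n_bits)]
             for _ in range(pop_size)] for _ in range(n_pops)]
    for gen in range(1, generations + 1):
        pops = [evolve(p, fitness, rng) for p in pops]
        if gen % migrate_every == 0:
            migrate(pops, fitness)
    return max((ind for p in pops for ind in p), key=fitness)


best = multi_population_ga(8, toy_fitness)
print(best, toy_fitness(best))
```

In a credit scoring setting, the toy fitness would typically be replaced by a cross-validated score of a classifier trained on the masked features. Running several populations with occasional migration is one common way to slow premature convergence, at the cost of more fitness evaluations.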

The remainder of this study is organized as follows. Section 2 describes related work on genetic algorithms, feature selection, and classifier ensembles. Section 3 describes the mechanism of the proposed model. Section 4 presents the experimental design. Section 5 describes the experimental results and comparative analysis. Section 6 presents the conclusions and future work.

Section snippets

Related work

The work in this paper relates to three areas: (1) genetic algorithms, (2) feature selection, and (3) classifier ensembles. As important sub-fields of machine learning research, these areas have attracted much attention from scholars. In this section, the three areas are reviewed and their applications in credit scoring are elaborated.

The proposed multi-stage hybrid model

In this section, the multi-stage hybrid model is presented, and its framework is described in Fig. 1. This hybrid model can be divided into three stages: feature selection, classifier selection, and classifier ensemble. In the feature selection stage, the preprocessed data are used as input data and several filter methods are combined to determine the synthetic feature importance of all the features. The synthetic feature importance combines the respective characteristics of the several filter
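As an illustration of how several filter methods might be merged into a single synthetic feature importance, the sketch below min-max normalises the scores of each filter and averages them per feature. The excerpt does not state which filters the study uses or how it combines them, so the two filters (`abs_pearson` and `fisher_score`), the averaging rule, and the toy data are all assumptions made for illustration only.

```python
from statistics import mean, pvariance


def abs_pearson(xs, ys):
    """Filter 1 (assumed): absolute Pearson correlation with the label."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return abs(cov / (sx * sy)) if sx and sy else 0.0


def fisher_score(xs, ys):
    """Filter 2 (assumed): squared gap between class means divided by the
    summed within-class variances, for binary labels 0/1."""
    pos = [x for x, y in zip(xs, ys) if y == 1]
    neg = [x for x, y in zip(xs, ys) if y == 0]
    spread = pvariance(pos) + pvariance(neg)
    return (mean(pos) - mean(neg)) ** 2 / spread if spread else 0.0


def min_max(scores):
    """Rescale one filter's scores to [0, 1] so filters are comparable."""
    lo, hi = min(scores), max(scores)
    return [(s - lo) / (hi - lo) if hi > lo else 0.0 for s in scores]


def synthetic_importance(columns, labels, filters=(abs_pearson, fisher_score)):
    """Average the normalised scores of all filters, feature by feature."""
    per_filter = [min_max([f(col, labels) for col in columns])
                  for f in filters]
    return [mean(scores) for scores in zip(*per_filter)]


# Toy data: each inner list is one feature column over six applicants.
labels = [0, 0, 0, 1, 1, 1]
columns = [
    [1.0, 2.0, 1.5, 4.0, 5.0, 4.5],  # separates the classes well
    [3.0, 1.0, 2.0, 2.0, 3.0, 1.0],  # same mean in both classes
    [1.0, 3.0, 2.0, 2.5, 3.5, 3.0],  # weakly related to the label
]
importance = synthetic_importance(columns, labels)
ranking = sorted(range(len(columns)), key=lambda i: -importance[i])
print(importance, ranking)
```

Normalising before averaging keeps a filter with a large numeric range, such as the Fisher-style score, from dominating the combined value. A rank-based combination would be an equally plausible choice.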

Credit datasets

In the experiment, five real-world credit datasets are used to verify the performance of the proposed model. That is, three credit scoring datasets from the UCI Machine Learning Repository (Asuncion & Newman, 2007), namely, Australian, German, and Japanese datasets, PPDai dataset, which is a part of a loan dataset provided by the Chinese internet finance enterprise named PaiPaiDai,¹ and GMSC dataset, which is published by a famous data competition platform (Kaggle²

Experimental results

In this section, experimental results are presented to validate the advantages of the proposed model over the comparative classifiers and to demonstrate its effectiveness. All of the experiments were run with Python 3.6 on a PC with a 3.2 GHz Intel Core i7 processor and 32 GB of RAM, running the Microsoft Windows 7 operating system.

Conclusions and future work

In recent years, artificial intelligence and machine learning have developed rapidly, and various novel models have been constructed to enhance prediction performance in binary classification. Researchers have conducted numerous valuable explorations in fields including feature selection, classifier selection, and classifier ensemble. Although some studies have combined the above-mentioned approaches, the optimal integration of them has not been

Acknowledgment

This work was supported by the National Natural Science Foundation of China (Nos. 51875503, 51475410) and the Zhejiang Natural Science Foundation of China (No. LY17E050010).

References (40)

M. Ala'Raj et al. Classifiers consensus system approach for credit scoring. Knowledge-Based Systems (2016)
M. Ala'Raj et al. A new hybrid ensemble credit scoring model based on classifiers consensus system approach. Expert Systems with Applications (2016)
A. Asuncion et al. UCI machine learning repository (2007)
A. Bequé et al. Approaches for credit scorecard calibration: An empirical analysis. Knowledge-Based Systems (2017)
L. Breiman. Bagging predictors. Machine Learning (1996)
F.L. Chen et al. Combination of feature selection approaches with SVM in credit scoring. Expert Systems with Applications (2010)
N. Chen et al. A genetic algorithm-based approach to cost-sensitive bankruptcy prediction. Expert Systems with Applications (2011)
T. Chen et al. XGBoost: A scalable tree boosting system
C.H. Chou et al. Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction. Applied Soft Computing (2017)
T.M. Cover et al. Elements of information theory (1991)
J. Demšar. Statistical comparisons of classifiers over multiple data sets. The Journal of Machine Learning Research (2006)
S. Finlay. Multiple classifier architectures and their application to credit risk assessment. European Journal of Operational Research (2011)
R.A. Fisher. Studies in crop variation. I. An examination of the yield of dressed grain from Broadbalk. The Journal of Agricultural Science (1921)
R.A. Fisher. The use of multiple measurements in taxonomic problems. Annals of Human Genetics (1936)
J.H. Friedman. Greedy function approximation: A gradient boosting machine. Annals of Statistics (2001)
M. Friedman. A comparison of alternative tests of significance for the problem of m rankings. The Annals of Mathematical Statistics (1940)
P. Hajek et al. Feature selection in corporate credit rating prediction. Knowledge-Based Systems (2013)
D.J. Hand. Measuring classifier performance: A coherent alternative to the area under the ROC curve. Machine Learning (2009)
D.J. Hand et al. A better beta for the H measure of classification performance. Pattern Recognition Letters (2014)
D.J. Hand et al. Superscorecards. IMA Journal of Management Mathematics (2002)
