An Automatic Credit Scoring Strategy (ACSS) using memetic evolutionary algorithm and neural architecture search

doi:10.1016/j.asoc.2021.107871

Applied Soft Computing

Volume 113, Part A, December 2021, 107871

https://doi.org/10.1016/j.asoc.2021.107871 Get rights and content

Highlights

•
We utilize an improved SMOTE that could decrease the impact of data imbalance that exists in the credit data.
•
We propose to leverage the credit feature pruning algorithm and memetic optimization algorithm.
•
We propose an automated credit scoring strategy (ACSS) method with a cost-effective neural architecture search (C-NAS) scheme.

Abstract

Credit scoring is playing an increasingly critical role with the rising number of lending operations for micro and small enterprises as well as individuals. A large number of research is primarily based on the combination and construction methods of credit scoring models by the experts. However, different credit data have distinct requirements for the models, and how to automatically search and construct credit scoring models according to credit data has become essential, that is the main concern of this paper. In response to the current challenges for credit scoring research, we proposed an Automatic Credit Scoring Strategy (ACSS), designed a credit assessment platform which includes data import, classification model automatic search, feature selection, hyperparameter optimization, data mining, classification output and other modules. Aiming at the problem of substantial imbalance in credit data, we propose an improved SMOTE algorithm that is capable of generating supplementary data for the lack of minority in credit data, thereby making the credit data distribution well balanced. As for classification model selection, features engineering, parameter optimization and other parts, we further incorporate automatic search ways to reduce manual interaction. We utilize public and self-owned credit data sets to conduct experiments and compare them with the latest credit assessment methods. Extensive experiments have demonstrated that our ACSS method achieves relatively noticeable performance improvements using the German credit dataset, the Taiwan credit dataset and the personal credit dataset for credit scoring. The best results achieved by our proposed method are 0.98, 0.99, 0.895 and 0.901 for MAE, RMSE, accuracy and precision respectively. In addition, the experimental results also show that our proposed improved SMOTE algorithm contributes to the credit scoring performance enhancement. The experimental findings suggest that our proposed ACSS balances automation and accuracy, which can be implemented to the consumption industry to enhance the reliability of credit assessments.

Introduction

Nowadays, credit assessment has been widely seen in finance, consumption, insurance, education and other industries, offering extremely significant industrial applications for medical purposes, loans, education, employment and other tasks. The sudden outbreak of the COVID-19 epidemic in early 2020 proliferated across the globe, making it the most serious global crisis to confront humanity since World War II. The epidemic brought unprecedented shocks to countries around the world and significantly increased economic instability, leading to a noticeable credit crisis for a great number of companies, individuals and financial institutions, which in turn further worsened the economic environment worldwide. Therefore, the efficient and stable credit scoring model construction is essential for research to alleviate the financial and credit crisis caused by the epidemic and gradually boost the global economic recovery.

The assessment of the credit risk is typically based on credit scoring models that are extensively seen in assessing the applicant’s default probability. The key issue in credit risk evaluation is how applicants can be classified into two main groups: default and non-default. The assessor may then determine to refuse or accept the loan application after credit assessment procedure. Due to its role in management of credit risk, credit scoring has drawn tremendous attention in financial industries. There are thus a series of artificial intelligence and machine training models often used monitor credit scoring to check its efficiency in classification through a slight improvements in the credit scoring model. Credit data analysis and mining can address the issues of marketing, pricing, fraud, and credit induced by information asymmetry with the widespread implementation of data mining techniques. The most fundamental advancement of credit evaluation methods in the sense of big data, compared with conventional credit evaluation methods, lies in the use of a vast volume of non-financial data for analysis in order to take full account of the evaluator’s credit status. There are also many work using data mining methods. A credit evaluation algorithm based on the Bayes formula, which takes into account data from alternative sources and the option of the bank’s cross-product provided to the client, was proposed by Sergei [1]. Li et al. [2] developed a predictive model for the automated evaluation by machine learning technique of healthy elderly service credit efficiency. To obtain credit ratings-related attributes, the credit data is computed and analyzed. Data mining techniques are also used in corporate credit evaluation analysis, in addition to the individual credit assessment. He et al. [3] proposed a new heterogeneous ensemble credit model that incorporates the stacking algorithm. A wide variety of models, which include individual classifiers, homogeneous and heterogeneous ensembling models, are adopted as benchmarks in order to validate the efficiency of proposed stacking strategy. Wei Wang et al. [4] investigated a blockchain technology-based distributed credit assessment system which has intelligent protocols and decentralized features to provide credit histories through unchanged timestamps and distributed ledgers, and then strengthen the defects in existing centralized credit evaluation systems. Current research on credit scoring revolves around ensemble methods and single classifiers, with different researchers setting up various classifier integration methods based on the characteristics of credit data. However, this approach requires too much manual intervention, and it is tough to filter out suitable credit scoring ensemble models for those without any certain modeling experiences. Therefore, the existing study on credit scoring raises the question of whether it is possible to propose a credit scoring model generation method that reduces human intervention, is highly automated, and at the same time has a decent performance.

There are several challenges in credit scoring. The first challenge is the imbalance of credit data. Credit data is of diverse types and from a wide range of sources. Existing credit scoring methods pay less attention to the impact of credit data imbalance on model performance. Therefore, it is also a pressing issue to pre-process credit data so that the distribution of different credit categories in the data is as balanced as possible, thus improving the accuracy of credit scoring models. The second challenge is that the diversity of credit data types leads to credit models that are not universally applicable and scalable. The existing credit scoring models are developed by training the credit data, selecting the appropriate credit scoring model based on the experience of the experts, and combining a combination of human-set hyperparameters and other optimization approaches to achieve a satisfying credit scoring model. However, as credit data varies greatly between industries and regions, it is extremely difficult for existing credit scoring models to achieve comparable credit evaluation results under different credit data. The third challenge is that the credit scoring model building process requires a great deal of human involvement, which is not conducive to the widespread adoption of credit scoring models. As a typical machine learning classification problem, credit scoring modeling involves a great deal of manual intervention in model construction, hyperparameter selection, model training and other steps. However, in the face of the global economic downturn caused by the COVID-19 pandemic, it is imperative for banks, micro and small businesses, audit firms and other financial institutions that are in need of credit scoring services to obtain the most efficient and automated credit scoring model construction method feasible. Fortunately, the advancement of neural architecture search(NAS) shed light on this issue.

While many existing NAS methods are able to learn network architectures, most of them have been designed for problems with the classification of images that generally have high-quality labels. Since the recent methods concerning credit scoring require a considerable work on design of ensemble models, including the research of same-sex ensemble models and opposite-sex ensemble models, such approaches improve the credit scoring accuracy by assembling the ratio of base classifiers. However, since applying NAS methods directly to credit data would consume significantly more time, we propose an economical NAS method for credit scoring, which prunes the candidate model by calculating the importance, thus improving the search effect of NAS and reducing unnecessary search consumption time.

Credit scoring is typically a classification model construction issue. In general, machine learning datasets are typically multi-dimensional. Even so, irrelevant and redundant features not only affect classification model’s prediction efficiency but may also raise computational complexity. Therefore, feature extraction and selection methods are regarded as promising methods in machine study, and key features are found to minimize computation time costs and also to enhance predictive efficiency for the classification models. A variety of experiments have been carried out and the results have shown the effectiveness of our design in quantitatively evaluating the performance of the $C$ -NAS framework. We also used the optimal model and extract features that best reflect personal credits using the public and personal credit datasets, and compare results to the state-of-the-art scoring models. The main contributions are shown as follows.

(1) We propose an improved SMOTE that could decrease the impact of data imbalance that exists in the credit data and uses Improved SMOTE to augment the data for some small samples.

(2) We propose to leverage the credit feature pruning algorithm and memetic optimization algorithm which are capable of reducing more irrelevant features by the calculated credit feature importance and shortening the model search time.

(3) Additionally, we propose an automated credit scoring strategy (ACSS) method with an automatic cost-effective neural architecture search ( $C$ -NAS) method to improve the accuracy of credit classification and reduce the unnecessary human efforts to design the model and adjust the hyperparameters.

The subsequent sections of the paper are structured as follows. In Section 2, we will introduce the current research on credit scoring methods and the evolutionary algorithms that have been adopted. In Section 3, we will introduce our proposed ACSS method for automatic credit scoring. In Section 4 , we will introduce experimental introduction. In Section 5, we will present the results of the experiments separately from the three research questions as well as discuss and analyze the results. In Section 6, we will further analyze the advantages of our ACSS approach in terms of balancing automation and accuracy. In Section 7, we will summarize the primary research contributions and drawbacks of this paper, and provide an insight into our future research.

Section snippets

Related work

In order to effectively understand the details of credit scoring modeling research, we have described and summarized the relevant background.

Automatic credit scoring method

The automatic credit scoring modeling approach shown in this paper incorporates the key factors of credit data mining, in particular the fully automated machine learning pipeline for credit classification, which consists of four critical components: (1) Extraction of credit data feature; (2) Selection of features for credit assessment; (3) Searching for classification modeling; (4) Optimization of the Hyperparameters

These phases are completely automatic, with both the input being credit data

System overview

Because experiments of searching the credit scoring models is relatively time-consuming, we employ the bootstrapping methods to analyze variability over numerous repeats of each experiment. We run each AutoML framework 30 times per credit data set, and after that select 5 of the 30 results at random and choose the best of these 5 results as the final result. This is repeated 200 times for each AutoML platform and credit data set, and statistics are computed over all these result distributions.

Results and discussion

The aim of this study was to validate the efficiency and superiority of our proposed memetic $C$ -NAS based ACSS approach for unbalanced credit data sets. Hence, the main research questions are outlined in this section and the experiments designed to solve them are explained.

RQ1: Does the proposed ASCC using memetic cost-effective NAS prove better than the other widely used techniques in the credit datasets?

RQ2: Does our approach solve the problem of data imbalance? How effective are the

Is it possible to balance automation and high performance? a further analysis of the results for ACSS

With the advantage of credit scoring model automatization, our proposed ACSS presents for the first time a technique for determining credit scoring models employing neural architecture search techniques without the necessity of extensive human intervention for model design and hyperparameter selection. Accordingly, we therefore need to compare our approach with the currently available mainstream credit scoring methods to verify whether our ACSS approach can balance automation and high

Conclusion and future work

Credit appraisal has been extended to all fields of contemporary life, influencing and transforming the lives of all consumers. Current study on credit scores primarily focuses on the development of integrated classification models that cannot incorporate the entire data classification process. This paper therefore proposes a credit score model based on automated machine learning approaches that can efficiently combine data collection, feature discovery, model search, model detection and other

CRediT authorship contribution statement

Fan Yang: Conceptualization, Methodology, Software, Writing – original draft. Yanan Qiao: Supervision. Cheng Huang: Methodology, Validation. Shan Wang: Data curation. Xiao Wang: Writing – reviewing and editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This research is supported by the National key R&D Program of China under Grant No. 2018YFB1402700. This work is also supported by the Fundamental Research Funds for the Central Universities, Chinaunder Grant No. xzy022020056. This work is also supported by a scholarship from the China Scholarship Council (CSC) under Grant No. 201906280499 while the first author studying at Leiden University. This work is also supported by the the Blockchain Core Technology Strategic Research Program, China

References (55)

HeH. et al.
A novel ensemble method for credit scoring: Adaption of different imbalance ratios
Expert Syst. Appl.
(2018)
PławiakP. et al.
Application of new deep genetic cascade ensemble of SVM classifiers to predict the Australian credit scoring
Appl. Soft Comput.
(2019)
LuoC. et al.
A deep learning approach for credit scoring using credit default swaps
Eng. Appl. Artif. Intell.
(2017)
BequéA. et al.
Extreme learning machines for credit scoring: An empirical evaluation
Expert Syst. Appl.
(2017)
YuL. et al.
Credit risk assessment with a multistage neural network ensemble learning approach
Expert Syst. Appl.
(2008)
MalekipirbazariM. et al.
Risk assessment in social lending via random forests
Expert Syst. Appl.
(2015)
NanniL. et al.
An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring
Expert Syst. Appl.
(2009)
PaleologoG. et al.
Subagging for credit scoring models
European J. Oper. Res.
(2010)
TsaiC.-F. et al.
A comparative study of classifier ensembles for bankruptcy prediction
Appl. Soft Comput.
(2014)
RathoreS.S. et al.
Linear and non-linear heterogeneous ensemble methods to predict the number of faults in software systems
Knowl.-Based Syst.
(2017)

SouiM. et al.

Rule-based credit risk assessment model using multi-objective evolutionary algorithms

Expert Syst. Appl.

(2019)

FuX. et al.

Topology optimization against cascading failures on wireless sensor networks using a memetic algorithm

Comput. Netw.

(2020)

GongG. et al.

An effective memetic algorithm for multi-objective job-shop scheduling

Knowl.-Based Syst.

(2019)

AlsmadiM.K.

An efficient similarity measure for content based image retrieval using memetic algorithm

Egypt. J. Basic Appl. Sci.

(2017)

IaccaG. et al.

Ockham’s razor in memetic computing: three stage optimal memetic exploration

Inform. Sci.

(2012)

LessmannS. et al.

Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research

European J. Oper. Res.

(2015)

XiaY. et al.

A novel heterogeneous ensemble credit scoring model based on bstacking approach

Expert Syst. Appl.

(2018)

WangL. et al.

Imbalanced credit risk evaluation based on multiple sampling, multiple kernel fuzzy self-organizing map and local accuracy ensemble

Appl. Soft Comput.

(2020)

ShenF. et al.

A new deep learning ensemble credit risk evaluation model with an improved synthetic minority oversampling technique

Appl. Soft Comput.

(2021)

WuC.-F. et al.

A predictive intelligence system of credit scoring based on deep multiple kernel learning

Appl. Soft Comput.

(2021)

ZakharovS. et al.

Development of analytical CRM — the system of assesment of solvency of borrowers of commercial bank

C. Li, Y. Zhao, S. Li, P. Wang, Z. Zhao, Design and implementation of credit evaluation system for healthy aged...

W. Wang, A SME Credit Evaluation System Based on Blockchain, in: 2020 International Conference on E-Commerce and...

van RijnJ.N. et al.

The online performance estimation framework: heterogeneous ensemble learning for data streams

Mach. Learn.

(2018)

GuoD. et al.

Heterogeneous ensemble-based infill criterion for evolutionary multiobjective optimization of expensive problems

IEEE Trans. Cybern.

(2018)

LiW. et al.

Heterogeneous ensemble for default prediction of peer-to-peer lending in China

IEEE Access

(2018)

SherstjukV. et al.

Forest fire fighting using heterogeneous ensemble of unmanned aerial vehicles

Cited by (13)

Consumer credit risk assessment: A review from the state-of-the-art classification algorithms, data traits, and learning methods
2024, Expert Systems with Applications
Credit risk assessment is a crucial element in credit risk management. With the extensive research on consumer credit risk assessment in recent decades, the abundance of literature on this topic can be overwhelming for researchers. Therefore, this article aims to provide a more systematic and comprehensive analysis from three perspectives: classification algorithms, data traits, and learning methods. Firstly, the state-of-the-art classification algorithms are categorized into traditional single classifiers, intelligent single classifiers, hybrid and ensemble multiple classifiers. Secondly, considering the diversity of data traits in the credit dataset, data traits are divided into external structure information traits, data quality traits, data quantity traits, and internal information traits. Data traits-driven modeling framework based on multiple classifiers is proposed for solving credit risk assessment. Thirdly, considering the differences in data modeling methods, learning methods are classified into data status, label status, and structure form. Furthermore, model interpretability, model bias, model multi-pattern, and model fairness are discussed. Finally, the limitations and future research directions are presented. This review article serves as a helpful guide for researchers and practitioners in the field of credit risk modeling and analysis.
Consensus reaching with heterogeneous stochastic dominance in the enterprise credit rating under linguistic distribution assessments context
2023, Expert Systems with Applications
Credit rating is an essential method for credit risk management and has been applied in many fields. However, in the process of credit rating, the risk attitude of credit rating manager (i.e., decision maker) and linguistic distribution assessments by experts (i.e., individuals) are rarely considered. Inspired by this, in this paper, First, based on the linguistic distribution assessments provided by different individuals, we build a minimum adjustment cost consensus model to promote the consensus efficiency among different individuals. Then, we integrate the linguistic decision matrix and numerical decision matrix into the decision matrix. Next, based on the heterogeneous types of risk attitudes of decision makers, the dominance relationships between the candidate enterprises and the representative enterprises in different grades are determined. Further, the corresponding dominance degree between the candidate enterprises and the representative enterprises in different grades is calculated to determine the credit rating result of enterprises. Finally, an illustrative example is used to demonstrate the applications of the proposed method. And a simulation analysis is conducted to verify the effectiveness of the proposal.
Stacking ensemble method for personal credit risk assessment in Peer-to-Peer lending
2023, Applied Soft Computing
Over the last decade, China’s Peer-to-Peer (P2P) lending industry has been seen as an important credit source but it has recently suffered from a wave of bankruptcies. Using 126,090 P2P loan deals from RenRen Dai, one of the biggest online P2P websites in China, this paper attempts to predict credit default probabilities for P2P lending by implementing machine-learning techniques. More specifically, this study proposes a stacking ensemble machine-learning model to assess credit default risk for P2P lending platforms. A Max-Relevance and Min-Redundancy (MRMR) method is used for feature selection and then irrelevant features are eliminated by using k-means clustering method. Finally, the stacking ensemble model is performed to produce accurate and stable predictions in the feature subset. Experimental results show that stacking ensemble model yields high performance, not only in prediction accuracy but also in precision and recall. In comparison to single classifiers, the stacking ensemble machine-learning model has a minimum error rate and provides more accurate credit default risk prediction. The results also confirm the efficiency of the proposed stacking ensemble model through the area under the ROC curve.
Bagging Supervised Autoencoder Classifier for credit scoring
2023, Expert Systems with Applications
Citation Excerpt :
Some attempts have been made to produce and extract useful information from the raw data, i.e., hand-craft feature engineering for credit scoring. However, these approaches require a prior knowledge of credit scoring based on extensive and comprehensive experience in the financial sector, which may be expensive to acquire (Yang et al., 2021). Besides, hand-craft feature learning is time and labor-intensive.
Automatic credit scoring, a crucial risk management tool for banks and financial institutes, has attracted much attention in the past few decades. As such, various approaches have been developed to accurately and efficiently estimate defaults in loan applicants and seamlessly improve and facilitate decision-making in the lending process. However, the imbalanced nature of credit scoring datasets, as well as the heterogeneous nature of features in credit scoring task pose many challenges in developing and implementing effective credit scoring models, targeting the generalization power of classification models on unseen data. To mitigate these challenges, in this paper, we propose the Bagging Supervised Autoencoder Classifier (BSAC). BSAC is a learning model which simultaneously leverages the superior power of supervised autoencoders and representation learning in classification, as well as the Bagging mechanism to handle the irregularities in feature space. Supervised autoencoder has been exploited to learn an optimal latent space from heterogeneous features and perform classification on top of the learned latent space. In particular, the Bagging mechanism has been employed in the learning process to construct various samples of original data to tackle the problem that arises from imbalanced data and irregularities of features in latent space. Extensive experiments on various real-world and benchmark datasets validate the superiority and robustness of the proposed method in predicting the outcome of loan applications.
An explainable federated learning and blockchain-based secure credit modeling method
2023, European Journal of Operational Research
Federated learning has drawn a lot of interest as a powerful technological solution to the “credit data silo” problem. The interpretability of federated learning is a crucial issue due to the lack of user interaction and the complexity of credit data monitoring. We advocate the importance of a credit data processing-as-a-service model, which completes conventional credit models in local environments, in order to overcome these restrictions. In particular, we describe an explainable federated learning and blockchain-based credit scoring system (EFCS) in this work. First, we propose an explainable federated learning method with controllable machine learning efficiency and controllable credit model decision making, thus having controllable credit model complexity and transparent and traceable credit decision-making mechanism. Then, we suggest an explainable federated learning training mechanism for credit data that prevents leakage of the model gradients trained by individual nodes during the training of the overall model. Neither the credit data provider nor the data user has access to the raw data in the credit model training ecosystem. Therefore, privacy protection, model performance, and algorithm efficiency, the core triangular cornerstones of federated learning, when added with model interpretability, together constitute a more secure and trustworthy federated learning-based methodology, thus providing a more reliable service for credit model training and construction. The EFCS scheme is presented via simulations of different types of federated learning and their resistance to system attack, applying the proposed model to six different credit scoring datasets. Extensive experimental analyses support the efficiency, security, and explainability of the EFCS.
Credit scoring methods: Latest trends and points to consider
2022, Journal of Finance and Data Science
Citation Excerpt :
As of June, 2022 (article acceptance date), we would highlight a research paper published by Saudi Central Bank15 that covers the changes in the credit scores of borrowers in Saudi Arabia. Some other recent publications on credit scoring modelling16,17 mention the changes caused by the pandemic, but the datasets employed by the authors might not yet include the COVID-19 period. We expect that more research papers on the topic will be published as soon as enough data is accumulated.
Credit risk is the most significant risk by impact for any bank and financial institution. Accurate credit risk assessment affects an organisation's balance sheet and income statement, since credit risk strategy determines pricing, and might even influence seemingly unrelated domains, e.g. marketing, and decision-making. This article aims at providing a systemic review of the most recent (2016–2021) articles, identifying trends in credit scoring using a fixed set of questions. The survey methodology and questionnaire align with previous similar research that analyses articles on credit scoring published in 1991–2015. We seek to compare our results with previous periods and highlight some of the recent best practices in the field that might be useful for future researchers.

View all citing articles on Scopus

View full text

An Automatic Credit Scoring Strategy (ACSS) using memetic evolutionary algorithm and neural architecture search

Highlights

Abstract

Introduction

Section snippets

Related work

Automatic credit scoring method

System overview

Results and discussion

Is it possible to balance automation and high performance? a further analysis of the results for ACSS

Conclusion and future work

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Expert Syst. Appl.

Appl. Soft Comput.

Eng. Appl. Artif. Intell.

Expert Syst. Appl.

Expert Syst. Appl.

Expert Syst. Appl.

Expert Syst. Appl.

European J. Oper. Res.

Appl. Soft Comput.

Knowl.-Based Syst.

Expert Syst. Appl.

Comput. Netw.

Knowl.-Based Syst.

Egypt. J. Basic Appl. Sci.

Inform. Sci.

European J. Oper. Res.

Expert Syst. Appl.

Appl. Soft Comput.

Appl. Soft Comput.

Appl. Soft Comput.

Development of analytical CRM — the system of assesment of solvency of borrowers of commercial bank

The online performance estimation framework: heterogeneous ensemble learning for data streams

Mach. Learn.

Heterogeneous ensemble-based infill criterion for evolutionary multiobjective optimization of expensive problems

IEEE Trans. Cybern.

Heterogeneous ensemble for default prediction of peer-to-peer lending in China

IEEE Access

Forest fire fighting using heterogeneous ensemble of unmanned aerial vehicles