Skip to main content
Log in

Machine learning techniques for credit risk evaluation: a systematic literature review

  • Original Article
  • Published:
Journal of Banking and Financial Technology Aims and scope Submit manuscript

Abstract

Credit risk is the risk of financial loss when a borrower fails to meet the financial commitment. While there are many factors that constitute credit risk, due diligence while giving loan (credit scoring), continuous monitoring of customer payments and other behaviour patterns could reduce the probability of accumulating non-performing assets (NPA) and frauds. In the past few years, the quantum of NPAs and frauds have gone up significantly, and therefore it has become imperative that banks and financial institutions use robust mechanisms to predict the performance of loans. The past two decades has seen an immense growth in the area of artificial intelligence, most notably machine learning (ML) with improved access to internet, data, and compute. Whilst there are credit rating agencies and credit scoring companies that provide their analysis of a customer to banks on a fee, the researchers continue to explore various ML techniques to improve the accuracy level of credit risk evaluation. In this survey paper, we performed a systematic literature review on existing research methods and ML techniques for credit risk evaluation. We reviewed a total of 136 papers on credit risk evaluation published between 1993 and March 2019. We studied the implications of hyper parameters on ML techniques being used to evaluate credit risk and, analyzed the limitations of the current studies and research trends. We observed that Ensemble and Hybrid models with neural networks and SVM are being more adopted for credit scoring, NPA prediction and fraud detection. We also realized that lack of comprehensive public datasets continue to be an area of concern for researchers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Notes

  1. https://tinyurl.com/y5ft3hd7.

  2. https://tinyurl.com/RBINPA.

  3. https://tinyurl.com/peupj3s.

  4. https://tinyurl.com/y2vzvdqx.

  5. https://www-03.ibm.com/press/us/en/pressrelease/31670.wss.

  6. https://www.investopedia.com.

  7. https://tinyurl.com/LCDataDictionary.

References

  1. IMF says. https://tinyurl.com/y5c4hyvj. Accessed 13 Aug 2019

  2. Jarrow RA, Turnbull SM (1995) Pricing derivatives on financial securities subject to credit risk. J Financ 50(1):53–85

    Article  Google Scholar 

  3. Global Economy - NPA Concerns. https://tinyurl.com/y5tpl7t5, 2018. Accessed 13 Aug 2019

  4. NPA countries. https://tinyurl.com/y4a6n6bt. Accessed 13 Aug 2019

  5. IBM - Banking Analytics Services. https://tinyurl.com/y2exerck, 2018. Accessed 13 Aug 2019

  6. McKinsey - Analytics in Banking. https://tinyurl.com/y23ny7mv, 2018. Accessed 13 Aug 2019

  7. Khandani AE, Kim AJ, Lo AW (2010) Consumer credit risk models via machine learning algorithms. J Bank Financ 34(11):2767–2787

    Article  Google Scholar 

  8. Akoglu L, Tong H, Koutra D (2015) Graph based anomaly detection and description: a survey. Data Min Knowl Disc 29(3):626–688

    Article  MathSciNet  Google Scholar 

  9. Bagherpour A (2017) Predicting mortgage loan default with machine learning methods

  10. Galindo J, Tamayo P (2000) Credit risk assessment using statistical and machine learning: basic methodology and risk modeling applications. Comput Econ 15(1–2):107–143

    Article  MATH  Google Scholar 

  11. Albrecht WS, Albrecht CO, Albrecht CC, Zimbelman MF (2011) Fraud examination. Cengage Learn

  12. Hicks D, Caplain J, Faulkner N, Olcina E (2019) Global banking fraud survey

  13. Hannes L, Peter SJ, Peltonen TA (2018) A framework for early-warning modeling with an application to banks. European Central Bank

  14. Tony H, Stewart T, Kristin T (2009) The fourth paradigm: data-intensive scientific discovery. Microsoft Research

  15. Bose I, Mahapatra RK (2001) Business data mining—a machine learning perspective. Inf Manag 39(3):211–225

    Article  Google Scholar 

  16. Ravi V (2017) IDRBT Staff Papers - Analytics. https://tinyurl.com/yy9s85pd. Accessed 13 Aug 2019

  17. Vives X (2017) The impact of FinTech on banking. Eur Econ 2:97–105

    Google Scholar 

  18. Jagtiani JA, Lemieux CM (2019) The roles of alternative data and machine learning in fintech lending: evidence from the lendingclub consumer platform

  19. Ann KB, David B, Pearl B (2015) Evidence-based software engineering and systematic reviews. Chapman & Hall/CRC, New York

    Google Scholar 

  20. Russell Stuart J, Peter N (2016) Artificial intelligence: a modern approach. Pearson Education Limited, Malaysia

    MATH  Google Scholar 

  21. AI Boom and subsequent winter. https://tinyurl.com/y2yk9kqb. Accessed 13 Aug 2019

  22. SLR files. https://tinyurl.com/SLRfiles. Accessed 13 Aug 2019

  23. Marikkannu P, Shanmugapriya K (2011) Classification of customer credit data for intelligent credit scoring system using fuzzy set and mc2—domain driven approach. In: 2011 3rd International conference on electronics computer technology, 3:410–414

  24. Romanyuk K (2015) Concept of a decision support system for a loan granting based on continuous price function. In: 2015 SAI intelligent systems conference (IntelliSys), pp 105–111

  25. Wei R (2008) Development of credit risk model based on fuzzy theory and its application for credit risk management of commercial banks in china. In: 2008 4th International conference on wireless communications, networking and mobile computing, pp 1–4

  26. Hoffmann F, Bart B, Christophe M, Van GT, Jan V (2007) Inferring descriptive and approximate fuzzy rules for credit scoring using evolutionary algorithms. Eur J Oper Res 177:540–555

    Article  Google Scholar 

  27. Kotsiantis SB, Kanellopoulos D, Karioti V, Tampakas V (2009) An ontology-based portal for credit risk analysis. In: 2009 2nd IEEE international conference on computer science and information technology, 165–169

  28. Baesens B, Mues C, De Backer M, Vanthienen J, Setiono R (2005) Building intelligent credit scoring systems using decision tables. In: Camp O, Filipe JBL, Hammoudi S, Piattini M (eds) Enterprise Information Systems V, pp 131–137, Springer, Amsterdam

  29. Pedro JS, Proserpio D, Oliver N (2015) Mobiscore: towards universal credit scoring from mobile phone data. In: Ricci F, Bontcheva K, Conlan O, Lawless S (eds), User modeling, adaptation and personalization, pp 195–207, Springer, Cham

  30. Xia Y, Liu C, Da B, Xie F (2017) A novel heterogeneous ensemble credit scoring model based on bstacking approach. Expert Syst Appl 93:10

    Google Scholar 

  31. Mahmoud M, Algadi N, Ali A (2008) Expert system for banking credit decision. In: 2008 International conference on computer science and information technology, 813–819

  32. Lin S, Wu S-J, Ma H-L, Wu D-B (2009) Development of credit risk model in banking industry based on gra. In: 2009 International conference on machine learning and cybernetics, 5, pp 2903–2909

  33. Huang J, Chen M (2018) Domain adaptation approach for credit risk analysis. In: Proceedings of the 2018 international conference on software engineering and information management, ICSIM2018, pp 104–107, New York, NY

  34. Gadi MFA, Wang X, do Lago AP (2008) Credit card fraud detection with artificial immune system. In: Bentley PJ, Lee D, Jung S (eds), Artificial immune systems, pp 119–131, 2008. Springer, Berlin

  35. Duman E, Özçelik M (2011) Detecting credit card fraud by genetic algorithm and scatter search. Expert Syst Appl 38:13057–13063

  36. Van Vlasselaer V, Bravo C, Caelen O, Eliassi-Rad T, Akoglu L, Snoeck M, Baesens B (2015) Apate: a novel approach for automated credit card transaction fraud detection using network-based extensions. Decis Support Syst 75:05

    Google Scholar 

  37. Vac Gelu I, Găban Lucian V (2016) A new perspective over the risk assessment in credit scoring analysis using the adaptive reference system. In: Abramowicz W, Alt R, Franczyk B (eds), Business information systems, pp 130–143, Springer, Cham

  38. Liu K, Lai KK, Guu S (2009) Dynamic credit scoring on consumer behavior using fuzzy markov model. In: 2009 Fourth International Multi-Conference on Computing in the Global Information Technology, 235–239

  39. Anzilli L, Facchinetti G, Pirotti T (2017) Credit risk profiling using a new evaluation of interval-valued fuzzy sets based on alpha-cuts. In: 2017 IEEE international conference on fuzzy systems (FUZZ-IEEE), 1–6

  40. Qiao H, Dong X-C (2009) Research on the risk evaluation in loan projects of commercial bank in financial crisis. Int Conf Mach Learn Cybern 2:776–781

    Google Scholar 

  41. Yazdani H, Kwasnicka H (2012) Fuzzy classification method in credit risk. In: Nguyen N-T, Hoang K, Jedrzejowicz P (eds), Computational collective intelligence. Technologies and applications, Springer, Berlin, pp 495–504

  42. Kültür Y, Çaǧlayan M U (Oct 2015) A novel cardholder behavior model for detecting credit card fraud. In: 2015 9th International conference on application of information and communication technologies (AICT), pp 148–152

  43. Li G, Wu Y (2010) Empirical research about credit risk on neural network based bp algorithm. In: 2010 3rd international conference on information management, innovation management and industrial engineering, 3:461–463

  44. Li R-Z, Pang S-L, Xu J-M (2002) Neural network credit-risk evaluation model based on back-propagation algorithm. In: Proceedings of international conference on machine learning and cybernetics, 4:1702–1706 vol.4

  45. Zhu C, Zhan Y, Jia S (2010) Research on bp neural network evaluation model of credit risk of bank clients. In: 2010 International conference on management and service science, 1–5

  46. Dima AM, Vasilache S (2009) Ann model for corporate credit risk assessment. In: International conference on information and financial engineering, 94–98

  47. Hu X-Y, Tang Y (2006) Ann-based credit risk identificaion and control for commercial banks. 2006:3110 – 3114, 09

  48. Wang F, Song Z (2008) Research on the credit risk evaluation and forecast of housing mortgage loans. In: 2008 International symposium on intelligent information technology application workshops, pp 940–943

  49. Peng Y, Tu X (2005) A study on the ann-based credit risk prediction model and its application. In: Li D, Wang B (eds), Artificial intelligence applications and innovations, pages 459–468, Boston, MA, Springer US

  50. Bozsik J, Ilonczai Z (2012) Echo state network-based credit rating system. In: 4th IEEE international symposium on logistics and industrial informatics, pp 185–190

  51. Lai KK, Yu L, Wang S, Zhou L (2006) Neural network metalearning for credit scoring. In: Huang D-S, Li K, Irwin GW (eds), Intelligent computing, Springer, Berlin, pp 403–408

  52. Hsieh N-C (2005) Hybrid mining approach in the design of credit scoring models. Expert Syst Appl 28:655–665

  53. Derelioğlu G, Gürgen F, Okay N (2009) A neural approach for sme’s credit risk analysis in turkey. In: Perner P (ed), Machine learning and data mining in pattern recognition, Springer, Berlin, pp 749–759

  54. Lean Y, Shouyang W, Keung LK (2008) Credit risk assessment with a multistage neural network ensemble learning approach. Expert Syst Appl 34:1434–1444, 02

    Article  Google Scholar 

  55. Baesens B, Setiono R, Mues C, Vanthienen J (2003) Using neural network rule extraction and decision tables for credit - risk evaluation. Manag Sci 49:312–329

    Article  MATH  Google Scholar 

  56. Marin-de-la-Barcena A, Marcano-Cedeño A, Jimenez-Trillo J, Piñuela J A, Andina D (2010) Artificial metaplasticity: an approximation to credit scoring modeling. In: IECON 2010 - 36th annual conference on IEEE industrial electronics society, pp 2817–2822

  57. Tomczak J, Ziȩba M (2014) Classification restricted boltzmann machine for comprehensible credit scoring model. Expert Syst Appl 42:10

    Google Scholar 

  58. Zhang Y, Wang D, Chen Y, Zhao Y, Shao P, Meng Q (2017) Credit risk assessment based on flexible neural tree model. In: Cong F, Leung A, Wei Q (eds), Advances in neural networks - ISNN 2017, Springer, Cham, pp 215–222

  59. Zhaoji Y, Qiang M, Wenjuan W (2010) The application of wn based on pso in bank credit risk assessment. In: 2010 International conference on artificial intelligence and computational intelligence, 3:444–448

  60. Fan Q, Yang J (2018) A denoising autoencoder approach for credit risk analysis. In: Proceedings of the 2018 international conference on computing and artificial intelligence, ICCAI 2018, New York, NY, pp 62–65

  61. Timofeev N, Timofeeva G (2013) Estimation of loan portfolio risk on the basis of markov chain model. In: Hömberg D, Tröltzsch F (eds), System modeling and optimization, Springer, Berlin, pp 207–216

  62. Farquad MAH, Sriramjee VR, Praveen G (2011) Credit scoring using pca-svm hybrid model. In: Das VV, Stephen J, Chaba Y (eds), Computer networks and information technologies, Springer, Berlin

  63. Harris T (2015) Credit scoring using the clustered support vector machine. Expert Syst Appl 42:741–750, 02

    Article  Google Scholar 

  64. Huang S-C (2009) Integrating nonlinear graph based dimensionality reduction schemes with svms for credit rating forecasting. Expert Syst Appl 36:7515–7518, 05

    Article  Google Scholar 

  65. Li Z (2016) A new method of credit risk assessment of commercial banks. In: 2016 International Conference on Robots Intelligent System (ICRIS), p 34–37

  66. Feng W, Zhao Y, Deng J (2009) Application of svm based on principal component analysis to credit risk assessment in commercial banks. In: 2009 WRI Global Congress on Intelligent Systems, 4, p 49–52

  67. Lv G, Peng L (2008) Commercial banks’ credit risk assessment based on rough sets and svm. In: 2008 4th International Conference on Wireless Communications, Networking and Mobile Computing, p 1–4

  68. Yang C-G, Duan X-B (2008) Credit risk assessment in commercial banks based on svm using pca. In: 2008 International Conference on Machine Learning and Cybernetics, volume 2, pp 1207–1211

  69. Wei L, Li W, Xiao Q (2016) Credit risk evaluation using: Least squares support vector machine with mixture of kernel. In: 2016 International Conference on Network and Information Systems for Computers (ICNISC), p 237–241

  70. Zhu C, Zhan Y, Jia S (2010) Credit risk identification of bank client basing on supporting vector machines. In: 2010 Third International Conference on Business Intelligence and Financial Engineering, p 62–66

  71. Ma Y, Liu H (2010) Research of svm applying in the risk of bank’s loan to enterprises. In: 2010 2nd International Conference on Information Engineering and Computer Science, p 1–5

  72. A least squares fuzzy SVM approach to credit risk assessment, pp. 73–84. Springer, Berlin (2008)

  73. Sun W, Yang C (2006) Credit risk assessment in commercial banks based on multi-layer svm classifier. In: Huang D-S, Li K, Irwin GW (eds), Computational intelligence, pp 778–785. Springer, Berlin

  74. Wei L, Li J, Chen Z-Y (2007) Credit risk evaluation using support vector machine with mixture of kernel. 4488:431–438, 05

  75. Lai Kin Keung, Yu Lean, Zhou Ligang, Wang Shouyang (2006) Credit risk evaluation with least square support vector machine. In: Wang Guo-Ying, Peters James F, Skowron Andrzej, YaoYiyu (eds), Rough Sets and Knowledge Technology, p 490–495, Berlin, Heidelberg. Springer Berlin Heidelberg

  76. Li Jianping, Liu Jingli, Xu Weixuan, Shi Yong (2004) Support vector machines approach to credit assessment. In: Bubak Marian, van Albada Geert Dick, Sloot Peter M A, Dongarra Jack, (eds), Computational Science - ICCS 2004, p 892–899, Berlin, Heidelberg. Springer Berlin Heidelberg

  77. Gestel Tony Van, Baesens Bart, Suykens Johan A K, Van den Poel Dirk, Baestaens Dirk-Emma, Willekens Marleen (2006) Bayesian kernel based classification for financial distress detection. European Journal of Operational Research, 172:979–1003

  78. Martens D, Baesens B, Van Gestel T, Vanthienen J (2007) Comprehensible credit scoring models using rule extraction from support vector machines. Eur J Oper Res 183:1466–1476

    Article  MATH  Google Scholar 

  79. Van Gestel Tony, Baesens Bart, Garcia Joao, Van Dijcke Peter (2003) A support vector machine approach to credit scoring

  80. Vedala R, Kumar B R (2012) An application of naive bayes classification for credit scoring in e-lending platform. In: 2012 International Conference on Data Science Engineering (ICDSE), p 81–84

  81. Okesola Olatunji, Okokpujie Kennedy, Adewale Adeyinka, John Samuel, Omoruyi Osemwegie (2017) An improved bank credit scoring model: A naïve bayesian approach. p 228–233, 12

  82. Benyacoub B, El Bernoussi S, Zoglat A (2014) Building classification models for customer credit scoring. In: 2014 International Conference on Logistics Operations Management, p 107–111

  83. Petropoulos Anastasios, Chatzis Sotirios, Xanthopoulos S (2016) A novel corporate credit rating system based on student’s-t hidden markov models. Expert Systems with Applications, 53, 01

  84. Vieira Armando, Duarte João, Ribeiro Bernardete, Neves Joao Carvalho (2009) Improving personal credit scoring with hlvq-c. In: Köppen Mario, Kasabov Nikola, Coghill George, (eds), Advances in Neuro-Information Processing, p 97–103, Berlin, Heidelberg. Springer Berlin Heidelberg

  85. Wei G, Yingjie S, Mu Y X (2015) Commercial bank credit risk evaluation method based on decision tree algorithm. In: 2015 Seventh International Conference on Measuring Technology and Mechatronics Automation, p 285–288

  86. Lang J, Sun J (2014) Sensitivity of decision tree algorithm to class-imbalanced bank credit risk early warning. In: 2014 Seventh International Joint Conference on Computational Sciences and Optimization, p 539–543

  87. Szwabe Andrzej, Misiorek Pawel (2018) Decision trees as interpretable bank credit scoring models. In: Kozielski Stanisław, Mrozek Dariusz, Kasprowski Paweł, Małysiak-Mrozek Bożena, Kostrzewa Daniel, (eds), Beyond Databases, Architectures and Structures. Facing the Challenges of Data Proliferation and Growing Variety, p 207–219, Cham. Springer International Publishing

  88. Xia Y, Liu C, Li YY, Liu N (2017) A boosted decision tree approach using bayesian hyper-parameter optimization for credit scoring. Expert Syst Appl 78:02

    Article  Google Scholar 

  89. Fu Hui, Liu Xiaoyong (2011) A hybrid model for credit evaluation problem. In: Tan Ying, Shi Yuhui, Chai Yi, Wang Guoyin, (eds), Advances in Swarm Intelligence, pages 626–634, Berlin, Heidelberg. Springer Berlin Heidelberg

  90. Ruiz Saulo, Gomes Pedro, Rodrigues Luís, Gama João (2017) Credit scoring in microfinance using non-traditional data. In: Oliveira Eugénio, Gama João, Vale Zita, Lopes Cardoso Henrique, (eds), Progress in Artificial Intelligence, p 447–458, Cham, (2017). Springer International Publishing

  91. Cheng-Lung H, Mu-Chen C, Chieh-Jen W (2007) Credit scoring with a data mining approach based on support vector machines. Expert Syst Appl 33:847–856, 11

    Article  Google Scholar 

  92. Oreski S, Oreski D, Oreški G (2012) Hybrid system with genetic algorithm and artificial neural networks and its application to retail credit risk assessment. Expert Syst Appl 39:12605–12617, 11

    Article  Google Scholar 

  93. Taremian HR, Naeini MP (2011) Hybrid intelligent decision support system for credit risk assessment. In:2011 7th International Conference on Information Assurance and Security (IAS), p 167–172

  94. Rodan Ali, Faris Hossam (2016) Credit risk evaluation using cycle reservoir neural networks with support vector machines readout. In: Nguyen Ngoc Thanh, Trawiński Bogdan, Fujita Hamido, Hong Tzung-Pei, (eds), Intelligent Information and Database Systems, p 595–604, Berlin, Heidelberg. Springer Berlin Heidelberg

  95. Weidong Huang, Xiangwei Zhu (2010) SuQingling. Research on application of personal credit scoring based on bp-logistic hybrid algorithm. In: 2010 International Conference on Computer Application and System Modeling (ICCASM 2010), 4:V4–735–V4–739

  96. Djemaiel Yacine, Labidi Nadia, Boudriga Noureddine (2016) A dynamic hybrid rbf/elman neural networks for credit scoring using big data. In: Abramowicz Witold, Alt Rainer, Franczyk Bogdan, (eds), Business Information Systems, p 102–113, Cham. Springer International Publishing

  97. Huang Y, Tian C (2008) Research on credit risk assessment model of commercial banks based on fuzzy probabilistic neural network. In: 2008 International Conference on Risk Management Engineering Management, p 482–486

  98. Huang Bo, Zhang Qing-Pu, Hu Yun-Quan (2005) Research on credit risk management of the state-owned commercial bank. In: 2005 International Conference on Machine Learning and Cybernetics, volume 7, pages 4038–4043 Vol. 7

  99. Zhou J, Bai T (2008) Credit risk assessment using rough set theory and ga-based svm. In: 2008 The 3rd International Conference on Grid and Pervasive Computing - Workshops, p 320–325

  100. Jiang Ming-hui, Yuan Xu-chuan (2007) Construction and application of pso-svm model for personal credit scoring. In Yong Shi, Geert Dick van Albada, Jack Dongarra, and Peter M. A. Sloot, editors, Computational Science – ICCS 2007, pages 158–161, Berlin, Heidelberg, 2007. Springer Berlin Heidelberg

  101. Hao Yanyou, Chi Zhongxian, Yan Deqin, Yue Xun (2007) An improved fuzzy support vector machine for credit rating. In: Li Keqiu, Jesshope Chris, Jin Hai, Gaudiot Jean-Luc, (eds), Network and Parallel Computing, pages 495–505, Berlin, Heidelberg. Springer Berlin Heidelberg

  102. Van Gestel Tony, Baesens Bart, Van Dijcke Peter, Suykens Johan A K, Garcia Joao (2005) Linear and non-linear credit scoring by combining logistic regression and support vector machines

  103. Alaraj M, Abbod M (2016) Classifiers consensus system approach for credit scoring. Knowl-Based Syst 104:04

    Google Scholar 

  104. Zhen W, Wenjuan S (2016) Commercial bank credit risk assessment method based on improved svm. In: 2016 International Conference on Intelligent Transportation, Big Data Smart City (ICITBS), p 353–356

  105. Zięba Maciej, Świątek Jerzy (2012) Ensemble classifier for solving credit scoring problems. In Camarinha-Matos Luis M, Shahamatnia Ehsan, Nunes Gonçalo, (eds), Technological Innovation for Value Creation, p 59–66, Berlin, Heidelberg. Springer Berlin Heidelberg

  106. Lai Kin Keung, Yu Lean, Wang Shouyang, Zhou Ligang (2006) Credit risk analysis using a reliability-based neural network ensemble model. In: Kollias Stefanos, Stafylopatis Andreas, Duch Włodzisław, Oja Erkki, (eds), Artificial Neural Networks – ICANN 2006, p 682–690, Berlin, Heidelberg. Springer Berlin Heidelberg

  107. Hsieh Nan-Chen, Hung Lun-Ping, Ho Chia-Ling (2009) A data driven ensemble classifier for credit scoring analysis. In: Theeramunkong Thanaruk, Kijsirikul Boonserm, Cercone Nick, Ho Tu-Bao, (eds), Advances in Knowledge Discovery and Data Mining, pages 351–362, Berlin, Heidelberg. Springer Berlin Heidelberg

  108. Chen H, Jiang M, Wang X (2017) Bayesian ensemble assessment for credit scoring. In: 2017 4th International Conference on Industrial Economics System and Industrial Security Engineering (IEIS), p 1–5

  109. Chou T (2007) A novel prediction model for credit card risk management. In: Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007), p 211–211

  110. Makrygianni Ira I, Markopoulos Angelos P (2016) Loan evaluation applying artificial neural networks. In: Proceedings of the SouthEast European Design Automation, Computer Engineering, Computer Networks and Social Media Conference, SEEDA-CECNSM ’16, pages 124–128, New York, NY, USA. ACM

  111. Zhang Z (2011) Research of default risk of commercial bank’s personal loan based on rough sets and neural network. In: 2011 3rd International Workshop on Intelligent Systems and Applications, p 1–4

  112. Miglionico Maria Cristina, Parillo Fernando (2012) An application in bank credit risk management system employing a bp neural network based on sfloat24 custom math library using a low cost fpga device. In: Salvatore Greco, Bernadette Bouchon-Meunier, Giulianella Coletti, Mario Fedrizzi, Benedetto Matarazzo, and Ronald R. Yager, editors, Advances in Computational Intelligence, pages 84–93, Berlin, Heidelberg, 2012. Springer Berlin Heidelberg

  113. Chen Ya-Qi, Zhang Jianjun, Ng Wing (2018) Loan default prediction using diversified sensitivity undersampling. 240–245, 07

  114. Feki A, Ishak A, Feki S (2012) Feature selection using bayesian and multiclass support vector machines approaches: Application to bank risk prediction. Expert Syst Appl 39:3087–3099, 02

    Article  Google Scholar 

  115. Ribeiro B, Silva C, Chen N, Vieira A, Carvalho das Neves J (2012) Enhanced default risk models with svm+. Expert Syst Appl 39:10140–10152, 09

    Article  Google Scholar 

  116. Oguz H T, Gurgen F S (2008) Credit risk analysis using hidden markov model. In: 2008 23rd International Symposium on Computer and Information Sciences, 1–5

  117. Ni Weijian, Liu Tong, Zeng Qingtian, Zhang Xianke, Duan Hua, Xie Nengfu (2018) Robust factorization machines for credit default prediction. In: Geng Xin, Kang Byeong-Ho, (eds), PRICAI 2018: Trends in Artificial Intelligence, pages 941–953, Cham, 2018. Springer International Publishing

  118. Masmoudi K, Abid L, Masmoudi A (2019) Credit risk modeling using bayesian network with a latent variable. Expert Syst Appl 127:03

    Article  Google Scholar 

  119. Zhao Zhenyu, Zhang Wei, Zhou Yayue (2011) National student loans credit risk assessment based on gabp algorithm of neural network. In: 2011 2nd International Conference on Artificial Intelligence, Management Science and Electronic Commerce (AIMSEC), 2196–2199

  120. Su J, Zhang Y (2017) Application of bp neural network optimization algorithm based on genetic algorithm in credit risk early-warning of commercial bank. In: 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), p 487–491

  121. Yao Ping, Wu Chong, Yao Minghui (2009) Credit risk assessment model of commercial banks based on fuzzy neural network. In: Yu Wen, He Haibo, Zhang Nian (eds), Advances in Neural Networks – ISNN 2009, p 976–985, Berlin, Heidelberg. Springer Berlin Heidelberg

  122. Mubarek A M, Adalı E (2017) Multilayer perceptron neural network technique for fraud detection. In: 2017 International Conference on Computer Science and Engineering (UBMK), p 383–387

  123. Patil P S, Dharwadkar N V (2017) Analysis of banking data using machine learning. In: 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), p 876–881

  124. Ghobadi F, Rohani M (2016) Cost sensitive modeling of credit card fraud using neural network strategy. In: 2016 2nd International Conference of Signal Processing and Intelligent Systems (ICSPIS), pages 1–5

  125. Behera T K, Panigrahi S (2015) Credit card fraud detection: A hybrid approach using fuzzy clustering amp;amp; neural network. In: 2015 Second International Conference on Advances in Computing and Communication Engineering, pages 494–499

  126. Zhan Qing, Yin Hang (2018) A loan application fraud detection method based on knowledge graph and neural network. In: Proceedings of the 2Nd International Conference on Innovation in Artificial Intelligence, ICIAI ’18, pages 111–115, New York, NY, USA. ACM

  127. Kazemi Z, Zarrabi H (2017) Using deep networks for fraud detection in the credit card transactions. In: 2017 IEEE 4th International Conference on Knowledge-Based Engineering and Innovation (KBEI), pages 0630–0633

  128. Zamini Mohamad, Montazer Gholamali (2018) Credit card fraud detection using autoencoder based clustering. pages 486–491, 12

  129. Liu Ou, Ma Jian, Poon Pak-Lok, Zhang Jun (2009) On an ant colony-based approach for business fraud detection. In: Huang De-Shuang, Jo Kang-Hyun, Lee Hong-Hee, Kang Hee-Jun, Bevilacqua Vitoantonio, (eds), Emerging Intelligent Computing Technology and Applications, pages 1104–1111, Berlin, Heidelberg. Springer Berlin Heidelberg

  130. Charleonnan A (Oct 2016) Credit card fraud detection using rus and mrn algorithms. In: 2016 Management and Innovation Technology International Conference (MITicon), pages MIT–73–MIT–76

  131. Bouchti A E, Chakroun A, Abbar H, Okar C (2017) Fraud detection in banking using deep reinforcement learning. In: 2017 Seventh International Conference on Innovative Computing Technology (INTECH), pages 58–63

  132. Abakarim Youness, Lahby Mohamed, Attioui Abdelbaki (2018) An efficient real time model for credit card fraud detection based on deep learning. In: Proceedings of the 12th International Conference on Intelligent Systems: Theories and Applications, SITA’18, pages 30:1–30:7, New York, NY, USA. ACM

  133. Karlos Stamatis, Kostopoulos Georgios, Kotsiantis Sotiris, Tampakas Vassilis (2017) Using active learning methods for predicting fraudulent financial statements. In Giacomo Boracchi, Lazaros Iliadis, Chrisina Jayne, and Aristidis Likas, editors, Engineering Applications of Neural Networks, pages 351–362, Cham. Springer International Publishing

  134. Jiang C, Song J, Liu G, Zheng L, Luan W (2018) Credit card fraud detection: A novel approach using aggregation strategy and feedback mechanism. IEEE Internet of Things Journal 5(5):3637–3647

    Article  Google Scholar 

  135. Rahmawati D, Sarno R, Fatichah C, Sunaryono D (2017) Fraud detection on event log of bank financial credit business process using hidden markov model algorithm. In: 2017 3rd International Conference on Science in Information Technology (ICSITech), pages 35–40

  136. Khan A, Singh T, Sinhal A (2012) Implement credit card fraudulent detection system using observation probabilistic in hidden markov model. In: 2012 Nirma University International Conference on Engineering (NUiCONE), pages 1–6

  137. Gyamfi N K, Abdulai J (2018) Bank fraud detection using support vector machine. In: 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), pages 37–41

  138. Kotsiantis Sotiris, Koumanakos Euaggelos, Tzelepis Dimitris, Tampakas Vasilis (2006) Predicting fraudulent financial statements with machine learning techniques. In: Antoniou Grigoris, Potamias George, Spyropoulos Costas, Plexousakis Dimitris, (eds), Advances in Artificial Intelligence, pages 538–542, Berlin, Heidelberg. Springer Berlin Heidelberg

  139. Pang Sulin, Yuan Jinmeng (2018) Wt model & applications in loan platform customer default prediction based on decision tree algorithms. In: Huang De-Shuang, Bevilacqua Vitoantonio, Premaratne Prashan, Gupta Phalguni, (eds), Intelligent Computing Theories and Application, pages 359–371, Cham. Springer International Publishing

  140. Mareeswari V, Gunasekaran G (2016) Prevention of credit card fraud detection based on hsvm. In: 2016 International Conference on Information Communication and Embedded Systems (ICICES), pages 1–4

  141. Montini Denis, Matuck Gustavo, de Avila Montini Alessandra, Cunha Adilson, Ribeiro Alexandre, Dias Luiz (2013) A sampling diagnostics model for neural system training optimization. Proceedings of the 2013 10th International Conference on Information Technology: New Generations, ITNG 2013, 04

  142. Kamaruddin Sk, Ravi Vadlamani (2016) Credit card fraud detection using big data analytics: Use of psoaann based one-class classification. In: Proceedings of the International Conference on Informatics and Analytics, ICIA-16, pages 33:1–33:8, New York, NY, USA. ACM

  143. Malhotra Rashmi, Malhotra DK (2003) Evaluating consumer loans using neural networks. Omega, 31:83–96, 04

  144. Huang S, Day M (2013) A comparative study of data mining techniques for credit scoring in banking. In: 2013 IEEE 14th International Conference on Information Reuse Integration (IRI), pages 684–691

  145. Khemakhem Sihem, Boujelbene Younes (2017) Artificial intelligence for credit risk assessment: Artificial neural network and support vector machines. ACRN Oxford Journal of Finance and Risk Perspectives, 6:1–17, 01

  146. Nwulu Nnamdi I, Oroja Shola, Ilkan Mustafa (2011) Credit scoring using soft computing schemes: A comparison between support vector machines and artificial neural networks. In: Ariwa Ezendu , El-Qawasmeh Eyas, (eds), Digital Enterprise and Information Systems, pages 275–286, Berlin, Heidelberg. Springer Berlin Heidelberg

  147. FISHER R A The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7(2):179–188

  148. Wiginton JC (1980) A note on the comparison of logit and discriminant models of consumer credit behavior. Journal of Financial and Quantitative Analysis 15(3):757–770

    Article  Google Scholar 

  149. Grablowsky Bernie J, Talley Wayne K (1981) Probit and discriminant functions for classifying credit applicants: a comparison

  150. Glover Fred Improved linear programming models for discriminant analysis*. Decision Sciences, 21(4):771–785

  151. Mangasarian OL (1965) Linear and nonlinear separation of patterns by linear programming. Oper Res 13(3):444–452

    Article  MathSciNet  MATH  Google Scholar 

  152. Henley WE, Hand DJ (1996) A k-nearest-neighbour classifier for assessing consumer credit risk. The Statistician 45(1):77

    Article  Google Scholar 

  153. Mays E (2001) Handbook of Credit Scoring. Global Professional Publishing, Business Series

    Google Scholar 

  154. Li S-T, Shiue W, Huang M-H (2006) The evaluation of consumer loans using support vector machines. Expert Syst Appl 30:772–782, 05

    Article  Google Scholar 

  155. Hand D J, Henley W E Statistical classification methods in consumer credit scoring: a review. Journal of the Royal Statistical Society: Series A (Statistics in Society), 160(3):523–541

  156. Pandey T N, Jagadev A K, Mohapatra S K, Dehuri S (2017) Credit risk analysis using machine learning classifiers. In: 2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS), pages 1850–1854

  157. Singh P (2017) Comparative study of individual and ensemble methods of classification for credit scoring. In: 2017 International Conference on Inventive Computing and Informatics (ICICI), pages 968–972

  158. Ravisankar P, Ravi V, Raghava Rao G, Bose I (2011) Detection of financial statement fraud and feature selection using data mining techniques. Decis Support Syst 50(2):491–500

    Article  Google Scholar 

  159. Why SVM is quadratic. https://tinyurl.com/y2xre5zn. Accessed: 2019-08-13

  160. Baesens B, Setiono R, Mues C, Vanthienen J (2003) Using neural network rule extraction and decision tables for credit-risk evaluation. Manage Sci 49(3):312–329

    Article  MATH  Google Scholar 

  161. Crouhy M, Galai D, Mark R (2000) A comparative analysis of current credit risk models. J Bank Finance 24:59–117, 01

    Article  MATH  Google Scholar 

  162. Lessmann S, Baesens B, Seow H-V, Thomas LC (2015) Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. Eur J Oper Res 247:124–136

    Article  MATH  Google Scholar 

  163. Universities working in the area of credit risk evaluation. https://tinyurl.com/CreditRiskUniversities. Accessed: 2019-08-13

  164. Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1(1):67–82

    Article  Google Scholar 

  165. Hung C, Chen J-H (2009) A selective ensemble based on expected probabilities for bankruptcy prediction. Expert Syst Appl 36:5297–5303, 04

    Article  Google Scholar 

  166. Yu L, Wang S, Lai KK (2008) Credit risk assessment with a multistage neural network ensemble learning approach. Expert Syst Appl 34(2):1434–1444

    Article  Google Scholar 

  167. Wang G, Hao J, Ma J, Jiang H (2011) A comparative assessment of ensemble learning for credit scoring. Expert Syst Appl 38(1):223–230

    Article  Google Scholar 

  168. Piramuthu S (2006) On preprocessing data for financial credit risk evaluation. Expert Syst Appl 30:489–497, 04

    Article  Google Scholar 

  169. Krishna, Gutha Jaya and Ravi, Vadlamani. Feature Subset Selection Using Adaptive Differential Evolution: An Application to Banking. Proceedings of the ACM India Joint International Conference on Data Science and Management of Data, 157–163, 2019

  170. Popat R R, Chaudhary J (2018) A Survey on Credit Card Fraud Detection Using Machine Learning 2nd International Conference on Trends in Electronics and Informatics (ICOEI), 1120–1125

  171. Dastile Xolani, Celik Turgay, Potsane Moshe (2020) Statistical and machine learning models in credit scoring: A systematic literature survey http://www.sciencedirect.com/science/article/pii/ S1568494620302039, 106263

  172. Bakshi S (2018) Credit Card Fraud Detection : A classification analysis 2018 2nd International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud), 152–156

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Siddharth Bhatore.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bhatore, S., Mohan, L. & Reddy, Y.R. Machine learning techniques for credit risk evaluation: a systematic literature review. J BANK FINANC TECHNOL 4, 111–138 (2020). https://doi.org/10.1007/s42786-020-00020-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s42786-020-00020-3

Keywords

Navigation