Abstract
This paper proposes a credit scoring approval model in which feature selection, performed by Principal Component Analysis (PCA) and the ReliefF algorithm, is followed by a decision tree classifier. A Support Vector Machine (SVM) serves as the reference classifier. The proposed model was tested on the German credit dataset. The proposed signal processing cascade achieves a best accuracy of 91.67%, whereas classifiers without feature selection reach a best accuracy of only 75.35%. By comparison, the same feature selection stage (PCA and ReliefF) cascaded with an SVM classifier yields an accuracy of only 85.15%. The experimental results confirm the accuracy of the proposed model and show the importance of feature selection, and of its optimization, for credit scoring decision systems.
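To illustrate the kind of feature weighting the abstract refers to, the following is a minimal sketch of the basic Relief idea in plain Python. It is not the authors' implementation: it uses a single nearest hit and nearest miss per sampled instance and assumes binary labels and numeric features, whereas full ReliefF averages over k neighbors and handles multiple classes. The function names `relief_weights` and `top_features` and the toy data are hypothetical and unrelated to the German credit dataset.

```python
import random


def relief_weights(X, y, n_iterations=None, seed=0):
    """Simplified Relief: one nearest hit and one nearest miss per sample.

    Assumes numeric features and binary labels (an illustration only).
    """
    n, d = len(X), len(X[0])

    # Per-feature range, used to scale feature differences into [0, 1].
    spans = []
    for j in range(d):
        col = [row[j] for row in X]
        spans.append((max(col) - min(col)) or 1.0)

    def diff(a, b, j):
        return abs(a[j] - b[j]) / spans[j]

    def distance(a, b):
        # Scaled Manhattan distance over all features.
        return sum(diff(a, b, j) for j in range(d))

    rng = random.Random(seed)
    m = n_iterations or n
    weights = [0.0] * d
    for _ in range(m):
        i = rng.randrange(n)
        hit = miss = None
        hit_dist = miss_dist = float("inf")
        for k in range(n):
            if k == i:
                continue
            dist = distance(X[i], X[k])
            if y[k] == y[i] and dist < hit_dist:
                hit, hit_dist = k, dist
            elif y[k] != y[i] and dist < miss_dist:
                miss, miss_dist = k, dist
        if hit is None or miss is None:
            continue  # no same-class or other-class neighbor available
        for j in range(d):
            # Reward features that differ on the nearest miss,
            # penalize features that differ on the nearest hit.
            weights[j] += (diff(X[i], X[miss], j) - diff(X[i], X[hit], j)) / m
    return weights


def top_features(weights, k):
    """Indices of the k highest-weighted features."""
    return sorted(range(len(weights)), key=lambda j: weights[j], reverse=True)[:k]


# Toy data (hypothetical): feature 0 separates the classes, feature 1 is noise.
X = [[0.0, 0.3], [0.1, 0.9], [0.2, 0.1], [0.8, 0.8], [0.9, 0.2], [1.0, 0.5]]
y = [0, 0, 0, 1, 1, 1]

weights = relief_weights(X, y)
selected = top_features(weights, 1)
print(weights, selected)
```

In a cascade of the kind described above, the selected feature indices would then be used to reduce the input passed to the downstream classifier, such as a decision tree or an SVM.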
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Damrongsakmethee, T., Neagoe, VE. (2019). Principal Component Analysis and ReliefF Cascaded with Decision Tree for Credit Scoring. In: Silhavy, R. (eds) Artificial Intelligence Methods in Intelligent Algorithms. CSOC 2019. Advances in Intelligent Systems and Computing, vol 985. Springer, Cham. https://doi.org/10.1007/978-3-030-19810-7_9
DOI: https://doi.org/10.1007/978-3-030-19810-7_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19809-1
Online ISBN: 978-3-030-19810-7
eBook Packages: Intelligent Technologies and Robotics (R0)