Abstract
Various machine learning techniques have been explored for credit scoring and management, but no consistent conclusions have been drawn on which method shows the best behaviour. This paper presents an experimental analysis involving five real-world databases with several credit scoring models, including logistic regression, neural networks, support vector machines, decision trees, rule induction algorithms, Bayesian models, k nearest neighbours decision rule, and classifier ensembles. Particularly, we analyse the performance of this set of algorithms by means of a non-parametric statistical test and two post-hoc procedures for making pairwise comparisons.
Keywords
- Support Vector Machine
- Radial Basis Function
- Random Forest
- Credit Risk
- Radial Basis Function Neural Network
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abdou, H.A.: An evaluation of alternative scoring models in private banking. The Journal of Risk Finance 10(1), 38–53 (2009)
Abrahams, C.R., Zhang, M.: Fair Lending Compliance: Intelligence and Implications for Credit Risk Management. Wiley, Hoboken (2008)
Alpaydin, E.: Introduction to Machine Learning. MIT Press, Cambridge (2010)
Altman, E.I.: Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. Journal of Finance 23(4), 589–611 (1968)
Baesens, B., Gestel, T.V., Viaene, S., Stepanova, M., Suykens, J., Vanthienen, J.: Benchmarking state-of-the-art classification algorithms for credit scoring. Journal of the Operational Research Society 54(6), 627–635 (2003)
Bellotti, T., Crook, J.N.: Support vector machines for credit scoring and discovery of significant features. Expert Systems with Applications 36(2), 3302–3308 (2009)
Bensic, M., Sarlija, N., Zekic-Susac, M.: Modelling small-business credit scoring by using logistic regression, neural networks and decision trees. Intelligent Systems in Accounting, Finance and Management 13(3), 133–150 (2005)
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research 7(1), 1–30 (2006)
Desai, V.S., Crook, J.N., Overstreet, G.A.: A comparison of neural networks and linear scoring models in the credit union environment. European Journal of Operational Research 95(1), 24–37 (1996)
Elkan, C.: The foundations of cost-sensitive learning. In: Proc. 17th Intl. Joint Conf. Artificial Intelligence, Seattle, WA, pp. 973–978 (2001)
Elsayad, A.M.: Implementing automated prediction systems for credit scoring. ICGST International Journal on Automatic Control and Systems Engineering 10(1), 11–19 (2010)
Ferri, C., Hernández-Orallo, J., Modroiu, R.: An experimental comparison of performance measures for classification. Pattern Recognition Letters 30(1), 27–38 (2009)
Frank, A., Asuncion, A.: UCI Machine Learning Database Repository (2010), http://archive.ics.uci.edu/ml
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explorations Newsletter 11(1), 10–18 (2009)
Hand, D.J.: Good practice in retail credit scorecard assessment. Journal of the Operational Research Society 56(9), 1109–1117 (2005)
Huang, Z., Chen, H., Hsu, C.J., Chen, W.H., Wu, S.: Credit rating analysis with support vector machines and neural networks: A market comparative study. Decision Support Systems 37(4), 543–558 (2004)
Japkowicz, N., Shah, M.: Evaluating Learning Algorithms: A Classification Perspective. Cambridge University Press, New York (2011)
Lee, T.S., Chen, I.F.: A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines. Expert Systems with Applications 28(4), 743–752 (2005)
Pietruszkiewicz, W.: Dynamical systems and nonlinear Kalman filtering applied in classification. In: Proc. of 7th IEEE International Conference on Cybernetic Intelligent Systems, London, UK, pp. 263–268 (2008)
Sabzevari, H., Soleymani, M., Noorbakhsh, E.: A comparison between statistical and data mining methods for credit scoring in case of limited available data. In: Proc. of the 3rd CRC Credit Scoring Conference, Edinburgh, UK (2007)
Sheskin, D.J.: Handbook of Parametric and Nonparametric Statistical Procedures. CRC Press, Boca Raton (2011)
Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Infornation Processing & Management 45(4), 427–437 (2009)
Thomas, L.C., Edelman, D.B., Crook, J.N.: Credit Scoring and Its Applications. SIAM, Philadelphia (2002)
Yang, Z., Wang, Y., Bai, Y., Zhang, X.: Measuring scorecard performance. In: Proc. 4th Intl. Conf. Computational Science, Krakow, Poland, pp. 900–906 (2004)
Yobas, M.B., Crook, J.N., Ross, P.: Credit scoring using neural and evolutionary techniques. IMA Journal of Mathematics Applied in Business and Industry 11(4), 111–125 (2000)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
García, V., Marqués, A.I., Sánchez, J.S. (2012). Non-parametric Statistical Analysis of Machine Learning Methods for Credit Scoring. In: Casillas, J., Martínez-López, F., Corchado Rodríguez, J. (eds) Management Intelligent Systems. Advances in Intelligent Systems and Computing, vol 171. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30864-2_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-30864-2_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30863-5
Online ISBN: 978-3-642-30864-2
eBook Packages: EngineeringEngineering (R0)