Abstract
This article presents a real-world application of a data mining approach to credit scoring. It describes the development and validation of a decision tree that aims to discriminate between good and bad accounts of Littlewoods Home Shopping customers, based on a sample of orders placed between January and November 2000.
This decision tree was constructed for orders referred to the Authorisations Department. It showed a great improvement in performance over the manual decisions currently taken for these orders. The implementation of this tree indicates that Authorisation Advisors should apply a set of simple rules to optimise their decision-making process. The methodology of the decision tree construction is presented in detail. Furthermore, the article discusses alternative approaches to credit scoring. Logistic regression is the most widely used technique and can serve as a benchmark for assessing competing approaches in credit scoring. Using the Receiver Operating Characteristic (ROC) curve as a measure of predictive accuracy, the article indicates that the decision tree model outperforms the logistic regression model.
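As a rough illustration of how a ROC-based comparison of two scoring models can be computed, the sketch below builds the ROC curve from a list of scores and takes the area under it by the trapezoidal rule. It is not the article's procedure: the function names `roc_points` and `auc` are hypothetical, and the labels and scores are invented for illustration rather than taken from the Littlewoods sample.

```python
from typing import List, Sequence, Tuple


def roc_points(labels: Sequence[int], scores: Sequence[float]) -> List[Tuple[float, float]]:
    """Return (false positive rate, true positive rate) pairs for each score threshold.

    labels: 1 = bad account (the class to detect), 0 = good account.
    scores: higher means the model rates the account as more likely to be bad.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one account of each class")
    # Walk through the accounts from the highest score to the lowest.
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    for i, (score, label) in enumerate(ranked):
        if label == 1:
            tp += 1
        else:
            fp += 1
        # Emit a point only after the last account in a group of tied scores.
        if i + 1 == len(ranked) or ranked[i + 1][0] != score:
            points.append((fp / n_neg, tp / n_pos))
    return points


def auc(points: List[Tuple[float, float]]) -> float:
    """Area under the ROC curve, computed with the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Invented data for illustration only (assumption, not from the article).
labels = [1, 1, 1, 0, 0, 0, 0, 0]
tree_scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2, 0.2, 0.1]   # hypothetical tree output
logit_scores = [0.7, 0.4, 0.3, 0.6, 0.5, 0.2, 0.1, 0.1]  # hypothetical logistic output

tree_auc = auc(roc_points(labels, tree_scores))
logit_auc = auc(roc_points(labels, logit_scores))
print(f"tree AUC = {tree_auc:.3f}, logistic AUC = {logit_auc:.3f}")
```

A larger area means the model ranks bad accounts above good ones more consistently. On real data both models would be scored on the same hold-out sample before their curves are compared.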
Cite this article
Sarantopoulos, G. Data mining in retail credit. Oper Res Int J 3, 99–122 (2003). https://doi.org/10.1007/BF02940280