Skip to main content
Log in

Data mining in retail credit

  • Published:
Operational Research Aims and scope Submit manuscript

Abstract

This article presents a real-world application of a data mining approach to credit scoring. It describes the development and the validation of a decision tree, which aims to discriminate between good and bad accounts of Littlewoods Home Shopping customers based on a sample of orders placed between January and November of 2000.

This decision tree was constructed for the orders referred to the Authorisations Department. It showed a great improvement in performance compared to the current manual decisions taken for the orders referred to this Department. The implementation of this tree indicates that Authorisation Advisors should apply a set of simple rules in order to optimise their decision making process. The methodology of the decision tree construction is presented in detail. Furthermore the article discusses alternative approaches to credit scoring. Logistic regression is the most widely used technique and it can be used as a benchmarking to assess competing approaches in credit scoring. Using the Receiver Operating Characteristic (ROC) curve as a performance measure of predictive accuracy, the superiority of the decision tree model against the logistic regression model is indicated.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. SAS Enterprise Miner Software, Version 3.0,

  2. Fisher, R.A (1936). The use of multiple measurements in taxonomic problems.Annals of Eugenics 7, 179–188.

    Google Scholar 

  3. Reichert AK, Cho CC and Wagner GM (1983). An examination of the conceptual issues involved in developing credit-scoring models.Journal of Business and Economic Statistics 1: 101–114.

    Article  Google Scholar 

  4. Eisenbeis RA (1997). Pitfalls in the application of discriminant analysis in business, finance and economics.Journal of Finance 32: 875–900.

    Article  Google Scholar 

  5. Wiginton JC (1994). A note on the comparison of logit and discriminant models of consumer credit behaviour.Journal of Financial and Quantitative Analysis 15: 757–770.

    Article  Google Scholar 

  6. Henley W.E (1994). Statistical aspects of credit scoring, PhD Thesis, The Open University, Milton Keynes, U.K.

    Google Scholar 

  7. McClelland J.L and Rumelhart D.E (1986) Parallel Distributed Processing, Volume 1. MIT Bradford Press.

  8. Hart A (1992). Using neural networks for classification tasks — some experiments on datasets and practical advice.Journal of Operational Research Society 43: 215–226.

    Article  Google Scholar 

  9. Yoon Y, Swales JR G Margavio TM (1993). A comparison of discriminant analysis versus artificial neural networks.Journal of Operational Research Society 44: 51–60.

    Article  Google Scholar 

  10. Slowisnki R and Zopounidis C (1995). Application of the rough set approach to evaluation of bankruptcy risk.International Journal of Intelligent Systems in Accounting, Finance and Management 4, 27–41.

    Google Scholar 

  11. Dimitras A.I., Slowinski R, Susmaga R and Zopounidis C (1999). Business failure prediction using rough sets.European Journal of Operational Research 114 (2), 263–280.

    Article  Google Scholar 

  12. Doumbos M. and Zopounidis C. (2002). Multicriteria Decision Aid Methodology Classification Methods,Kluwer Academic Publishers, Dordrecht.

    Google Scholar 

  13. Zopounidis C., Pardalos P.M., Doumbos M, Mavridou Th. (1998). “Multicriteria decision aid in credit cards assessment”, in: C. Zopounidis and P.M. Pardalos (eds.), Managing in Uncertainty: Theory and Practice,Kluwer Academic Publishers, Dordrecht, 163–173.

    Google Scholar 

  14. Zopounidis and Doumbos (2000), “PREFDIS: A multicriteria decision support system for sorting decision problems”,Computers and Operational Research, 27 (7–8), 779–797.

    Article  Google Scholar 

  15. Zopounidis C, Doumbos M. (2001). Multi-group discrimination using multi-criteria analysis: Illustrations from the field of finance.European Journal of Operational Research 139 (2), 371–389.

    Article  Google Scholar 

  16. Matsatsinis N. (2003). CCAS: An Intelligent Decision Support System for Credit Card Assessment.Journal of Multi-Criteria Decision Analysis 11, 213–235.

    Article  Google Scholar 

  17. Liang P, Chandler J, Han I(1990). Integrating Statistical and Inductive Methods for Knowledge Acquisition.Expert Systems with Applications 1 (4).

  18. Yu W (1992). ELECTRE TRI: Aspects méthodologiques et manuel d’utilisation.Document du LAMSADE, 14, Université de Paris-Dauphine.

  19. Fogarty T.C and Ireson N.S (1993). Evolving Bayesian classifiers for credit control — a comparison with other machine learning methods,IMA Journal of Mathematics Applied in Business and Industry 2, 63–76.

    Google Scholar 

  20. Desai V.S, Convay D.G, Crook J.N, Overstreet G.A (1997). Credit Scoring models in the credit union environment using neural networks and genetic algorithms,IMA Journal of Mathematics applied in Business and Industry 8, 323–346.

    Google Scholar 

  21. Chaterjee S and Barcun S (1970). A non parametric approach to credit screening,Journal of American Statistical Assoc. 65, 150–154.

    Article  Google Scholar 

  22. Freidman J.H (1979). A tree-structured approach to nonparametric multiple regression. In smoothing Techniques for Curve Estimation, ed: Gasser, Th. And Rosenblatt, M., New York: Spinger Verlag, 5–22.

    Google Scholar 

  23. Brieman L., Friedman J.H, Olshen R.A and Stone C.J (1984). Classification and Regression Trees. Wadsworth International.

  24. Makowski P (1985). Credit Scoring Branches Out. Credit World. 75, 30–37.

    Google Scholar 

  25. Coffman J.Y (1986). The proper role of Tree Analysis in Forecasting the Risk behavior of borrowers, MDS Reports, Management Decision System, Atlanta, GA, 3, 3, 7, 9.

  26. Srinivasan V. and Kim Y.H (1987). Credit granting: comparative analysis of classification procedures.Journal of Finance 92, 665–681.

    Article  Google Scholar 

  27. Boyle M., Crook J.N., Hamilton R and Thomas L.C (1992). Methods for credit scoring applied to slow payers. In Proceedings of the IMA conference on credit scoring and credit control, ed: Thomas L.C, Crook J.N and Edelman D.B. 75–90. Clarendon Press, Oxford.

    Google Scholar 

  28. Kass, G.V. (1980). An exploratory technique for investigating large quantities of categorical data.Applied Statistics 29, 119–127.

    Article  Google Scholar 

  29. Thomas L.C, Edelman D. B., Crook J. N (2002). Credit Scoring and its Applications, Society for Industrial and Applied Mathematics (SIAM™).

  30. Wilkinson L (1979). Tests of significance in stepwise regression.Psychological Bulleting, 86 168–174.

    Article  Google Scholar 

  31. Scallan G (1997). Making the Tools — Quality in Scorecard Building.Score Plus Training Seminar, 14–16 May 1997.

  32. Hanley, John A. (1989). “Receiver Operating Characteristic (ROC) Methodology: The State of the Art.”Critical Reviews in Diagnostic Imaging, 29, 3

    Google Scholar 

  33. Thomas L.C (1999). “A survey of credit and behavioural scoring: Forecasting financial risk of lending to consumers”. Working paper 99/2, School of Management, University of Edinburgh.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Georgios Sarantopoulos.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sarantopoulos, G. Data mining in retail credit. Oper Res Int J 3, 99–122 (2003). https://doi.org/10.1007/BF02940280

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02940280

Keywords

Navigation