Data mining in retail credit

Sarantopoulos, Georgios

doi:10.1007/BF02940280

Data mining in retail credit

Published: May 2003

Volume 3, pages 99–122, (2003)
Cite this article

Operational Research Aims and scope Submit manuscript

Georgios Sarantopoulos¹

251 Accesses
1 Citation
3 Altmetric
Explore all metrics

Abstract

This article presents a real-world application of a data mining approach to credit scoring. It describes the development and the validation of a decision tree, which aims to discriminate between good and bad accounts of Littlewoods Home Shopping customers based on a sample of orders placed between January and November of 2000.

This decision tree was constructed for the orders referred to the Authorisations Department. It showed a great improvement in performance compared to the current manual decisions taken for the orders referred to this Department. The implementation of this tree indicates that Authorisation Advisors should apply a set of simple rules in order to optimise their decision making process. The methodology of the decision tree construction is presented in detail. Furthermore the article discusses alternative approaches to credit scoring. Logistic regression is the most widely used technique and it can be used as a benchmarking to assess competing approaches in credit scoring. Using the Receiver Operating Characteristic (ROC) curve as a performance measure of predictive accuracy, the superiority of the decision tree model against the logistic regression model is indicated.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

SAS Enterprise Miner Software, Version 3.0,
Fisher, R.A (1936). The use of multiple measurements in taxonomic problems.Annals of Eugenics 7, 179–188.
Google Scholar
Reichert AK, Cho CC and Wagner GM (1983). An examination of the conceptual issues involved in developing credit-scoring models.Journal of Business and Economic Statistics 1: 101–114.
Article Google Scholar
Eisenbeis RA (1997). Pitfalls in the application of discriminant analysis in business, finance and economics.Journal of Finance 32: 875–900.
Article Google Scholar
Wiginton JC (1994). A note on the comparison of logit and discriminant models of consumer credit behaviour.Journal of Financial and Quantitative Analysis 15: 757–770.
Article Google Scholar
Henley W.E (1994). Statistical aspects of credit scoring, PhD Thesis, The Open University, Milton Keynes, U.K.
Google Scholar
McClelland J.L and Rumelhart D.E (1986) Parallel Distributed Processing, Volume 1. MIT Bradford Press.
Hart A (1992). Using neural networks for classification tasks — some experiments on datasets and practical advice.Journal of Operational Research Society 43: 215–226.
Article Google Scholar
Yoon Y, Swales JR G Margavio TM (1993). A comparison of discriminant analysis versus artificial neural networks.Journal of Operational Research Society 44: 51–60.
Article Google Scholar
Slowisnki R and Zopounidis C (1995). Application of the rough set approach to evaluation of bankruptcy risk.International Journal of Intelligent Systems in Accounting, Finance and Management 4, 27–41.
Google Scholar
Dimitras A.I., Slowinski R, Susmaga R and Zopounidis C (1999). Business failure prediction using rough sets.European Journal of Operational Research 114 (2), 263–280.
Article Google Scholar
Doumbos M. and Zopounidis C. (2002). Multicriteria Decision Aid Methodology Classification Methods,Kluwer Academic Publishers, Dordrecht.
Google Scholar
Zopounidis C., Pardalos P.M., Doumbos M, Mavridou Th. (1998). “Multicriteria decision aid in credit cards assessment”, in: C. Zopounidis and P.M. Pardalos (eds.), Managing in Uncertainty: Theory and Practice,Kluwer Academic Publishers, Dordrecht, 163–173.
Google Scholar
Zopounidis and Doumbos (2000), “PREFDIS: A multicriteria decision support system for sorting decision problems”,Computers and Operational Research, 27 (7–8), 779–797.
Article Google Scholar
Zopounidis C, Doumbos M. (2001). Multi-group discrimination using multi-criteria analysis: Illustrations from the field of finance.European Journal of Operational Research 139 (2), 371–389.
Article Google Scholar
Matsatsinis N. (2003). CCAS: An Intelligent Decision Support System for Credit Card Assessment.Journal of Multi-Criteria Decision Analysis 11, 213–235.
Article Google Scholar
Liang P, Chandler J, Han I(1990). Integrating Statistical and Inductive Methods for Knowledge Acquisition.Expert Systems with Applications 1 (4).
Yu W (1992). ELECTRE TRI: Aspects méthodologiques et manuel d’utilisation.Document du LAMSADE, 14, Université de Paris-Dauphine.
Fogarty T.C and Ireson N.S (1993). Evolving Bayesian classifiers for credit control — a comparison with other machine learning methods,IMA Journal of Mathematics Applied in Business and Industry 2, 63–76.
Google Scholar
Desai V.S, Convay D.G, Crook J.N, Overstreet G.A (1997). Credit Scoring models in the credit union environment using neural networks and genetic algorithms,IMA Journal of Mathematics applied in Business and Industry 8, 323–346.
Google Scholar
Chaterjee S and Barcun S (1970). A non parametric approach to credit screening,Journal of American Statistical Assoc. 65, 150–154.
Article Google Scholar
Freidman J.H (1979). A tree-structured approach to nonparametric multiple regression. In smoothing Techniques for Curve Estimation, ed: Gasser, Th. And Rosenblatt, M., New York: Spinger Verlag, 5–22.
Google Scholar
Brieman L., Friedman J.H, Olshen R.A and Stone C.J (1984). Classification and Regression Trees. Wadsworth International.
Makowski P (1985). Credit Scoring Branches Out. Credit World. 75, 30–37.
Google Scholar
Coffman J.Y (1986). The proper role of Tree Analysis in Forecasting the Risk behavior of borrowers, MDS Reports, Management Decision System, Atlanta, GA, 3, 3, 7, 9.
Srinivasan V. and Kim Y.H (1987). Credit granting: comparative analysis of classification procedures.Journal of Finance 92, 665–681.
Article Google Scholar
Boyle M., Crook J.N., Hamilton R and Thomas L.C (1992). Methods for credit scoring applied to slow payers. In Proceedings of the IMA conference on credit scoring and credit control, ed: Thomas L.C, Crook J.N and Edelman D.B. 75–90. Clarendon Press, Oxford.
Google Scholar
Kass, G.V. (1980). An exploratory technique for investigating large quantities of categorical data.Applied Statistics 29, 119–127.
Article Google Scholar
Thomas L.C, Edelman D. B., Crook J. N (2002). Credit Scoring and its Applications, Society for Industrial and Applied Mathematics (SIAM™).
Wilkinson L (1979). Tests of significance in stepwise regression.Psychological Bulleting, 86 168–174.
Article Google Scholar
Scallan G (1997). Making the Tools — Quality in Scorecard Building.Score Plus Training Seminar, 14–16 May 1997.
Hanley, John A. (1989). “Receiver Operating Characteristic (ROC) Methodology: The State of the Art.”Critical Reviews in Diagnostic Imaging, 29, 3
Google Scholar
Thomas L.C (1999). “A survey of credit and behavioural scoring: Forecasting financial risk of lending to consumers”. Working paper 99/2, School of Management, University of Edinburgh.

Download references

Author information

Authors and Affiliations

Department of Management Science, Lancaster University Management School, LA1 4YX, Lancaster
Georgios Sarantopoulos

Authors

Georgios Sarantopoulos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Georgios Sarantopoulos.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sarantopoulos, G. Data mining in retail credit. Oper Res Int J 3, 99–122 (2003). https://doi.org/10.1007/BF02940280

Download citation

Issue Date: May 2003
DOI: https://doi.org/10.1007/BF02940280

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Data mining in retail credit

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Application of Machine Learning in Credit Risk Scorecard

A Comparison Study of Machine Learning Algorithms for Credit Risk Prediction

Comparative Evaluation of Machine Learning Algorithms for Credit Card Fraud Detection

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Data mining in retail credit

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Application of Machine Learning in Credit Risk Scorecard

A Comparison Study of Machine Learning Algorithms for Credit Risk Prediction

Comparative Evaluation of Machine Learning Algorithms for Credit Card Fraud Detection

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation