Abstract
Predicting in advance whether a given customer will end his relationship with a company has an undeniable added value for all organizations, since targeted campaigns can be prepared to promote customer retention. In this work, six different methods using machine learning have been investigated on the retail banking customer churn prediction problem, considering predictions up to 6 months in advance. Different approaches are tested and compared using real data. Out of sample results are very good, even with very challenging out-of-sample sets composed only of churners, that truly test the ability to predict when a customer will churn. The best results are obtained by stochastic boosting, and the most important variables for predicting churn in a 1–2 months horizon are the total value of bank products held in recent months and the existence of debit or credit cards in another bank. For a 3–4 months horizon, the number of transactions in recent months and the existence of a mortgage loan outside the bank are the most important variables.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
De Caigny, A., Coussement, K., De Bock, K.W.: A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees. Eur. J. Oper. Res. 269, 760–772 (2018)
Ganesh, J., Arnold, M.J., Reynolds, K.E.: Understanding the customer base of service providers: an examination of the differences between switchers and stayers. J. Mark. 64, 65–87 (2000)
Reinartz, W.J., Kumar, V.: The impact of customer relationship characteristics on profitable lifetime duration. J. Mark. 67, 77–99 (2003)
Colgate, M., Stewart, K., Kinsella, R.: Customer defection: a study of the student market in Ireland. Int. J. Bank Mark. 14, 23–29 (1996)
Nitzan, I., Libai, B.: Social effects on customer retention. J. Mark. 75, 24–38 (2011)
Verbraken, T., Bravo, C., Weber, R., Baesens, B.: Development and application of consumer credit scoring models using profit-based classification measures. Eur. J. Oper. Res. 238, 505–513 (2014)
Verbeke, W., Martens, D., Mues, C., Baesens, B.: Building comprehensible customer churn prediction models with advanced rule induction techniques. Expert Syst. Appl. 38, 2354–2364 (2011)
Gür Ali, Ö., Arıtürk, U.: Dynamic churn prediction framework with more effective use of rare event data: The case of private banking. Expert Syst. Appl. 41, 7889–7903 (2014)
Moro, S., Cortez, P., Rita, P.: Business intelligence in banking: A literature analysis from 2002 to 2013 using text mining and latent Dirichlet allocation. Expert Syst. Appl. 42, 1314–1324 (2015)
Óskarsdóttir, M., Van Calster, T., Baesens, B., Lemahieu, W., Vanthienen, J.: Time series for early churn detection: Using similarity based classification for dynamic networks. Expert Syst. Appl. 106, 55–65 (2018)
Tsai, C.-F., Chen, M.-Y.: Variable selection by association rules for customer churn prediction of multimedia on demand. Expert Syst. Appl. 37, 2006–2015 (2010)
Tsai, C.-F., Lu, Y.-H.: Customer churn prediction by hybrid neural networks. Expert Syst. Appl. 36, 12547–12553 (2009)
Huang, B., Kechadi, M.T., Buckley, B.: Customer churn prediction in telecommunications. Expert Syst. Appl. 39, 1414–1425 (2012)
Idris, A., Rizwan, M., Khan, A.: Churn prediction in telecom using Random Forest and PSO based data balancing in combination with various feature selection strategies. Comput. Electr. Eng. 38, 1808–1819 (2012)
Vafeiadis, T., Diamantaras, K.I., Sarigiannidis, G., Chatzisavvas, KCh.: A comparison of machine learning techniques for customer churn prediction. Simul. Model. Pract. Theor. 55, 1–9 (2015)
Coussement, K., Benoit, D.F., Van den Poel, D.: Improved marketing decision making in a customer churn prediction context using generalized additive models. Expert Syst. Appl. 37, 2132–2143 (2010)
Miguéis, V.L., Camanho, A., Falcão e Cunha, J.: Customer attrition in retailing: an application of multivariate adaptive regression splines. Expert Syst. Appl. 40, 6225–6232 (2013)
Buckinx, W., Van den Poel, D.: Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting. Eur. J. Oper. Res. 164, 252–268 (2005)
Chen, Kuanchin., Hu, Ya-Han, Hsieh, Yi-Cheng: Predicting customer churn from valuable B2B customers in the logistics industry: a case study. Inf. Syst. e-Bus. Manage. 13(3), 475–494 (2014). https://doi.org/10.1007/s10257-014-0264-1
Clemente-Císcar, M., San Matías, S., Giner-Bosch, V.: A methodology based on profitability criteria for defining the partial defection of customers in non-contractual settings. Eur. J. Oper. Res. 239, 276–285 (2014)
Mavri, M., Ioannou, G.: Customer switching behaviour in Greek banking services using survival analysis. Manag. Financ. 34, 186–197 (2008)
Larivière, B., Van den Poel, D.: Investigating the role of product features in preventing customer churn, by using survival analysis and choice modeling: the case of financial services. Expert Syst. Appl. 27, 277–285 (2004)
Larivière, B., Van den Poel, D.: Predicting customer retention and profitability by using random forests and regression forests techniques. Expert Syst. Appl. 29, 472–484 (2005)
Glady, N., Baesens, B., Croux, C.: Modeling churn using customer lifetime value. Eur. J. Oper. Res. 197, 402–411 (2009)
Xie, Y., Li, X., Ngai, E.W.T., Ying, W.: Customer churn prediction using improved balanced random forests. Expert Syst. Appl. 36, 5445–5449 (2009)
De Bock, K.W., Van den Poel, D.: Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models. Expert Syst. Appl. 39, 6816–6826 (2012)
De Bock, K.W., den Poel, D.V.: An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction. Expert Syst. Appl. 38, 12293–12301 (2011)
Shirazi, F., Mohammadi, M.: A big data analytics model for customer churn prediction in the retiree segment. Int. J. Inf. Manage. 48, 238–253 (2018)
Farquad, M.A.H., Ravi, V., Raju, S.B.: Churn prediction using comprehensible support vector machine: an analytical CRM application. Appl. Soft Comput. 19, 31–40 (2014)
Lin, C.-S., Tzeng, G.-H., Chin, Y.-C.: Combined rough set theory and flow network graph to predict customer churn in credit card accounts. Expert Syst. Appl. 38, 8–15 (2011)
Nie, G., Rowe, W., Zhang, L., Tian, Y., Shi, Y.: Credit card churn forecasting by logistic regression and decision tree. Expert Syst. Appl. 38, 15273–15285 (2011)
Van den Poel, D., Larivière, B.: Customer attrition analysis for financial services using proportional hazard models. Eur. J. Oper. Res. 157, 196–217 (2004). https://doi.org/10.1016/S0377-2217(03)00069-9
Burez, J., Van den Poel, D.: Handling class imbalance in customer churn prediction. Expert Syst. Appl. 36, 4626–4636 (2009)
Chen, C.: Using Random Forest to Learn Imbalanced Data. University of California, Berkeley (2004)
Weiss, G.M.: Mining with rarity: a unifying framework. ACM SIGKDD Explor. Newslett. 6, 7 (2004)
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Ann. Stat. 28, 337–407 (2000)
Friedman, J.H.: Multivariate adaptive regression splines. Ann. Stat. 1–67 (1991)
Verbeke, W., Dejaeger, K., Martens, D., Hur, J., Baesens, B.: New insights into churn prediction in the telecommunication sector: A profit driven data mining approach. Eur. J. Oper. Res. 218, 211–229 (2012)
Acknowledgments
This study has been funded by national funds, through FCT, Portuguese Science Foundation, under project UIDB/00308/2020.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Dias, J., Godinho, P., Torres, P. (2020). Machine Learning for Customer Churn Prediction in Retail Banking. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2020. ICCSA 2020. Lecture Notes in Computer Science(), vol 12251. Springer, Cham. https://doi.org/10.1007/978-3-030-58808-3_42
Download citation
DOI: https://doi.org/10.1007/978-3-030-58808-3_42
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58807-6
Online ISBN: 978-3-030-58808-3
eBook Packages: Computer ScienceComputer Science (R0)