Skip to main content

Machine Learning for Customer Churn Prediction in Retail Banking

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12251))

Abstract

Predicting in advance whether a given customer will end his relationship with a company has an undeniable added value for all organizations, since targeted campaigns can be prepared to promote customer retention. In this work, six different methods using machine learning have been investigated on the retail banking customer churn prediction problem, considering predictions up to 6 months in advance. Different approaches are tested and compared using real data. Out of sample results are very good, even with very challenging out-of-sample sets composed only of churners, that truly test the ability to predict when a customer will churn. The best results are obtained by stochastic boosting, and the most important variables for predicting churn in a 1–2 months horizon are the total value of bank products held in recent months and the existence of debit or credit cards in another bank. For a 3–4 months horizon, the number of transactions in recent months and the existence of a mortgage loan outside the bank are the most important variables.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. De Caigny, A., Coussement, K., De Bock, K.W.: A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees. Eur. J. Oper. Res. 269, 760–772 (2018)

    MathSciNet  MATH  Google Scholar 

  2. Ganesh, J., Arnold, M.J., Reynolds, K.E.: Understanding the customer base of service providers: an examination of the differences between switchers and stayers. J. Mark. 64, 65–87 (2000)

    Google Scholar 

  3. Reinartz, W.J., Kumar, V.: The impact of customer relationship characteristics on profitable lifetime duration. J. Mark. 67, 77–99 (2003)

    Google Scholar 

  4. Colgate, M., Stewart, K., Kinsella, R.: Customer defection: a study of the student market in Ireland. Int. J. Bank Mark. 14, 23–29 (1996)

    Google Scholar 

  5. Nitzan, I., Libai, B.: Social effects on customer retention. J. Mark. 75, 24–38 (2011)

    Google Scholar 

  6. Verbraken, T., Bravo, C., Weber, R., Baesens, B.: Development and application of consumer credit scoring models using profit-based classification measures. Eur. J. Oper. Res. 238, 505–513 (2014)

    MathSciNet  MATH  Google Scholar 

  7. Verbeke, W., Martens, D., Mues, C., Baesens, B.: Building comprehensible customer churn prediction models with advanced rule induction techniques. Expert Syst. Appl. 38, 2354–2364 (2011)

    Google Scholar 

  8. Gür Ali, Ö., Arıtürk, U.: Dynamic churn prediction framework with more effective use of rare event data: The case of private banking. Expert Syst. Appl. 41, 7889–7903 (2014)

    Google Scholar 

  9. Moro, S., Cortez, P., Rita, P.: Business intelligence in banking: A literature analysis from 2002 to 2013 using text mining and latent Dirichlet allocation. Expert Syst. Appl. 42, 1314–1324 (2015)

    Google Scholar 

  10. Óskarsdóttir, M., Van Calster, T., Baesens, B., Lemahieu, W., Vanthienen, J.: Time series for early churn detection: Using similarity based classification for dynamic networks. Expert Syst. Appl. 106, 55–65 (2018)

    Google Scholar 

  11. Tsai, C.-F., Chen, M.-Y.: Variable selection by association rules for customer churn prediction of multimedia on demand. Expert Syst. Appl. 37, 2006–2015 (2010)

    Google Scholar 

  12. Tsai, C.-F., Lu, Y.-H.: Customer churn prediction by hybrid neural networks. Expert Syst. Appl. 36, 12547–12553 (2009)

    Google Scholar 

  13. Huang, B., Kechadi, M.T., Buckley, B.: Customer churn prediction in telecommunications. Expert Syst. Appl. 39, 1414–1425 (2012)

    Google Scholar 

  14. Idris, A., Rizwan, M., Khan, A.: Churn prediction in telecom using Random Forest and PSO based data balancing in combination with various feature selection strategies. Comput. Electr. Eng. 38, 1808–1819 (2012)

    Google Scholar 

  15. Vafeiadis, T., Diamantaras, K.I., Sarigiannidis, G., Chatzisavvas, KCh.: A comparison of machine learning techniques for customer churn prediction. Simul. Model. Pract. Theor. 55, 1–9 (2015)

    Google Scholar 

  16. Coussement, K., Benoit, D.F., Van den Poel, D.: Improved marketing decision making in a customer churn prediction context using generalized additive models. Expert Syst. Appl. 37, 2132–2143 (2010)

    Google Scholar 

  17. Miguéis, V.L., Camanho, A., Falcão e Cunha, J.: Customer attrition in retailing: an application of multivariate adaptive regression splines. Expert Syst. Appl. 40, 6225–6232 (2013)

    Google Scholar 

  18. Buckinx, W., Van den Poel, D.: Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting. Eur. J. Oper. Res. 164, 252–268 (2005)

    MATH  Google Scholar 

  19. Chen, Kuanchin., Hu, Ya-Han, Hsieh, Yi-Cheng: Predicting customer churn from valuable B2B customers in the logistics industry: a case study. Inf. Syst. e-Bus. Manage. 13(3), 475–494 (2014). https://doi.org/10.1007/s10257-014-0264-1

    Article  Google Scholar 

  20. Clemente-Císcar, M., San Matías, S., Giner-Bosch, V.: A methodology based on profitability criteria for defining the partial defection of customers in non-contractual settings. Eur. J. Oper. Res. 239, 276–285 (2014)

    MathSciNet  MATH  Google Scholar 

  21. Mavri, M., Ioannou, G.: Customer switching behaviour in Greek banking services using survival analysis. Manag. Financ. 34, 186–197 (2008)

    Google Scholar 

  22. Larivière, B., Van den Poel, D.: Investigating the role of product features in preventing customer churn, by using survival analysis and choice modeling: the case of financial services. Expert Syst. Appl. 27, 277–285 (2004)

    Google Scholar 

  23. Larivière, B., Van den Poel, D.: Predicting customer retention and profitability by using random forests and regression forests techniques. Expert Syst. Appl. 29, 472–484 (2005)

    Google Scholar 

  24. Glady, N., Baesens, B., Croux, C.: Modeling churn using customer lifetime value. Eur. J. Oper. Res. 197, 402–411 (2009)

    MATH  Google Scholar 

  25. Xie, Y., Li, X., Ngai, E.W.T., Ying, W.: Customer churn prediction using improved balanced random forests. Expert Syst. Appl. 36, 5445–5449 (2009)

    Google Scholar 

  26. De Bock, K.W., Van den Poel, D.: Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models. Expert Syst. Appl. 39, 6816–6826 (2012)

    Google Scholar 

  27. De Bock, K.W., den Poel, D.V.: An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction. Expert Syst. Appl. 38, 12293–12301 (2011)

    Google Scholar 

  28. Shirazi, F., Mohammadi, M.: A big data analytics model for customer churn prediction in the retiree segment. Int. J. Inf. Manage. 48, 238–253 (2018)

    Google Scholar 

  29. Farquad, M.A.H., Ravi, V., Raju, S.B.: Churn prediction using comprehensible support vector machine: an analytical CRM application. Appl. Soft Comput. 19, 31–40 (2014)

    Google Scholar 

  30. Lin, C.-S., Tzeng, G.-H., Chin, Y.-C.: Combined rough set theory and flow network graph to predict customer churn in credit card accounts. Expert Syst. Appl. 38, 8–15 (2011)

    Google Scholar 

  31. Nie, G., Rowe, W., Zhang, L., Tian, Y., Shi, Y.: Credit card churn forecasting by logistic regression and decision tree. Expert Syst. Appl. 38, 15273–15285 (2011)

    Google Scholar 

  32. Van den Poel, D., Larivière, B.: Customer attrition analysis for financial services using proportional hazard models. Eur. J. Oper. Res. 157, 196–217 (2004). https://doi.org/10.1016/S0377-2217(03)00069-9

    Article  MATH  Google Scholar 

  33. Burez, J., Van den Poel, D.: Handling class imbalance in customer churn prediction. Expert Syst. Appl. 36, 4626–4636 (2009)

    Google Scholar 

  34. Chen, C.: Using Random Forest to Learn Imbalanced Data. University of California, Berkeley (2004)

    Google Scholar 

  35. Weiss, G.M.: Mining with rarity: a unifying framework. ACM SIGKDD Explor. Newslett. 6, 7 (2004)

    Google Scholar 

  36. Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)

    MATH  Google Scholar 

  37. Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Ann. Stat. 28, 337–407 (2000)

    MathSciNet  MATH  Google Scholar 

  38. Friedman, J.H.: Multivariate adaptive regression splines. Ann. Stat. 1–67 (1991)

    Google Scholar 

  39. Verbeke, W., Dejaeger, K., Martens, D., Hur, J., Baesens, B.: New insights into churn prediction in the telecommunication sector: A profit driven data mining approach. Eur. J. Oper. Res. 218, 211–229 (2012)

    Google Scholar 

Download references

Acknowledgments

This study has been funded by national funds, through FCT, Portuguese Science Foundation, under project UIDB/00308/2020.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pedro Godinho .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dias, J., Godinho, P., Torres, P. (2020). Machine Learning for Customer Churn Prediction in Retail Banking. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2020. ICCSA 2020. Lecture Notes in Computer Science(), vol 12251. Springer, Cham. https://doi.org/10.1007/978-3-030-58808-3_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-58808-3_42

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-58807-6

  • Online ISBN: 978-3-030-58808-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics