Abstract
Customer churn prediction is crucial for businesses in different industries, such as telecommunications, banking, insurance and e-commerce, because acquiring new customers is often more costly than retaining existing ones. Customer churn prediction (CCP) aims to identify customers who are likely to terminate their relationship with a business, enabling companies to take proactive measures to retain them. Machine learning approaches have emerged as a viable strategy for developing effective churn prediction models, employing past customer data to find churn predictors. A comparative study on the most popular supervised machine learning algorithms, including Logistic Regression, Decision tree and Ensemble approaches such as Bagging, Boosting, Stacking and Voting, was applied to predict customer churn in the telecommunications industry. Since the studied dataset is skewed towards non-churners, we investigated the SMOTE and SMOTEENN sampling strategies to balance the dataset. According to the findings of our study, machine learning is a viable method for predicting customer churn. Furthermore, our results show that ensemble learners outperform single-base learners, and a balanced training dataset is expected to improve the classifiers’ performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ahmad, A.K., Jafar, A., Aljoumaa, K.: Customer churn prediction in telecom using machine learning in big data platform. J. Big Data 6(1), 1–24 (2019). https://doi.org/10.1186/s40537-019-0191-6
Ahmad, A.K., Jafar, A., Aljoumaa, K.: Customer churn prediction in telecom using machine learning in big data platform. J. Big Data 6(1), 28 (2019). https://doi.org/10.1186/s40537-019-0191-6. https://journalofbigdata.springeropen.com/articles/10.1186/s40537-019-0191-6
Batista, G., Prati, R., Monard, M.C.: A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explor. 6, 20–29 (2004). https://doi.org/10.1145/1007730.1007735
Buckinx, W., Van den Poel, D.: Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting. Eur. J. Oper. Res. 164, 252–268 (2005). https://doi.org/10.1016/j.ejor.2003.12.010
Burez, J., Van den Poel, D.: Handling class imbalance in customer churn prediction. Expert Syst. Appl. 36(3), 4626–4636 (2009)
Chawla, N., Bowyer, K., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. ArXiv abs/1106.1813 (2002)
Jain, H., Yadav, G., Manoov, R.: Churn prediction and retention in banking, telecom and IT sectors using machine learning techniques. In: Patnaik, S., Yang, X.-S., Sethi, I.K. (eds.) Advances in Machine Learning and Computational Intelligence. AIS, pp. 137–156. Springer, Singapore (2021). https://doi.org/10.1007/978-981-15-5243-4_12
Kumar, S., Kumar, M.: Predicting customer churn using artificial neural network. In: Macintyre, J., Iliadis, L., Maglogiannis, I., Jayne, C. (eds.) EANN 2019. CCIS, vol. 1000, pp. 299–306. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-20257-6_25
Andrews, R.: Churn prediction in telecom sector using machine learning. Int. J. Inf. Syst. Comput. Sci. 8(2), 132–134 (2019). https://doi.org/10.30534/ijiscs/2019/31822019. http://www.warse.org/IJISCS/static/pdf/file/ijiscs31822019.pdf
Pebrianti, D., Istinabiyah, D.D., Bayuaji, L., Rusdah: Hybrid method for churn prediction model in the case of telecommunication companies. In: 2022 9th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI), pp. 161–166 (2022). https://doi.org/10.23919/EECSI56542.2022.9946535
Qureshi, S.A., Rehman, A.S., Qamar, A.M., Kamal, A., Rehman, A.: Telecommunication subscribers’ churn prediction model using machine learning. In: Eighth International Conference on Digital Information Management (ICDIM 2013), pp. 131–136 (2013). https://doi.org/10.1109/ICDIM.2013.6693977
Salunkhe, U.R., Mali, S.N.: A hybrid approach for class imbalance problem in customer churn prediction: a novel extension to under-sampling. Int. J. Intell. Syst. Appl. 10, 71–81 (2018)
Shaaban, E., Helmy, Y., Khedr, A., Nasr, M.: A proposed churn prediction model. Int. J. Eng. Res. Appl. (IJERA) 2, 693–697 (2012)
Sharma, T., Gupta, P., Nigam, V., Goel, M.: Customer churn prediction in telecommunications using gradient boosted trees. In: Khanna, A., Gupta, D., Bhattacharyya, S., Snasel, V., Platos, J., Hassanien, A.E. (eds.) International Conference on Innovative Computing and Communications. AISC, vol. 1059, pp. 235–246. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-0324-5_20
Shitole, A., Priyadarshini, I.: Survey of machine learning algorithms & its applications (2021). https://doi.org/10.5281/zenodo.5090570
Shumaly, S., Neysaryan, P., Guo, Y.: Handling class imbalance in customer churn prediction in telecom sector using sampling techniques, bagging and boosting trees. In: 2020 10th International Conference on Computer and Knowledge Engineering (ICCKE), pp. 082–087 (2020). https://doi.org/10.1109/ICCKE50421.2020.9303698
Umayaparvathi, V., Iyakutti, K.: Applications of data mining techniques in telecom churn prediction. Int. J. Comput. Appl. 42(20), 5–9 (2012). https://doi.org/10.5120/5814-8122. http://research.ijcaonline.org/volume42/number20/pxc3878122.pdf
Verbeke, W., Dejaeger, K., Martens, D., Hur, J., Baesens, B.: New insights into churn prediction in the telecommunication sector: a profit driven data mining approach. Eur. J. Oper. Res. 218(1), 211–229 (2012). https://doi.org/10.1016/j.ejor.2011.09.031. https://www.sciencedirect.com/science/article/pii/S0377221711008599
Xia, G.E., Wang, H., Jiang, Y.: Application of customer churn prediction based on weighted selective ensembles. In: 2016 3rd International Conference on Systems and Informatics (ICSAI), pp. 513–519 (2016). https://doi.org/10.1109/ICSAI.2016.7811009
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Kumar, S., Logofatu, D. (2023). Comparative Study on Customer Churn Prediction by Using Machine Learning Techniques. In: Nguyen, N.T., et al. Recent Challenges in Intelligent Information and Database Systems. ACIIDS 2023. Communications in Computer and Information Science, vol 1863. Springer, Cham. https://doi.org/10.1007/978-3-031-42430-4_28
Download citation
DOI: https://doi.org/10.1007/978-3-031-42430-4_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-42429-8
Online ISBN: 978-3-031-42430-4
eBook Packages: Computer ScienceComputer Science (R0)