Abstract
Improving customer experience is crucial in any industry, especially in telecommunications, where competition is a constant factor. Today, all telecommunications companies rely on the massive amount of data generated daily to get to know the customer or study their behavior and thus create new effective strategies for their business. Within the most varied user experiences, the process of installing new services can be an event that raises doubts about their operation, degrade the user experience, or, in extreme cases, lead to maintenance interventions. Therefore, the use of advanced predictive models that can predict such occurrences become vital. With this, the company can anticipate the cases that will be problematic and reduce the number of negative experiences. The main objective of this work is to create a predictive model that, through all the available data history, can predict which customers will contact the customer service with problems derived from the installation process and have a following maintenance intervention. After analyzing an unbalanced dataset with approximately 560K entries from a Portuguese telecommunications company, and resorting to the CRISP-DM methodology for modeling, the best results were found with LightGBM which obtained an AUPRC of 0.11 and AUROC of 0.62. The best trade-off between precision and recall was found with a threshold model of 0.43 in order to maximize recall while still avoiding a large number of false negatives.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Loveman, G.: Diamonds in the data mine. Harvard Bus. Rev. 81, 108–113+130 (2003)
DECO. ‘Telecomunicações sempre no topo das reclamações’. https://www.deco.proteste.pt/institucionalemedia/imprensa/comunicados/2019/telecons-1705, Accessed 10 May 2020
Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques, 3rd edn. Morgan Kaufmann, Burlington (2011)
Weiss, G.: Data mining in the telecommunications industry. In: Wang, J. (ed.) Encyclopedia of Data Warehousing and Mining, vol. 1, pp. 486–491, 2nd ed. Information Science Publishing (2008)
Neto, C., Brito, M., Lopes, V., Peixoto, H., Abelha, A., Machado, J.: Application of data mining for the prediction of mortality and occurrence of complications for gastric cancer patients. Entropy, 21, MDPI AG (2019)
Prata, M., Peixoto, H., Machado, J., Abelha, A.: Data mining in urgency department: medical specialty discharge prediction. In: 16th International Industrial Simulation Conference 2018, ISC (2018)
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann, Burlington (2011)
scikit-learn. Compare the effect of different scalers on data with outliers (2020). https://scikit-learn.org/stable/auto_examples/preprocessing/plot_all_scaling.html. Accessed 15 May 2020
Herrera, V.M., Khoshgoftaar, T.M., Villanustre, F. et al.: Random forest implementation and optimization for Big Data analytics on LexisNexis’s high performance computing cluster platform. J. Big Data 6, 68 (2019)
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 281–305 (2012)
Branco, P., Torgo, L., Ribeiro, R.P.: A survey of predictive modeling on imbalanced domains. ACM Comput. Surv. 49(2) (2016). Article 31
Fawcett, T.: An introduction to ROC analysis. Pattern Recogn. Lett. 27(8), 861–874 (2006)
Acknowledgments
This work has been supported by FCT – Fundação para a Ciência e Tecnologia within the RD Units Project Scope: UIDB/00319/2020.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Costa, D., Pereira, C., Peixoto, H., Machado, J. (2020). Anticipating Maintenance in Telecom Installation Processes. In: Analide, C., Novais, P., Camacho, D., Yin, H. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2020. IDEAL 2020. Lecture Notes in Computer Science(), vol 12490. Springer, Cham. https://doi.org/10.1007/978-3-030-62365-4_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-62365-4_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-62364-7
Online ISBN: 978-3-030-62365-4
eBook Packages: Computer ScienceComputer Science (R0)