Abstract
Nowadays in Peru, mobile phone companies have been affected by the problem of mobile number portability because since July 2014 customers can change their mobile operator in just 24 h. Companies look for solutions through the analysis of historical data of their customers in order to generate predictive models and to identify which customers would leave the company. However, the current way how this prediction is performed is too slow. In this paper, we show a Big Data architecture which solves the problems of the “classic architecture” using data from social networks in order to predict which customers may go to the competition company, according to their opinions. Data processing is performed by Hadoop, which implements MapReduce and can process large amounts of data in parallel way. After doing the tests and seeing the results, we got a high percentage of accuracy (90.03% of success).
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Barrientos, F., Ríos, S.A.: Aplicación de Minería de Datos para Predecir Fuga de Clientes en la Industria de las Telecomunicaciones. In: Revista Ingeniería de Sistemas, vol. XXVII, pp. 73–107 (2013)
Organismo Supervisor de Inversion Privada en Telecomunicaciones (OSIPTEL): Estado de la portabilidad numérica en el primer trimestre del 2015, Perú (2015)
Kirui, C., Kirui, H., Hong, L., Cheruiyot, W.: Predicting customer churn in mobile telephony industry using probabilistic classifiers in data mining. Int. J. Comput. Sci. Issues 10(1), 165–172 (2013)
Pérez Villanueva, P.A.: Modelo de Predicción de Fuga de Cliente de Telefonía Móvil Post Pago. In: Memoria para optar al Título de Ingeniero Civil Industrial. Departamento de Ingenieria Industrial, Universidad de Chile, Chile (2014)
Kohavi, R.: a study of cross-validation and bootstrap for accuracy estimation and model selection. In: 14th International Joint Conference on Artificial Intelligence, vol. 2, pp. 1137–1143 (1995)
Morales, G.D.F., Bifet, A.: SAMOA: scalable advanced massive online analysis. J. Mach. Learn. Res. 16(16), 149–153 (2015)
Vasuki, M., Arthi, J., Kayalvizhi, K.: Decision making using sentiment analysis from twitter. Int. J. Innovative Res. Comput. Commun. Eng. 2(12), 71–77 (2014)
Kashyap, K., Deka, C., Rakshit, S.: A review on big data, hadoop and its impact on business. Int. J. Innovatite Res. Dev. 3(12), 78–82 (2014)
Gajjar, D.: Implementing the Naive Bayes classifier in Mahout. J. Emerg. Technol. Innovative Res. 1(6), 449–454 (2014)
Mukhopadhyay, D., Agrawal, C., Maru, D., Yedale, P., Gadekar, P.: Addressing namenode scalability issue in hadoop distributed file system using cache approach. In: 2014 International Conference on Information Technology, Bhubaneswar, India, pp. 321–326 (2014)
Kumar, R., Gupta, N., Charu, S., Bansal, S., Yadav, K.: Comparison of SQL with HiveQL. Int. J. Res. Technol. Stud. 1(9), 28–30 (2014)
Banaei, S.M., Moghaddam, H.K.: Hadoop and its roles in modern image processing. Open J. Marine Sci. 4(4), 239–245 (2014)
Dutta, K., Jayapal, M.: Big data analytics for real time systems. In: Big Data Analytics Seminar, pp. 1–13 (2015)
Mangal, S.B., Goyal, V.: Text news classification system using Nave Bayes classifier. Int. J. Eng. Sci. 3, 209–213 (2014)
Mtafya, A.R., Huang, D., Uwamahoro, G.: On objective keywords extraction: Tf-Idf based forward words pruning algorithm for keywords extraction on youtube. Int. J. Multimedia Ubiquitous Eng. 9(12), 97–106 (2014)
Barlow, M.: Real-Time Big Data Analytics: Emerging Architecture. O’Really Media (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Melgarejo Galvan, A.R., Clavo Navarro, K.R. (2017). Big Data Architecture for Predicting Churn Risk in Mobile Phone Companies. In: Lossio-Ventura, J., Alatrista-Salas, H. (eds) Information Management and Big Data. SIMBig SIMBig 2015 2016. Communications in Computer and Information Science, vol 656. Springer, Cham. https://doi.org/10.1007/978-3-319-55209-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-55209-5_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-55208-8
Online ISBN: 978-3-319-55209-5
eBook Packages: Computer ScienceComputer Science (R0)