Big Data Architecture for Predicting Churn Risk in Mobile Phone Companies

Melgarejo Galvan, Alonso Raul; Clavo Navarro, Katerine Rocio

doi:10.1007/978-3-319-55209-5_10

Big Data Architecture for Predicting Churn Risk in Mobile Phone Companies

Alonso Raul Melgarejo Galvan¹² &
Katerine Rocio Clavo Navarro¹²

Conference paper
First Online: 08 March 2017

570 Accesses
1 Citations

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 656))

Abstract

Nowadays in Peru, mobile phone companies have been affected by the problem of mobile number portability because since July 2014 customers can change their mobile operator in just 24 h. Companies look for solutions through the analysis of historical data of their customers in order to generate predictive models and to identify which customers would leave the company. However, the current way how this prediction is performed is too slow. In this paper, we show a Big Data architecture which solves the problems of the “classic architecture” using data from social networks in order to predict which customers may go to the competition company, according to their opinions. Data processing is performed by Hadoop, which implements MapReduce and can process large amounts of data in parallel way. After doing the tests and seeing the results, we got a high percentage of accuracy (90.03% of success).

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Barrientos, F., Ríos, S.A.: Aplicación de Minería de Datos para Predecir Fuga de Clientes en la Industria de las Telecomunicaciones. In: Revista Ingeniería de Sistemas, vol. XXVII, pp. 73–107 (2013)
Google Scholar
Organismo Supervisor de Inversion Privada en Telecomunicaciones (OSIPTEL): Estado de la portabilidad numérica en el primer trimestre del 2015, Perú (2015)
Google Scholar
Kirui, C., Kirui, H., Hong, L., Cheruiyot, W.: Predicting customer churn in mobile telephony industry using probabilistic classifiers in data mining. Int. J. Comput. Sci. Issues 10(1), 165–172 (2013)
Google Scholar
Pérez Villanueva, P.A.: Modelo de Predicción de Fuga de Cliente de Telefonía Móvil Post Pago. In: Memoria para optar al Título de Ingeniero Civil Industrial. Departamento de Ingenieria Industrial, Universidad de Chile, Chile (2014)
Google Scholar
Kohavi, R.: a study of cross-validation and bootstrap for accuracy estimation and model selection. In: 14th International Joint Conference on Artificial Intelligence, vol. 2, pp. 1137–1143 (1995)
Google Scholar
Morales, G.D.F., Bifet, A.: SAMOA: scalable advanced massive online analysis. J. Mach. Learn. Res. 16(16), 149–153 (2015)
Google Scholar
Vasuki, M., Arthi, J., Kayalvizhi, K.: Decision making using sentiment analysis from twitter. Int. J. Innovative Res. Comput. Commun. Eng. 2(12), 71–77 (2014)
Google Scholar
Kashyap, K., Deka, C., Rakshit, S.: A review on big data, hadoop and its impact on business. Int. J. Innovatite Res. Dev. 3(12), 78–82 (2014)
Google Scholar
Gajjar, D.: Implementing the Naive Bayes classifier in Mahout. J. Emerg. Technol. Innovative Res. 1(6), 449–454 (2014)
Google Scholar
Mukhopadhyay, D., Agrawal, C., Maru, D., Yedale, P., Gadekar, P.: Addressing namenode scalability issue in hadoop distributed file system using cache approach. In: 2014 International Conference on Information Technology, Bhubaneswar, India, pp. 321–326 (2014)
Google Scholar
Kumar, R., Gupta, N., Charu, S., Bansal, S., Yadav, K.: Comparison of SQL with HiveQL. Int. J. Res. Technol. Stud. 1(9), 28–30 (2014)
Google Scholar
Banaei, S.M., Moghaddam, H.K.: Hadoop and its roles in modern image processing. Open J. Marine Sci. 4(4), 239–245 (2014)
Article Google Scholar
Dutta, K., Jayapal, M.: Big data analytics for real time systems. In: Big Data Analytics Seminar, pp. 1–13 (2015)
Google Scholar
Mangal, S.B., Goyal, V.: Text news classification system using Nave Bayes classifier. Int. J. Eng. Sci. 3, 209–213 (2014)
Google Scholar
Mtafya, A.R., Huang, D., Uwamahoro, G.: On objective keywords extraction: Tf-Idf based forward words pruning algorithm for keywords extraction on youtube. Int. J. Multimedia Ubiquitous Eng. 9(12), 97–106 (2014)
Article Google Scholar
Barlow, M.: Real-Time Big Data Analytics: Emerging Architecture. O’Really Media (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Systems Engineering, National University of San Marcos, Lima, Peru
Alonso Raul Melgarejo Galvan & Katerine Rocio Clavo Navarro

Authors

Alonso Raul Melgarejo Galvan
View author publications
You can also search for this author in PubMed Google Scholar
Katerine Rocio Clavo Navarro
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alonso Raul Melgarejo Galvan .

Editor information

Editors and Affiliations

College of Medicine, University of Florida, Gainesville, Florida, USA
Juan Antonio Lossio-Ventura
Faculty of Engineering, Universidad del Pacífico, Jesús María, Lima, Peru
Hugo Alatrista-Salas

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Melgarejo Galvan, A.R., Clavo Navarro, K.R. (2017). Big Data Architecture for Predicting Churn Risk in Mobile Phone Companies. In: Lossio-Ventura, J., Alatrista-Salas, H. (eds) Information Management and Big Data. SIMBig SIMBig 2015 2016. Communications in Computer and Information Science, vol 656. Springer, Cham. https://doi.org/10.1007/978-3-319-55209-5_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-55209-5_10
Published: 08 March 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-55208-8
Online ISBN: 978-3-319-55209-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics