Skip to main content

Big Data Architecture for Predicting Churn Risk in Mobile Phone Companies

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 656))

Abstract

Nowadays in Peru, mobile phone companies have been affected by the problem of mobile number portability because since July 2014 customers can change their mobile operator in just 24 h. Companies look for solutions through the analysis of historical data of their customers in order to generate predictive models and to identify which customers would leave the company. However, the current way how this prediction is performed is too slow. In this paper, we show a Big Data architecture which solves the problems of the “classic architecture” using data from social networks in order to predict which customers may go to the competition company, according to their opinions. Data processing is performed by Hadoop, which implements MapReduce and can process large amounts of data in parallel way. After doing the tests and seeing the results, we got a high percentage of accuracy (90.03% of success).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Barrientos, F., Ríos, S.A.: Aplicación de Minería de Datos para Predecir Fuga de Clientes en la Industria de las Telecomunicaciones. In: Revista Ingeniería de Sistemas, vol. XXVII, pp. 73–107 (2013)

    Google Scholar 

  2. Organismo Supervisor de Inversion Privada en Telecomunicaciones (OSIPTEL): Estado de la portabilidad numérica en el primer trimestre del 2015, Perú (2015)

    Google Scholar 

  3. Kirui, C., Kirui, H., Hong, L., Cheruiyot, W.: Predicting customer churn in mobile telephony industry using probabilistic classifiers in data mining. Int. J. Comput. Sci. Issues 10(1), 165–172 (2013)

    Google Scholar 

  4. Pérez Villanueva, P.A.: Modelo de Predicción de Fuga de Cliente de Telefonía Móvil Post Pago. In: Memoria para optar al Título de Ingeniero Civil Industrial. Departamento de Ingenieria Industrial, Universidad de Chile, Chile (2014)

    Google Scholar 

  5. Kohavi, R.: a study of cross-validation and bootstrap for accuracy estimation and model selection. In: 14th International Joint Conference on Artificial Intelligence, vol. 2, pp. 1137–1143 (1995)

    Google Scholar 

  6. Morales, G.D.F., Bifet, A.: SAMOA: scalable advanced massive online analysis. J. Mach. Learn. Res. 16(16), 149–153 (2015)

    Google Scholar 

  7. Vasuki, M., Arthi, J., Kayalvizhi, K.: Decision making using sentiment analysis from twitter. Int. J. Innovative Res. Comput. Commun. Eng. 2(12), 71–77 (2014)

    Google Scholar 

  8. Kashyap, K., Deka, C., Rakshit, S.: A review on big data, hadoop and its impact on business. Int. J. Innovatite Res. Dev. 3(12), 78–82 (2014)

    Google Scholar 

  9. Gajjar, D.: Implementing the Naive Bayes classifier in Mahout. J. Emerg. Technol. Innovative Res. 1(6), 449–454 (2014)

    Google Scholar 

  10. Mukhopadhyay, D., Agrawal, C., Maru, D., Yedale, P., Gadekar, P.: Addressing namenode scalability issue in hadoop distributed file system using cache approach. In: 2014 International Conference on Information Technology, Bhubaneswar, India, pp. 321–326 (2014)

    Google Scholar 

  11. Kumar, R., Gupta, N., Charu, S., Bansal, S., Yadav, K.: Comparison of SQL with HiveQL. Int. J. Res. Technol. Stud. 1(9), 28–30 (2014)

    Google Scholar 

  12. Banaei, S.M., Moghaddam, H.K.: Hadoop and its roles in modern image processing. Open J. Marine Sci. 4(4), 239–245 (2014)

    Article  Google Scholar 

  13. Dutta, K., Jayapal, M.: Big data analytics for real time systems. In: Big Data Analytics Seminar, pp. 1–13 (2015)

    Google Scholar 

  14. Mangal, S.B., Goyal, V.: Text news classification system using Nave Bayes classifier. Int. J. Eng. Sci. 3, 209–213 (2014)

    Google Scholar 

  15. Mtafya, A.R., Huang, D., Uwamahoro, G.: On objective keywords extraction: Tf-Idf based forward words pruning algorithm for keywords extraction on youtube. Int. J. Multimedia Ubiquitous Eng. 9(12), 97–106 (2014)

    Article  Google Scholar 

  16. Barlow, M.: Real-Time Big Data Analytics: Emerging Architecture. O’Really Media (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alonso Raul Melgarejo Galvan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Melgarejo Galvan, A.R., Clavo Navarro, K.R. (2017). Big Data Architecture for Predicting Churn Risk in Mobile Phone Companies. In: Lossio-Ventura, J., Alatrista-Salas, H. (eds) Information Management and Big Data. SIMBig SIMBig 2015 2016. Communications in Computer and Information Science, vol 656. Springer, Cham. https://doi.org/10.1007/978-3-319-55209-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-55209-5_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-55208-8

  • Online ISBN: 978-3-319-55209-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics