Skip to main content

Validating Goal-Oriented Hypotheses of Business Problems Using Machine Learning: An Exploratory Study of Customer Churn

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12402))

Abstract

Organizations are investing in Big Data and Machine Learning (ML) projects, but most of these projects are predicted to fail. A study shows that one of the biggest obstacles is the lack of understanding of how to use data analytics to improve business value. This paper presents Metis, a method for ensuring that business goals and the corresponding business problems are explicitly traceable to ML projects and where potential (i.e., hypothesized) complex problems can be properly validated before investing in costly solutions. Using this method, business goals are captured to provide context for hypothesizing business problems, which can be further refined into more detailed problems to identify features of data that are suitable for ML. A Supervised ML algorithm is then used to generate a prediction model that captures the underlying patterns and insights about the business problems in the data. An ML Explainability model is used to extract from the prediction model the individual features and their degree of contribution to each problem. The extracted weighted data feature are then fed back to the goal-oriented problem model to validate the most important business problems. Our experiment results show that Metis can detect the most influential problem when it was not apparent through data analysis. Metis is illustrated using a real-world customer churn (customer attrition) problem for a bank and a publicly available customer churn dataset.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    A Greek goddess that has been associated with prudence, wisdom, or wise counsel.

References

  1. Asay, M.: 85% of big data projects fail, but your developers can help yours succeed. Techrepublic (2017)

    Google Scholar 

  2. LaValle, S., Lesser, E., Shockley, R., Hopkins, M.S., Kruschwitz, N.: Big data, analytics and the path from insights to value. MIT Sloan Manag. Rev. 52(2), 21–32 (2011)

    Google Scholar 

  3. NewVantage Partners LLC: Big data and AI executive survey 2020: data-driven business transformation connecting data/AI investment to business outcomes (2020)

    Google Scholar 

  4. Zhou, L., Pan, S., Wang, J., Vasilakos, A.V.: Machine learning on big data: opportunities and challenges. Neurocomputing 237, 350–361 (2017)

    Article  Google Scholar 

  5. Tatter, P.: gpk: 100 data sets for statistics education. R package v. 1.0. (2013)

    Google Scholar 

  6. Supakkul, S., Zhao, L., Chung, L.: GOMA: supporting big data analytics with a goal-oriented approach. In: 2016 IEEE International Congress on Big Data (BigData Congress), pp. 149–156, June 2016

    Google Scholar 

  7. Berry, M.J., Linoff, G.S.: Data Mining Techniques: for Marketing, Sales, and Customer Relationship Management. Wiley, Hoboken (2004)

    Google Scholar 

  8. Supakkul, S.: Capturing, organizing, and reusing knowledge of non-functional requirements: an NFR pattern approach. Ph.D. thesis, University of Texas at Dallas (2010)

    Google Scholar 

  9. Chung, L., Nixon, B.A., Yu, E., Mylopoulos, J.: Non-Functional Requirements in Software Engineering, vol. 5. Springer, Cham (2012)

    MATH  Google Scholar 

  10. Supakkul, S., Chung, L.: Extending problem frames to deal with stakeholder problems: an agent-and goal-oriented approach. In: Proceedings of the 2009 ACM Symposium on Applied Computing, pp. 389–394 (2009)

    Google Scholar 

  11. Mylopoulos, J., Chung, L., Nixon, B.: Representing and using nonfunctional requirements: a process-oriented approach. IEEE Trans. Softw. Eng. 18(6), 483–497 (1992)

    Article  Google Scholar 

  12. Wang, W., Chen, L., Thirunarayan, K., Sheth, A.P.: Harnessing Twitter ‘big data’ for automatic emotion identification. In: 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Conference on Social Computing, pp. 587–592. IEEE (2012)

    Google Scholar 

  13. Lundberg, S.M., et al.: From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2(1), 56–67 (2020)

    Article  Google Scholar 

  14. Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  15. Hassine, J., Amyot, D.: A questionnaire-based survey methodology for systematically validating goal-oriented models. Requirements Eng. 21(2), 285–308 (2015). https://doi.org/10.1007/s00766-015-0221-7

    Article  Google Scholar 

  16. Cohen, J.: The earth is round (p\(<\).05). Am. Psychol. 49, 997–1003 (1994)

    Article  Google Scholar 

  17. Berkson, J.: Tests of significance considered as evidence. J. Am. Stat. Assoc. 37(219), 325–335 (1942)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sam Supakkul .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Supakkul, S. et al. (2020). Validating Goal-Oriented Hypotheses of Business Problems Using Machine Learning: An Exploratory Study of Customer Churn. In: Nepal, S., Cao, W., Nasridinov, A., Bhuiyan, M.Z.A., Guo, X., Zhang, LJ. (eds) Big Data – BigData 2020. BIGDATA 2020. Lecture Notes in Computer Science(), vol 12402. Springer, Cham. https://doi.org/10.1007/978-3-030-59612-5_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-59612-5_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-59611-8

  • Online ISBN: 978-3-030-59612-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics