Abstract
Organizations are investing in Big Data and Machine Learning (ML) projects, but most of these projects are predicted to fail. A study shows that one of the biggest obstacles is the lack of understanding of how to use data analytics to improve business value. This paper presents Metis, a method for ensuring that business goals and the corresponding business problems are explicitly traceable to ML projects and where potential (i.e., hypothesized) complex problems can be properly validated before investing in costly solutions. Using this method, business goals are captured to provide context for hypothesizing business problems, which can be further refined into more detailed problems to identify features of data that are suitable for ML. A Supervised ML algorithm is then used to generate a prediction model that captures the underlying patterns and insights about the business problems in the data. An ML Explainability model is used to extract from the prediction model the individual features and their degree of contribution to each problem. The extracted weighted data feature are then fed back to the goal-oriented problem model to validate the most important business problems. Our experiment results show that Metis can detect the most influential problem when it was not apparent through data analysis. Metis is illustrated using a real-world customer churn (customer attrition) problem for a bank and a publicly available customer churn dataset.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
A Greek goddess that has been associated with prudence, wisdom, or wise counsel.
References
Asay, M.: 85% of big data projects fail, but your developers can help yours succeed. Techrepublic (2017)
LaValle, S., Lesser, E., Shockley, R., Hopkins, M.S., Kruschwitz, N.: Big data, analytics and the path from insights to value. MIT Sloan Manag. Rev. 52(2), 21–32 (2011)
NewVantage Partners LLC: Big data and AI executive survey 2020: data-driven business transformation connecting data/AI investment to business outcomes (2020)
Zhou, L., Pan, S., Wang, J., Vasilakos, A.V.: Machine learning on big data: opportunities and challenges. Neurocomputing 237, 350–361 (2017)
Tatter, P.: gpk: 100 data sets for statistics education. R package v. 1.0. (2013)
Supakkul, S., Zhao, L., Chung, L.: GOMA: supporting big data analytics with a goal-oriented approach. In: 2016 IEEE International Congress on Big Data (BigData Congress), pp. 149–156, June 2016
Berry, M.J., Linoff, G.S.: Data Mining Techniques: for Marketing, Sales, and Customer Relationship Management. Wiley, Hoboken (2004)
Supakkul, S.: Capturing, organizing, and reusing knowledge of non-functional requirements: an NFR pattern approach. Ph.D. thesis, University of Texas at Dallas (2010)
Chung, L., Nixon, B.A., Yu, E., Mylopoulos, J.: Non-Functional Requirements in Software Engineering, vol. 5. Springer, Cham (2012)
Supakkul, S., Chung, L.: Extending problem frames to deal with stakeholder problems: an agent-and goal-oriented approach. In: Proceedings of the 2009 ACM Symposium on Applied Computing, pp. 389–394 (2009)
Mylopoulos, J., Chung, L., Nixon, B.: Representing and using nonfunctional requirements: a process-oriented approach. IEEE Trans. Softw. Eng. 18(6), 483–497 (1992)
Wang, W., Chen, L., Thirunarayan, K., Sheth, A.P.: Harnessing Twitter ‘big data’ for automatic emotion identification. In: 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Conference on Social Computing, pp. 587–592. IEEE (2012)
Lundberg, S.M., et al.: From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2(1), 56–67 (2020)
Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Hassine, J., Amyot, D.: A questionnaire-based survey methodology for systematically validating goal-oriented models. Requirements Eng. 21(2), 285–308 (2015). https://doi.org/10.1007/s00766-015-0221-7
Cohen, J.: The earth is round (p\(<\).05). Am. Psychol. 49, 997–1003 (1994)
Berkson, J.: Tests of significance considered as evidence. J. Am. Stat. Assoc. 37(219), 325–335 (1942)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Supakkul, S. et al. (2020). Validating Goal-Oriented Hypotheses of Business Problems Using Machine Learning: An Exploratory Study of Customer Churn. In: Nepal, S., Cao, W., Nasridinov, A., Bhuiyan, M.Z.A., Guo, X., Zhang, LJ. (eds) Big Data – BigData 2020. BIGDATA 2020. Lecture Notes in Computer Science(), vol 12402. Springer, Cham. https://doi.org/10.1007/978-3-030-59612-5_11
Download citation
DOI: https://doi.org/10.1007/978-3-030-59612-5_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59611-8
Online ISBN: 978-3-030-59612-5
eBook Packages: Computer ScienceComputer Science (R0)