Abstract
Business Intelligence is a key technology that supports effective decision making by extracting relevant information and providing adapted systems such as Data Warehouses. To access decisional information, decision makers should be able to express their requirements through Natural Language interfaces without technical skills and without the intervention of an IT designer. These requirements are often expressed as WH-questions (What, Who, Where, etc.) or keyword-like questions. In this paper, we focus on Why-Questions asked in a Business Intelligence context. This type of question has not been well addressed in the literature in terms of the answers produced. Indeed, answering it requires providing explanations. These explanations are determined by identifying causal relationships between the phenomenon highlighted in the Why-Question and the factors that can influence it. In this context, we propose an approach with which a system can address the causality problem involved in answering a decisional Why-Question. To validate our approach, a tool called BI Why Q/A was developed. To evaluate our proposal in terms of efficiency and relevance, a set of experimental studies is carried out and presented.
Data availability
Enquiries about data availability should be directed to the authors.
Notes
This depends on the periodic supplying of the DW as well as on the exploitation of new external data sources.
A Causal Bayesian Network (CBN) requires an initial causal network (variables and value parameters) defined by a domain expert [45]. Unfortunately, the CBN cannot be adapted to our context.
Since the data available in the DW cover the period [2012, 2017], we extract the average temperature recorded for the sales territories over this period.
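The extraction described in this note can be sketched as follows. This is only an illustration under stated assumptions: the record layout, the territory names, the temperature values, and the function name `average_temperature_by_territory` are all hypothetical and are not taken from the article or its tool.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical external temperature records: (territory, year, temperature in °C).
# The values are invented for illustration only.
RECORDS = [
    ("North", 2011, 9.0),
    ("North", 2012, 10.0),
    ("North", 2017, 12.0),
    ("South", 2013, 20.0),
    ("South", 2016, 22.0),
    ("South", 2018, 25.0),
]


def average_temperature_by_territory(records, start_year=2012, end_year=2017):
    """Average the temperature per territory over [start_year, end_year], inclusive."""
    by_territory = defaultdict(list)
    for territory, year, temperature in records:
        # Keep only the records that fall inside the period covered by the DW.
        if start_year <= year <= end_year:
            by_territory[territory].append(temperature)
    return {territory: mean(values) for territory, values in by_territory.items()}


averages = average_temperature_by_territory(RECORDS)
```

Restricting the external source to the period covered by the DW keeps the two series comparable; records outside [2012, 2017] are simply ignored in this sketch.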
References
Kuchmann-Beauger, N.: Question answering system in a business intelligence context. Ph.D. dissertation, Ecole Centrale Paris (2013)
Naeem, M.A., Ullah, S., Bajwa, I.S.: Interacting with data warehouse by using a natural language interface. In: Natural language processing and information systems: 17th international conference on applications of natural language to information systems, NLDB 2012, Groningen, The Netherlands, June 26–28, 2012. Proceedings 17. Springer, pp. 372–377 (2012)
Popowich, F., Mosny, M., Lindberg, D.: Interactive natural language query construction for report generation. In: Proceedings of the seventh international natural language generation conference. association for computational linguistics, pp. 115–119 (2012)
Saias, J., Quaresma, P., Salgueiro, P., Santos, T.: Binli: an ontology-based natural language interface for multidimensional data analysis. Intell. Inf. Manag. 4(5), 225–230 (2012)
Kuchmann-Beauger, N., Aufaure, M.-A.: A natural language interface for data warehouse question answering. In: Natural language processing and information systems: 16th international conference on applications of natural language to information systems, NLDB 2011, Alicante, Spain, June 28–30, 2011. Proceedings 16. Springer, pp. 201–208 (2011)
Sangroya, A., Saini, P., Rawat, M., Shroff, G., Anantaram, C.: Natural language business intelligence question answering through seqtoseq transfer learning. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, pp. 286–297 (2019)
Djiroun, R., Boukhalfa, K., Alimazighi, Z.: Designing data cubes in olap systems: a decision makers’ requirements-based approach. Clust. Comput. 22, 783–803 (2019)
Imhoff, C., White, C.: Self-service business intelligence. Empowering Users to Generate Insights, TDWI Best practices report, TDWI, Renton, WA (2011)
Vashisht, V., Dharia, P.: Integrating chatbot application with qlik sense business intelligence (BI) tool using natural language processing (NLP). In: Micro-electronics and telecommunication engineering. Springer, Singapore, pp. 683–692 (2020)
Al-Hawawreh, M., Aljuhani, A., Jararweh, Y.: ChatGPT for cybersecurity: practical applications, challenges, and future directions. Clust. Comput. 26(6), 3421–3436 (2023)
Asghar, N.: Automatic extraction of causal relations from natural language texts: a comprehensive survey. arXiv:1605.07895 (2016)
Mani, S., Cooper, G.F.: Causal discovery using a Bayesian local causal discovery algorithm. In: Medinfo, pp. 731–735 (2004)
Hassani, H., Huang, X., Ghodsi, M.: Big data and causality. Ann. Data Sci. 5, 133–156 (2018)
Girju, R.: Automatic detection of causal relations for question answering. In: Proceedings of the ACL 2003 workshop on multilingual summarization and question answering-Volume 12, Association for Computational Linguistics, pp. 76–83 (2003)
Verberne, S.: Developing an approach for why-question answering. In: Proceedings of the eleventh conference of the European chapter of the association for computational linguistics: student research workshop. Association for Computational Linguistics, pp. 39–46 (2006)
Verberne, S.: Paragraph retrieval for why-question answering. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp. 922–922 (2007)
Verberne, S., Boves, L., Oostdijk, N., Coppen, P.-A.: Evaluating discourse-based answer extraction for why-question answering. In: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp. 735–736 (2007)
Moriceau, V., Tannier, X., Falco, M.: Une étude des questions “complexes” en question-réponse. In: Actes de la Conférence Traitement Automatique des Langues Naturelles (TALN 2010, article court), Montréal, Canada (2010)
Oh, J.-H., Torisawa, K., Hashimoto, C., Kawada, T., De Saeger, S., Kazama, J., Wang, Y.: Why question answering using sentiment analysis and word classes. In: Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning. Association for Computational Linguistics, pp. 368–378 (2012)
Baral, C., Ha Vo, N., Liang, S.: Answering why and how questions with respect to a frame-based knowledge base: a preliminary report. In: LIPIcs-Leibniz international proceedings in informatics, vol. 17. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik (2012)
Oh, J.-H., Torisawa, K., Hashimoto, C., Sano, M., De Saeger, S., Ohtake, K.: Why-question answering using intra-and inter-sentential causal relations. In ACL (1), pp. 1733–1743 (2013)
Oh, J.-H., Torisawa, K., Kruengkrai, C., Iida, R., Kloetzer, J.: Multi-column convolutional neural networks with causality-attention for why-question answering. In: Proceedings of the tenth ACM international conference on web search and data mining. ACM, pp. 415–424 (2017)
Pechsiri, C.: Explanation based why question answering system. In: Proceedings of the international multiconference of engineers and computer scientists, vol. 1 (2016)
Higashinaka, R., Isozaki, H.: Automatically acquiring causal expression patterns from relation-annotated corpora to improve question answering for why-questions. ACM Trans. Asian Lang. Inf. Process. (TALIP) 7(2), 6 (2008)
Tirunagari, S.: Data mining of causal relations from text: analysing maritime accident investigation reports. arXiv:1507.02447 (2015)
Sharp, R., Surdeanu, M., Jansen, P., Clark, P., Hammond, M.: Creating causal embeddings for question answering with minimal supervision. arXiv:1609.08097 (2016)
Vazquez-Reyes, S., Black, W.J.: Evaluating causal questions for question answering. In Computer Science, 2008. ENC’08. Mexican International Conference on. IEEE, pp. 132–142 (2008)
Blanco, E., Castell, N., Moldovan, D.I.: Causal relation extraction. In Lrec (2008)
Sorgente, A., Vettigli, G., Mele, F.: Automatic extraction of cause-effect relations in natural language text. DART@ AI* IA, vol. 2013, pp. 37–48 (2013)
Ittoo, A., Bouma, G.: Extracting explicit and implicit causal relations from sparse, domain-specific texts. In International conference on application of natural language to information systems. Springer, pp. 52–63 (2011)
Khoo, C., Chan, S., Niu, Y., Ang, A.: A method for extracting causal knowledge from textual databases. Singap. J. Library Inf. manag. 28, 48–63 (1999)
Do, Q.X., Chan, Y.S., Roth, D.: Minimally supervised event causality identification. In: Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, pp. 294–303 (2011)
Khoo, C.S., Chan, S., Niu, Y.: Extracting causal knowledge from a medical database using graphical patterns. In: Proceedings of the 38th annual meeting on association for computational linguistics. Association for Computational Linguistics, pp. 336–343 (2000)
Guessoum, M.A., Djiroun, R., Boukhalfa, K.: Dealing with decisional natural language why-question in business intelligence. In 8th international conference on information and communication systems (ICICS). IEEE, pp. 52–57 (2017)
Guessoum, M., Djiroun, R., Boukhalfa, K.: Causality analysis method and model related to why-question answering in business intelligence context. In: International conference on computing systems and applications. Springer, pp. 15–26 (2022)
Azmi, A.M., Alshenaifi, N.A.: Lemaza: an arabic why-question answering system. Nat. Lang. Eng. 23(6), 877–903 (2017)
Jin, Z., Li, J., Liu, L., Le, T.D., Sun, B., Wang, R.: Discovery of causal rules using partial association. In: Data Mining (ICDM), 2012 IEEE 12th International Conference on. IEEE, pp. 309–318 (2012)
Scherp, A., Franz, T., Saathoff, C., Staab, S.: F–a model of events based on the foundational ontology dolce+ dns ultralight. In: Proceedings of the fifth international conference on Knowledge capture. ACM, pp. 137–144 (2009)
Katell, M.: Méthode d’analyse de données en régression non linéaire, pp. 7–8. Hall, Hoboken (2013)
Hobbs, J.R.: Toward a useful concept of causality for lexical semantics. J. Semant. 22(2), 181–209 (2005)
Kim, J.: Causes and counterfactuals. J. Philos. 70(17), 570–572 (1974)
Guessoum, M.A., Djiroun, R., Boukhalfa, K., Benkhelifa, E.: Natural language why-question in business intelligence applications: model and recommendation approach. Clust. Comput. 25, 1–24 (2022)
Guessoum, M.A., Djiroun, R., Boukhalfa, K.: Towards decisional natural language why-question recommendation approach in business intelligence context. In: 2019 International conference on networking and advanced systems (ICNAS). IEEE, pp. 1–6 (2019)
Granger, C.W.: Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37, 424–438 (1969)
Pearl, J.: Causal inference. Causality: objectives and assessment, pp. 39–58 (2010)
Karimi, K., Hamilton, H.J.: Timesleuth: a tool for discovering causal and temporal rules. In Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings. 14th IEEE International Conference on. IEEE, pp. 375–380 (2002)
Shrestha, M.B., Bhatta, G.R.: Selecting appropriate methodological framework for time series data analysis. J. Fin. Data Sci. 4(2), 71–89 (2018)
Cochran, W.G.: Approximate significance levels of the Behrens–Fisher test. Biometrics 20(1), 191–195 (1964)
Liew, V.K.-S.: Which lag length selection criteria should we employ? Econ. Bull. 3(33), 1–9 (2004)
Wang, R., Liu, J., Zhang, Q., Fu, C., Hou, Y.: Federated learning for feature-fusion based requirement classification. Clust. Comput. (2023). https://doi.org/10.1007/s10586-023-04147-y
Funding
The authors have not disclosed any funding.
Author information
Contributions
All authors wrote and contributed to this manuscript. All authors reviewed the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Djiroun, R., Guessoum, M.A., Boukhalfa, K. et al. Natural language why-question answering system in business intelligence context. Cluster Comput 27, 11039–11067 (2024). https://doi.org/10.1007/s10586-024-04327-4
DOI: https://doi.org/10.1007/s10586-024-04327-4