Abstract
Enterprise data is usually stored in the form of relational databases. Question Answering systems provides an easier way so that business analysts can get data insights without struggling with the syntax of SQL. However, building a supervised machine learning based question answering system is a challenging task involving large manual annotations for a specific domain. In this paper we explore the problem of transfer learning for neural sequence taggers, where a source task with plentiful annotations (e.g., Training samples (NL questions) on IT enetrprize domain) is used to improve performance on a target task with fewer available annotations (e.g., Training samples (NL questions) on pharmaceutical domain). We examine the effects of transfer learning for deep recurrent networks across domains and show that significant improvement can often be obtained. Our question answering framework is based on a set of machine learning models that create an intermediate sketch from a natural language query. Using the intermediate sketch, we generate a final database query over a large knowledge graph. Our framework supports multiple queries such as aggregation, self joins, factoid and transnational.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.P.: Natural language processing (almost) from scratch. CoRR, vol. abs/1103.0398 (2011). http://arxiv.org/abs/1103.0398
Jadeja, M., Varia, N.: Perspectives for evaluating conversational AI. CoRR, vol. abs/1709.04734 (2017)
Sadun, E., Sande, S.: Talking to Siri: Learning the Language of Apple’s Intelligent Assistant, 2nd edn. Que Publishing Company, Indianapolis (2013)
Ehrenbrink, P., Osman, S., Möller, S.: Google now is for the extraverted, Cortana for the introverted: investigating the influence of personality on IPA preference. In: Proceedings of the 29th Australian Conference on Computer-Human Interaction, ser. OZCHI 2017, pp. 257–265. ACM, New York (2017). https://doi.org/10.1145/3152771.3152799
Parks, M.: Microsoft Business Intelligence. POWER BI. CreateSpace Independent Publishing Platform, USA (2014)
Popescu, A.-M., Etzioni, O., Kautz, H.: Towards a theory of natural language interfaces to databases. In: Proceedings of the 8th International Conference on Intelligent User Interfaces, ser. IUI 2003, pp. 149–157. ACM, New York (2003). https://doi.org/10.1145/604045.604070
Xu, X., Liu, C., Song, D.: SQLNet: generating structured queries from natural language without reinforcement learning (2017)
Yaghmazadeh, N., Wang, Y., Dillig, I., Dillig, T.: SQLizer: query synthesis from natural language. Proceedings of ACM Programming Languages, vol. 1, no. OOPSLA, pp. 63:1–63:26, October 2017. https://doi.org/10.1145/3133887
Zhong, V., Xiong, C., Socher, R.: Seq2SQL: generating structured queries from natural language using reinforcement learning (2017)
Yin, P., Lu, Z., Li, H., Kao, B.: Neural enquirer: Learning to query tables with natural language (2015)
Pasupat, P., Liang, P.: Compositional semantic parsing on semi-structured tables (2015)
Li, F., Jagadish, H.V.: Constructing an interactive natural language interface for relational databases. In: Proceedings of VLDB Endowment, vol. 8, no. 1, pp. 73–84, September 2014. https://doi.org/10.14778/2735461.2735468
Yu, T., Li, Z., Zhang, Z., Zhang, R., Radev, D.: TypeSQL: knowledge-based type-aware neural text-to-SQL generation. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), pp. 588–594. Association for Computational Linguistics (2018). http://aclweb.org/anthology/N18-2093
Dong, L., Lapata, M.: Coarse-to-fine decoding for neural semantic parsing. CoRR, vol. abs/1805.04793 (2018)
Yavuz, S., Gur, I., Su, Y., Yan, X.: DialSQL: dialogue based structured query generation. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, 15–20 July 2018, Volume 1: Long Papers, pp. 1339–1349 (2018). https://aclanthology.info/papers/P18-1124/p18-1124
Yang, Z., Salakhutdinov, R., Cohen, W.W.: Transfer learning for sequence tagging with hierarchical recurrent networks. CoRR, vol. abs/1703.06345 (2017). http://arxiv.org/abs/1703.06345
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Sangroya, A., Saini, P., Rawat, M., Shroff, G., Anantaram, C. (2019). Natural Language Business Intelligence Question Answering Through SeqtoSeq Transfer Learning. In: U., L., Lauw, H. (eds) Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2019. Lecture Notes in Computer Science(), vol 11607. Springer, Cham. https://doi.org/10.1007/978-3-030-26142-9_25
Download citation
DOI: https://doi.org/10.1007/978-3-030-26142-9_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26141-2
Online ISBN: 978-3-030-26142-9
eBook Packages: Computer ScienceComputer Science (R0)