DOI: 10.1145/3430984.3431011
research-article

Bootstrapping Chatbot Interfaces to Databases

Published: 02 January 2021

Abstract

In this paper, we present an automated technique for creating a chatbot interface to a database. To the best of our knowledge, our technique is the first for bootstrapping chatbots for question answering over relational databases. Our method leverages the natural language classifiers available in industrial chatbot platforms to translate natural language into structured queries. Using our framework, usable chatbots can be created from a database in a short time. We instantiated several chatbots in different domains and demonstrate their usefulness.
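As a rough illustration of the idea in the abstract, the sketch below derives intents, template utterances, and parameterized SQL from a schema, then routes a question through a classifier to a query. It is not the paper's implementation: the schema, the intent naming scheme, the utterance templates, and the token-overlap `classify` function (a stand-in for a chatbot platform's natural language classifier) are all assumptions made for this example, and the entity value is passed in explicitly rather than extracted from the utterance.

```python
import sqlite3

# Hypothetical toy schema: table name -> (key column, other columns).
SCHEMA = {"employee": ("name", ["salary", "department"])}


def bootstrap_intents(schema):
    """Derive one intent per (table, column), with template utterances and SQL."""
    intents = {}
    for table, (key, columns) in schema.items():
        for col in columns:
            intents[f"get_{table}_{col}"] = {
                "examples": [
                    f"what is the {col} of {table} X",
                    f"show me the {col} for X",
                    f"{col} of X",
                ],
                # Identifiers come from the trusted schema; the value is bound as a parameter.
                "sql": f"SELECT {col} FROM {table} WHERE {key} = ?",
            }
    return intents


def classify(utterance, intents):
    """Stand-in for a platform classifier: pick the intent sharing the most tokens."""
    tokens = set(utterance.lower().split())

    def overlap(name):
        return max(
            len(tokens & set(example.lower().split()))
            for example in intents[name]["examples"]
        )

    return max(intents, key=overlap)


def answer(conn, utterance, entity, intents):
    """Map the utterance to an intent, then run that intent's SQL for the entity."""
    intent = classify(utterance, intents)
    rows = conn.execute(intents[intent]["sql"], (entity,)).fetchall()
    return intent, rows


def demo():
    """Build an in-memory database and answer one question against it."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE employee (name TEXT, salary INTEGER, department TEXT)")
    conn.execute("INSERT INTO employee VALUES ('Asha', 90000, 'Research')")
    intents = bootstrap_intents(SCHEMA)
    return answer(conn, "what is the salary of employee Asha", "Asha", intents)


if __name__ == "__main__":
    print(demo())
```

In a real deployment the generated example utterances would presumably be uploaded as training data to the chatbot platform, and the platform's own classifier and entity recognizer would replace `classify` and the explicit `entity` argument.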

Published In

CODS-COMAD '21: Proceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)
January 2021
453 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

CODS COMAD 2021
CODS COMAD 2021: 8th ACM IKDD CODS and 26th COMAD
January 2 - 4, 2021
Bangalore, India

Acceptance Rates

Overall Acceptance Rate 197 of 680 submissions, 29%

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Total Citations: 0
  • Total Downloads: 144
  • Downloads (Last 12 months): 19
  • Downloads (Last 6 weeks): 0
Reflects downloads up to 03 Mar 2025
