skip to main content
10.1145/3373477.3373478acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaissConference Proceedingsconference-collections
research-article

ISQNL: interpretable SQL query synthesizer from natural language input

Published: 15 January 2020 Publication History

Abstract

Databases serve as the forefront for most systems today. Structured query language (SQL) is used to access and manipulate the data stored in a relational database. However, most end users have limited knowledge of SQL and thus face difficulties in accessing such systems. In this paper we describe a novel system (ISQNL) to convert a query provided in Natural Language (English) to an SQL query. By applying several natural language processing techniques ISQNL achieves this conversion without the need for any elaborate schema specific training/modification during setup and is robust enough to handle dynamically changing database states or database schema. ISQNL has demonstrated remarkable accuracy in SQL query synthesis when tested on large sets of natural language input. This paper discusses the methodology and key challenges involved in building ISQNL.

References

[1]
Heimark, Karl Johan V (2013), "A Specialized Question Answering System Using Natural Language: Developed Using the Quepy Framework" International Journal of Computer Applications pp (0975 -- 8887)
[2]
Yaghmazadeh, N., Wang, Y., Dillig, I., & Dillig, T. (2017). Type- and Content-Driven Synthesis of SQL Queries from Natural Language. CoRR, abs/1702.01168.
[3]
Giordani, Alessandra. (2008). "Mapping Natural Language into SQL in a NLIDB". Natural Language and Information Systems: 13th International Conference on Applications of Natural Language to Information Systems, NLDB 2008 London, UK, June 24--27, 2008 Proceedings (pp.367--371)
[4]
S. M. Sarwar and Abdullah-Al-Mamun, (2016) "Next word prediction for phonetic typing by grouping language models," 2nd International Conference on Information Management (ICIM), London, 2016, pp. 73--76.
[5]
Gupta, Ankush, Arvind Agarwal, Prawaan Singh and Piyush Rai. (2018) "A Deep Generative Framework for Paraphrase Generation." AAAI. pp23--27
[6]
Zeng, Daojian and Zhang, Haoran and Xiang, Lingyun and Wang, Jin and Ji, GuoLiang. (2019) User-Oriented Paraphrase Generation with Keywords Controlled Network IEEE Access 1-1
[7]
D. Chesworth, N. Harmon, L. Tanner, S. Guerlain and M. Balazs, (2016) "Named-entity recognition and data visualization techniques to communicate mission command to autonomous systems," IEEE Systems and Information Engineering Design Symposium (SIEDS), Charlottesville, VA, pp. 233--238.
[8]
Aung, Sint & Wai, Myat. (2018). Domain Independent Feature Extraction using Rule Based Approach. Advances in Science, Technology and Engineering Systems Journal. 3. pp218--224.
[9]
Luong, Thai-Le & Son, Cao & Le, Duc-Thang & Phan, Xuan-Hieu. (2017). Intent extraction from social media texts using sequential segmentation and deep learning models. 215--220.
[10]
Glater, Rafael & L.T. Santos, Rodrygo & Ziviani, Nivio. (2017). Intent-Aware Semantic Query Annotation. pp485--494.
[11]
Chen, Hong-You and Lee, Cheng-Syuan and Liao, Keng Te and Lin, Shou-de. (2018) "Word Relation Autoencoder for Unseen Hypemym Extraction Using Word Embeddings", Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing pp 22--28
[12]
Grefenstette, Gregory. (2015) Simple Hypernym Extraction Methods. INRIASAC pp 19--24
[13]
Ngo, V.M., Cao, T.H., & Le, T.M. (2018). WordNet-Based Information Retrieval Using Common Hypernyms and Combined Features. CoRR, abs/1807.05574. S. Padmini, P. A. Patii, S. University and D. Y. Patil, (2017) "Building RMS for a RD using Wordnet," 3rd International Conference on Applied and Theoretical Computing and Communication Technology (iCATccT), Tumkur, pp. 304--309
[14]
Yu, Qian and Wai Lam. (2018) "Product Question Intent Detection using Indicative Clause Attention and Adversarial Learning." ICTIR. pp75--82
[15]
D. Chesworth, N. Harmon, L. Tanner, S. Guerlain and M. Balazs, (2016) "Named-entity recognition and data visualization techniques to communicate mission command to autonomous systems," IEEE Systems and Information Engineering Design Symposium (SIEDS), Charlottesville, VA, pp. 233--238.
[16]
Martínez F., José & Carpio, Martin & Gaspar H., Juana (2018). "Translation of Natural Language Queries to SQL that Involve Aggregate Functions, Grouping and Subqueries for a Natural Language Interface to Databases." Studies in Computational Intelligence
[17]
hu, Xin and Yao, Yingting and Ye, Luting and Dang, Depeng. (2017). "Natural Language Aggregate Query over RDF Data", Information Sciences
[18]
Huang, Po-Sen & Wang, Chenglong & Singh, Rishabh & Yih, Wen-tau & He, Xiaodong. (2018). "Natural Language to Structured Query Generation via Meta-Learning." Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 pp.732--738.
[19]
Zhong, V., Xiong, C., & Socher, R. (2018). "Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning". ArXiv, abs/1709.00103.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
AISS '19: Proceedings of the 1st International Conference on Advanced Information Science and System
November 2019
253 pages
ISBN:9781450372916
DOI:10.1145/3373477
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 January 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. N grams
  2. SQL query
  3. entity recognition
  4. intent recognition
  5. natural language processing
  6. natural language query
  7. tokenization parts of speech tagging
  8. wordnets

Qualifiers

  • Research-article

Conference

AISS 2019

Acceptance Rates

AISS '19 Paper Acceptance Rate 41 of 95 submissions, 43%;
Overall Acceptance Rate 41 of 95 submissions, 43%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 138
    Total Downloads
  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media