Abstract
The purpose of Text-to-SQL is to obtain the correct answer for a textual question from the database, which can take advantage of advanced database system to provide reliable and efficient response. Existing Text-to-SQL methods generally focus on accuracy by designing complex deep neural network models, and hardly consider interpretability, which is very important for serious applications. To address this, in this paper we propose a novel framework for Interpretable Text-to-SQL Generation (ITSG) with joint optimization, which achieves state-of-the-art accuracy and possesses two-level interpretability at the same time. The framework mainly consists of three layers: a sequence encoder which encodes questions, table headers and significant table contents, an attention-based LSTM layer which generates SQL queries and a reinforcement learning layer which boosts the execution accuracy. Comparing with state-of-the-art methods on benchmark datasets, the experimental results show the effectiveness and interpretability of our ITSG framework.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Min, S., Chen, D., Hajishirzi, H., Zettlemoyer, L.: A discrete hard EM approach for weakly supervised question answering. In: 2019 Conference on Empirical Methods in Natural Language Processing & International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2851–2864. ACL (2019). https://doi.org/10.18653/v1/d19-1284
Han, Z., Jiang, X., Li, M., et al.: An integrated semantic-syntactic SBLSTM model for aspect specific opinion extraction. In: 15th International Conference of Web Information Systems and Applications, pp. 191–199. WISA (2018). https://doi.org/10.1007/978-3-030-02934-0_18
He, P., Mao, Y., Chakrabarti, K., Chen, W.: X-SQL: reinforce schema representation with context. CoRR abs/1908.08113 (2019)
Dong, L., Lapata, M.: Coarse-to-fine decoding for neural semantic parsing. In: 56th Annual Meeting of the Association for Computational Linguistics, pp. 731–742. ACL (2018). https://doi.org/10.18653/v1/p18-1068
Berant, J., Chou, A., Frostig, R., et al.: Semantic parsing on freebase from Question-Answer pairs. In: 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1533–1544. ACL (2013). Doi:10.1.1.408.319
Krishnamurthy, J., Dasigi, P., Gardner, M.: Neural semantic parsing with type constraints for semi-structured tables. In: 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1516–1526. ACL (2017). https://doi.org/10.18653/v1/d17-1160
Iyyer, M., Yih, W., Chang, M.: Search-based neural structured learning for sequential question answering. In: 2017 Annual Meeting of the Association for Computational Linguistics, pp. 1821–1831. ACL (2017). https://doi.org/10.18653/v1/p17-1167
Liang, C., Norouzi, M., Berant, J., et al.: Memory augmented policy optimization for program synthesis and semantic parsing. In: 32rd Conference on Neural Information Processing Systems (NeurIPS), pp. 10015–10027. (NIPS) (2018)
Zhong, V., Xiong, C., Socher, R.: Seq2sql: generating structured queries from natural language using reinforcement learning. CoRR, abs/1709.00103 (2017)
Wang, W., Tian, Y., Xiong, H., et al.: A transfer-learnable natural language interface for databases. CoRR, abs/1809.02649 (2018)
Wang, C., Huang, P., Polozov, A., et al.: Execution-Guided neural program decoding. CoRR, abs/1807.03100 (2018)
Clark, K., Luong, M., Le, Q., et al.: ELECTRA: pre-training text encoders as discriminators rather than generators. In: 8th International Conference on Learning Representations (ICLR) (2020)
Wang, C., Tatwawadi, K., Brockschmidt, M.: Robust Text-to-SQL generation with execution-guided decoding. CoRR, abs/ 1807.03100 (2018)
Zhou, P., Shi, W., Tian, J., et al.: Attention-based bidirectional long short-term memory networks for relation classification. In: 2016 Annual Meeting of the Association for Computational Linguistics, pp. 207–212. ACL (2016). https://doi.org/10.18653/v1/p16-2034
Schulman, J., Heess, N., Weber, T., et al.: Gradient estimation using stochastic computation graphs. In: 29th Annual Conference on Neural Information Processing Systems, pp. 3528–3536. NIPS (2015). https://doi.org/10.5555/2969442.2969633
Dahl, D., Bates, M., Brown, M., et al.: Expanding the scope of the ATIS task: the ATIS-3 corpus. In: Proceedings of the workshop on Human Language Technology(HLT), pp. 43-48. ACM (1994). https://doi.org/10.3115/1075812.1075823
Shi, T., Tatwawadi, K., Chakrabarti, K., et al.: IncSQL: training incremental Text-to-SQL parsers with non-deterministic oracles. CoRR abs/1809.05054 (2018)
Hwang, W., Yim, J., Park, S., et al.: A comprehensive exploration on WikiSQL with table-aware word contextualization. CoRR abs/1902.01069 (2019)
Acknowledgments
This work is supported by the National Natural Science Foundation of China (61802116), the Science and Technology Plan of Henan Province (192102210113, 192102210248, 202102210372).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhu, M., Wang, X., Zhang, Y. (2020). Interpretable Text-to-SQL Generation with Joint Optimization. In: Wang, G., Lin, X., Hendler, J., Song, W., Xu, Z., Liu, G. (eds) Web Information Systems and Applications. WISA 2020. Lecture Notes in Computer Science(), vol 12432. Springer, Cham. https://doi.org/10.1007/978-3-030-60029-7_32
Download citation
DOI: https://doi.org/10.1007/978-3-030-60029-7_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60028-0
Online ISBN: 978-3-030-60029-7
eBook Packages: Computer ScienceComputer Science (R0)