Interpretable Text-to-SQL Generation with Joint Optimization

  • Conference paper
Web Information Systems and Applications (WISA 2020)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12432)

Abstract

The purpose of Text-to-SQL is to obtain the correct answer to a textual question from a database, which makes it possible to take advantage of advanced database systems to provide reliable and efficient responses. Existing Text-to-SQL methods generally focus on accuracy by designing complex deep neural network models and hardly consider interpretability, which is very important for serious applications. To address this, we propose a novel framework for Interpretable Text-to-SQL Generation (ITSG) with joint optimization, which achieves state-of-the-art accuracy and possesses two-level interpretability at the same time. The framework mainly consists of three layers: a sequence encoder, which encodes questions, table headers and significant table contents; an attention-based LSTM layer, which generates SQL queries; and a reinforcement learning layer, which boosts execution accuracy. Experimental comparisons with state-of-the-art methods on benchmark datasets show the effectiveness and interpretability of our ITSG framework.
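
To make the notion of execution accuracy concrete, the following is a minimal sketch of an execution-based reward of the kind a reinforcement learning layer could optimize. It is not the ITSG implementation: the function name execution_reward, the reward values (+1, -1, -2), the toy players table and the use of SQLite are all assumptions made for illustration.

```python
import sqlite3


def execution_reward(conn, predicted_sql, gold_sql):
    """Hypothetical reward: +1.0 if both queries return the same rows,
    -1.0 if the predicted query runs but returns different rows,
    -2.0 if the predicted query cannot be executed at all."""
    try:
        predicted_rows = conn.execute(predicted_sql).fetchall()
    except sqlite3.Error:
        return -2.0
    gold_rows = conn.execute(gold_sql).fetchall()
    # Compare as multisets of rows, ignoring row order.
    same = sorted(predicted_rows, key=repr) == sorted(gold_rows, key=repr)
    return 1.0 if same else -1.0


# Toy in-memory table (assumed schema, for illustration only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE players (name TEXT, team TEXT, points INTEGER)")
conn.executemany(
    "INSERT INTO players VALUES (?, ?, ?)",
    [("Ann", "Red", 12), ("Bob", "Blue", 7), ("Cy", "Red", 20)],
)

gold = "SELECT name FROM players WHERE team = 'Red'"
# A differently written query with the same result still earns full reward.
equivalent = "SELECT name FROM players WHERE points > 10"
wrong = "SELECT name FROM players WHERE team = 'Blue'"
malformed = "SELEC name FROM players"

rewards = [execution_reward(conn, q, gold) for q in (equivalent, wrong, malformed)]
```

Because the reward depends only on the returned rows, a query that differs textually from the reference but yields the same answer is not penalized, which is the usual motivation for optimizing execution accuracy alongside token-level supervision.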

Acknowledgments

This work is supported by the National Natural Science Foundation of China (61802116), the Science and Technology Plan of Henan Province (192102210113, 192102210248, 202102210372).

Author information

Corresponding author

Correspondence to Mingdong Zhu.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Zhu, M., Wang, X., Zhang, Y. (2020). Interpretable Text-to-SQL Generation with Joint Optimization. In: Wang, G., Lin, X., Hendler, J., Song, W., Xu, Z., Liu, G. (eds) Web Information Systems and Applications. WISA 2020. Lecture Notes in Computer Science, vol 12432. Springer, Cham. https://doi.org/10.1007/978-3-030-60029-7_32

  • DOI: https://doi.org/10.1007/978-3-030-60029-7_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60028-0

  • Online ISBN: 978-3-030-60029-7

  • eBook Packages: Computer Science, Computer Science (R0)
