Skip to main content

Stochastic Language Generation Using Situated PCFGs

  • Conference paper
  • First Online:
Natural Language Processing and Chinese Computing (NLPCC 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9362))

  • 2291 Accesses

Abstract

This paper presents a purely data-driven approach for generating natural language (NL) expressions from its corresponding semantic representations. Our aim is to exploit a parsing paradigm for natural language generation (NLG) task, which first encodes semantic representations with a situated probabilistic context-free grammar (PCFG), then decodes and yields natural sentences at the leaves of the optimal parsing tree. We deployed our system in two different domains, one is response generation for a Chinese spoken dialogue system, and the other is instruction generation for a virtual environment in English language, obtaining results comparable to state-of-the-art systems both in terms of BLEU scores and human evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Langkilde, I., Knight, K.: Generation that exploits corpus based statistical knowledge. In: Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 704–710 (1998)

    Google Scholar 

  2. Reiter, E., Dale, R.: Building natural language generation systems. Cambridge University Press, New York (2000)

    Book  Google Scholar 

  3. Walker, M.A., Rambow, O., Rogati, M.: Training a sentence planner for spoken dialogue using boosting. Computer Speech and Language 16(3–4), 409–433 (2002)

    Article  Google Scholar 

  4. Angeli, G., Liang, P., Klein, D.: A simple domain-independent probabilistic approach to generation. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, Cambridge, MA, pp. 502–512 (2010)

    Google Scholar 

  5. Kim, J., Mooney, R.: Generative alignment and semantic parsing for learning from ambiguous supervision. In: Proceedings of the 23rd Conference on Computational Linguistics, Beijing, China, pp. 543–551 (2010)

    Google Scholar 

  6. Konstas, I., Lapata, M.: Concept-to-text generation via discriminative reranking. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Jeju, South Korea, pp. 369–378 (2012)

    Google Scholar 

  7. Konstas, I., Lapata, M.: A Global Model for Concept-to-Text Generation. Journal of Artificial Intelligence Research 48(2013), 305–346 (2013)

    Article  Google Scholar 

  8. Ratnaparkhi, A.: Trainable Approaches to Surface Natural Language Generation and Their Application to Conversational Dialog Systems. Computer Speech and Language 16(3–4), 435–455 (2002)

    Article  Google Scholar 

  9. Rieser, E., Lemon, O.: Natural language generation as planning under uncertainty for spoken dialogue systems. In: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, pp. 683–691 (2009)

    Google Scholar 

  10. Huang, L., Chiang, D.: Better k-best parsing. In: Proceedings of the 9th International Workshop on Parsing Technology, Vancouver, British Columbia, pp. 53–64 (2005)

    Google Scholar 

  11. Liang, P., Jordan, M., Klein, D.: Learning semantic correspondences with less supervision. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Suntec, Singapore, pp. 91–99 (2009)

    Google Scholar 

  12. Lu, W., Ng, H.T.: A probabilistic forest-to-string model for language generation from typed lambda calculus expressions. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, pp. 1611–1622 (2011)

    Google Scholar 

  13. Wong, Y.W., Mooney, R.: Generation by inverting a semantic parser that uses statistical machine translation. In: Proceedings of the Human Language Technology and the Conference of the North American Chapter of the Association for Computational Linguistics, Rochester, NY, pp. 172–179 (2007)

    Google Scholar 

  14. McKinley, N., Ray, S.: A decision-theoretic approach to natural language generation. In: Proceedings of the 52nd Annual Meeting of the Association for Computa-tional Linguistics, Baltimore, Maryland, USA, pp. 552–561 (2014)

    Google Scholar 

  15. Dethlefs, N., Cuayahuitl, H.: Hierarchical reinforcement learning for situated natural language generation. Natural Language Engineering 21(03), 391–435 (2014)

    Article  Google Scholar 

  16. Belz, A.: Automatic Generation of Weather Forecast Texts Using Comprehensive Probabilistic Generation-Space Models. Natural Language Engineering 14(4), 431–455 (2008)

    Article  Google Scholar 

  17. Belz, A., Kow, E.: System building cost vs. output quality in data-to-text generation. In: Proceedings of the 12th European Workshop on Natural Language Generation, Athens, Greece, pp. 16–24 (2009)

    Google Scholar 

  18. Gargett, A., Garoufi, K., Koller, A., Striegnitz K.: The GIVE-2 corpus of giving instructions in virtual environments. In: Proceedings of the 7th Conference on International Language Resources and Evaluation (LREC), Valletta, Malta (2010)

    Google Scholar 

  19. Striegnitz, K., Denis, A., Gargett, A., Garoufi, K., Koller, A., Theune, M.: Report on the second challenge on generating instructions in virtual environments (GIVE-2.5). In: Proceedings of the 13th European Workshop on Natural Language Generation (ENLG), Nancy, France, pp. 270–279 (2011)

    Google Scholar 

  20. Chen, Q., Manning, C.D.: A fast and accurate dependency parser using neural networks. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Lan-guage Processing (EMNLP), Doha, Qatar, pp. 740–750 (2014)

    Google Scholar 

  21. Levy, R., Manning, C.D.: Is it harder to parse Chinese, or the Chinese Tree-bank? In: Proceedings of the ACL 2003, Sapporo, Japan, pp. 439–44 (2003)

    Google Scholar 

  22. Kasami, T.: An efficient recognition and syntax analysis algorithm for context-free languages. Tech. rep. AFCRL-65-758, Air Force Cambridge Research Lab, Bedford, Mas-sachusetts (1965)

    Google Scholar 

  23. Younger, D.H.: Recognition and parsing for context-free languages in time n3. Information and Control 10(2), 189–208 (1967)

    Article  MATH  Google Scholar 

  24. Papineni K., Roukos S., Ward, T., Zhu, W.: BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, PA, pp. 311–318 (2002)

    Google Scholar 

  25. Benotti, L., Denis, A.: Giving instructions in virtual environments by corpus-based selection. In: Proceedings of the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Portland, Oregon, pp. 68–77 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Caixia Yuan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Yuan, C., Wang, X., Zhong, Z. (2015). Stochastic Language Generation Using Situated PCFGs. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2015. Lecture Notes in Computer Science(), vol 9362. Springer, Cham. https://doi.org/10.1007/978-3-319-25207-0_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25207-0_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25206-3

  • Online ISBN: 978-3-319-25207-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics