Skip to main content

Finite State Grammar Transduction from Distributed Collected Knowledge

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3878))

  • 1372 Accesses

Abstract

In this paper, we discuss the use of Open Mind Indoor Common Sense (OMICS) project for the purpose of speech recognition of user requests. As part of OMICS data collection, we asked users to enter different ways of asking a robot to perform specific tasks. This paraphrasing data is processed using Natural Language techniques and lexical resources like WordNet to generate a Finite State Grammar Transducer (FSGT). This transducer captures the variations in user requests and captures their structure.

We compare the task recognition performance of this FSGT model with an n-gram Statistical Language Model (SLM). The SLM model is trained with the same data that was used to generate the FSGT. The FSGT model and SLM are combined in a two-pass system to optimize full and partial recognition for both in-grammar and out-of-grammar user requests. Our work validates the use of a web based knowledge capture system to harvest phrases to build grammar models. Work was performed using Nuance Speech Recognition system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mohri, M., Pereira, F., Riley, M.: Weighted finite state transducers in speech recognition. Computer Speech & Language 16, 69–88 (2002)

    Article  Google Scholar 

  2. Stork, D.G.: The Open Mind Initiative. IEEE Expert Systems and Their Applications 14, 19–20 (1999)

    Google Scholar 

  3. Stork, D.G.: Open data collection for training intelligent software in the open mind initiative. In: Proceedings of the Engineering Intelligent Systems (EIS 2000), Paisley, Scotland (2000)

    Google Scholar 

  4. Gupta, R., Kochenderfer, M.: Common sense data acquisition for indoor mobile robots. In: Nineteenth National Conference on Artificial Intelligence, AAAI 2004 (2004)

    Google Scholar 

  5. Punyakanok, V., Roth, D., Yih, W.: Natural language inference via dependency tree mapping: An application to question answering. Computational Linguistics (2005) (submitted for review)

    Google Scholar 

  6. Steedman, M.: Wide-coverage semantic representations from a ccg parser. In: Proceedings of the 20th International Conference on Computational Linquistics (2004)

    Google Scholar 

  7. Bangalore, S., Joshi, A.K.: Supertagging: An approach to alsmost parsing. Computational Linguistics 25, 237–265 (1999)

    Google Scholar 

  8. Solan, Z., Horn, D., Ruppin, E., Edelman, S.: Unsupervised context sensitive language acquisition from a large corpus. In: NIPS (2003)

    Google Scholar 

  9. Klein, D., Manning, C.D.: Natural language grammar induction with a generative constituent-context model. Pattern Recognition 38, 1407–1419 (2005)

    Article  MATH  Google Scholar 

  10. Klein, D., Manning, C.D.: Distributional phrase structure induction. In: Proceedings of the Fifth Conference on Natural Language Learning (CoNLL), pp. 113–120 (2001)

    Google Scholar 

  11. Sinha, A.K., Landay, J.A.: Towards automatic speech input grammar generation for natural language interfaces. In: CHI 2000 Workshop on Natural Language Interfaces, The Hague, The Netherlands (2000)

    Google Scholar 

  12. Martin, P.: The casual cashmere diaper bag: Constraining speech recognition using examples. In: Proceedings of the Association of Computational Linguistics (1997)

    Google Scholar 

  13. Riccardi, G., Bangalore, S.: Automatic acquisition of phrase grammars for stochastic language modeling. In: 6th Workshop on Very Large Corpora, pp. 186–198 (1998)

    Google Scholar 

  14. Dowding, J., Gawron, J.M., Appelt, D.E., Bear, J., Cherny, L., Moore, R., Moran, D.B.: GEMINI: A natural language system for spoken-language understanding. In: Meeting of the Association for Computational Linguistics, pp. 54–61 (1993)

    Google Scholar 

  15. Acero, A., Wang, Y., Wang, K.: A semantically structured language model. In: Special Workshop in Maui, SWIM (2004)

    Google Scholar 

  16. Wang, Y., Mahajan, M., Huang, X.: A unified context-free grammar and n-gram model for spoken language processing. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Istanbul, Turkey (2000)

    Google Scholar 

  17. Hockey, B.A., Lemon, O., Campana, E., Hiatt, L., Aist, G., Hieronymus, J., Gruenstein, A., Dowding, J.: Targeted help for spoken dialogue systems: intelligent feedback improves naive users’ performance. In: Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics, vol. 1, pp. 147–154 (2003)

    Google Scholar 

  18. Klein, D., Manning, C.D.: Parsing with treebank grammars: Empirical bounds, theoretical models, and the structure of the penn treebank. In: Proceedings of the 39th Annual Meeting of the ACL (2001)

    Google Scholar 

  19. Liu, H.: Montylingua: An end-to-end natural language processor with common sense. Available at: http://web.media.mit.edu/hugo/montylingua (2004)

  20. Brill, E.: A simple rule-based part-of-speech tagger. In: Proceedings of ANLP 1992, 3rd Conference on Applied Natural Language Processing, Trento, IT, pp. 152–155 (1992)

    Google Scholar 

  21. Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. Journal of the American Society of Information Science 41, 391–407 (1990)

    Article  Google Scholar 

  22. Nuance: Nuance Speech Recognition System 8.5: Grammar Developer’s Guide, Nuance Coproration, USA (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gupta, R., Hennacy, K. (2006). Finite State Grammar Transduction from Distributed Collected Knowledge. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2006. Lecture Notes in Computer Science, vol 3878. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11671299_36

Download citation

  • DOI: https://doi.org/10.1007/11671299_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32205-4

  • Online ISBN: 978-3-540-32206-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics