Designing Language Models for Voice Portal Applications

International Journal of Speech Technology

Abstract

At HeyAnita, we use statistical language models to improve speech recognition performance in a number of our portal applications, including driving directions, traffic, weather, stocks, sports, movies and restaurants. This paper reviews language modeling implementations in different recognition environments, together with some real-world data.

Cite this article

Shinn, P., Shomphe, M., Lewis, M. et al. Designing Language Models for Voice Portal Applications. International Journal of Speech Technology 7, 93–99 (2004). https://doi.org/10.1023/B:IJST.0000004813.54016.b4

  • DOI: https://doi.org/10.1023/B:IJST.0000004813.54016.b4
