Abstract
At HeyAnita we use statistical language models to improve speech recognition performance in a number of our portal applications, including driving directions, traffic, weather, stocks, sports, movies and restaurants. In this paper, language modeling implementations in different recognition environments and some real world data are reviewed.
Similar content being viewed by others
References
Jelenek, F. (1999). Statistical Methods for Speech Recognition. Cambridge: MIT Press.
Kucera, H. and Francis, N. (1967). Computational Analysis of Present-Day American English. Providence: Brown University Press.
Mandelbrot, B. (1953). An informational theory of the statistical structure of languages. In W. Jackson (Ed.), Communication Theory, Betterworth, pp. 486-502.
Manning, C.D. and Schutze, H. (2000). Foundations of Statistical Natural Language Processing. Cambridge: MIT Press.
Microsoft (2001). SAPI 5.1 Documentation, Grammar Format Tags. Redmond: Microsoft Corporation.
Nuance (1998). NGB-Nuance Grammar Builder Help, Product Version 1.0, File Version 132. Menlo Park: Nuance Communications.
Pierce, J. (1961). An Introduction to Information Theory: Symbols, Signals and Noise. New York: Dover Publications Inc.
Rosenfield, R. (2000). Two Decades of Statistical Language Modeling: Where do we go from here? http://www.ima.umn.edu/talks/workshops/10-30-11-3.2000/rosenfeld/rosenfeld.pdf.
Shannon, C. (1948). A mathematical theory of communication. Bell System Technical Journal, 27:379–423, 623-656, July and October.
set_within_class_probs Usage Notes. Cambridge: SpeechWorks International.
SpeechWorks (2002). SpeechWorks Developer's Guide, Open-Speech Recognizer 1.0. Cambridge: SpeechWorks International.
Sun Microsystems (2002). Java Speech Grammar Format Specification, Version 1.0, Oct. 1998. Santa Clara: Sun Microsystems. (http://java.sun.com/products/java-media/speech/forDevelopers/JSGF/index.html).
W3C (2001). Speech Recognition Grammar Specification for the W3C Speech Interface Framework, W3C Working Draft 20 August 2001, A. Hunt and S. McGlashan (Eds.). (http://www.w3.org/TR/2001/WD-speech-grammar-20010820).
Zipf, G. (1932). Selective Studies and the Principle of Relative Frequency in Language. Cambridge: MIT Press.
Zipf, G. (1935). Psycho-Biology of Languages. New York: Houghton-Mifflin.
Zipf, G. (1949). Human Behavior and the Principle of Least Effort. New York: Addison-Wesley.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Shinn, P., Shomphe, M., Lewis, M. et al. Designing Language Models for Voice Portal Applications. International Journal of Speech Technology 7, 93–99 (2004). https://doi.org/10.1023/B:IJST.0000004813.54016.b4
Issue Date:
DOI: https://doi.org/10.1023/B:IJST.0000004813.54016.b4