Designing Language Models for Voice Portal Applications

Shinn, Phil; Shomphe, Matthew; Lewis, Molly; Carey, Kathy; Kim, David

doi:10.1023/B:IJST.0000004813.54016.b4

Designing Language Models for Voice Portal Applications

Published: January 2004

Volume 7, pages 93–99, (2004)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

Phil Shinn¹,
Matthew Shomphe¹,
Molly Lewis¹,
Kathy Carey¹ &
…
David Kim¹

27 Accesses
Explore all metrics

Abstract

At HeyAnita we use statistical language models to improve speech recognition performance in a number of our portal applications, including driving directions, traffic, weather, stocks, sports, movies and restaurants. In this paper, language modeling implementations in different recognition environments and some real world data are reviewed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A survey on large language model based autonomous agents

Article Open access 22 March 2024

Autoencoders and their applications in machine learning: a survey

Article Open access 03 February 2024

Automatic speech recognition: a survey

Article 10 November 2020

References

Jelenek, F. (1999). Statistical Methods for Speech Recognition. Cambridge: MIT Press.
Google Scholar
Kucera, H. and Francis, N. (1967). Computational Analysis of Present-Day American English. Providence: Brown University Press.
Google Scholar
Mandelbrot, B. (1953). An informational theory of the statistical structure of languages. In W. Jackson (Ed.), Communication Theory, Betterworth, pp. 486-502.
Manning, C.D. and Schutze, H. (2000). Foundations of Statistical Natural Language Processing. Cambridge: MIT Press.
Google Scholar
Microsoft (2001). SAPI 5.1 Documentation, Grammar Format Tags. Redmond: Microsoft Corporation.
Google Scholar
Nuance (1998). NGB-Nuance Grammar Builder Help, Product Version 1.0, File Version 132. Menlo Park: Nuance Communications.
Google Scholar
Pierce, J. (1961). An Introduction to Information Theory: Symbols, Signals and Noise. New York: Dover Publications Inc.
Google Scholar
Rosenfield, R. (2000). Two Decades of Statistical Language Modeling: Where do we go from here? http://www.ima.umn.edu/talks/workshops/10-30-11-3.2000/rosenfeld/rosenfeld.pdf.
Shannon, C. (1948). A mathematical theory of communication. Bell System Technical Journal, 27:379–423, 623-656, July and October.
Google Scholar
set_within_class_probs Usage Notes. Cambridge: SpeechWorks International.
Google Scholar
SpeechWorks (2002). SpeechWorks Developer's Guide, Open-Speech Recognizer 1.0. Cambridge: SpeechWorks International.
Google Scholar
Sun Microsystems (2002). Java Speech Grammar Format Specification, Version 1.0, Oct. 1998. Santa Clara: Sun Microsystems. (http://java.sun.com/products/java-media/speech/forDevelopers/JSGF/index.html).
Google Scholar
W3C (2001). Speech Recognition Grammar Specification for the W3C Speech Interface Framework, W3C Working Draft 20 August 2001, A. Hunt and S. McGlashan (Eds.). (http://www.w3.org/TR/2001/WD-speech-grammar-20010820).
Zipf, G. (1932). Selective Studies and the Principle of Relative Frequency in Language. Cambridge: MIT Press.
Google Scholar
Zipf, G. (1935). Psycho-Biology of Languages. New York: Houghton-Mifflin.
Google Scholar
Zipf, G. (1949). Human Behavior and the Principle of Least Effort. New York: Addison-Wesley.
Google Scholar

Download references

Author information

Authors and Affiliations

HeyAnita Inc., 303 N. Glenoaks Blvd., 5th Floor, Burbank, CA, 91502, USA
Phil Shinn, Matthew Shomphe, Molly Lewis, Kathy Carey & David Kim

Authors

Phil Shinn
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Shomphe
View author publications
You can also search for this author in PubMed Google Scholar
Molly Lewis
View author publications
You can also search for this author in PubMed Google Scholar
Kathy Carey
View author publications
You can also search for this author in PubMed Google Scholar
David Kim
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shinn, P., Shomphe, M., Lewis, M. et al. Designing Language Models for Voice Portal Applications. International Journal of Speech Technology 7, 93–99 (2004). https://doi.org/10.1023/B:IJST.0000004813.54016.b4

Download citation

Issue Date: January 2004
DOI: https://doi.org/10.1023/B:IJST.0000004813.54016.b4

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Designing Language Models for Voice Portal Applications

Abstract

Access this article

Similar content being viewed by others

A survey on large language model based autonomous agents

Autoencoders and their applications in machine learning: a survey

Automatic speech recognition: a survey

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Designing Language Models for Voice Portal Applications

Abstract

Access this article

Similar content being viewed by others

A survey on large language model based autonomous agents

Autoencoders and their applications in machine learning: a survey

Automatic speech recognition: a survey

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation