skip to main content
article
Free access

Continuous speech recognition

Published: 01 February 1977 Publication History

Abstract

This group works towards automatic transcription of continuous speech with a vocabulary and syntax as unrestricted as possible. It is a long-term effort; however, an experimental system is operational. The acoustic processor contains a spectrum analyzer based on the Fast Fourier Transform and a phone segmenter/recognizer which makes use of transitional and steady-state information in its classification. The linguistic processor accepts an imperfect string of phones and produces an estimated transcription of the speech input.

References

[1]
L. R. Bahl, J. K. Baker, P. S. Cohen, N. R. Dixon, F. Jelinek, R. L. Mercer, H. F. Silverman, "Preliminary Results on the Performance of a System for the Automatic Recognition of Continuous Speech", Proc. 1976 International Conference on Acoustics, Speech and Signal Processing, April 1976, RC-5654, Phila., Penn.
[2]
L. R. Bahl and F. Jelinek, "Decoding for Channels with Insertions, Deletions and Substitutions, with Applications to speech Recognition", IEEE Trans. on Information Theory, Vol. IT-21, No. 4, pp. 404--411, July, 1975.
[3]
J. K. Baker, "Stochastic Modeling for Automatic Speech Understanding", in Speech Recognition, Academic Press, pp. 521--542, November 1975.
[4]
J. K. Baker, "The DRAGON System - An Overview", IEEE Trans. on Acoustics, Speech and Signal Processing, February 1975.
[5]
J. M. Baker, "Time Domain Analysis and Segmentation of Connected Speech", Proc. of the Speech Communications Seminar Stockholm, August 1974.
[6]
P. S. Cohen and R. L. Mercer, "The Phonological Component of an Automatic Speech-Recognition System", in Speech Recognition, Academic Press, November 1975, pp. 275--320.
[7]
N. R. Dixon and H. F. Silverman, "A Description of a Parametrically-Controlled Modular Structure for Speech Processing", IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. ASSP-23, No. 1 RC-5085, February 1975.
[8]
N. R. Dixon and H. F. Silverman, "A General Language-Operated Decision Implementation System (GLODIS): Its Application to Continuous-Speech Segmentation", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. ASSP-24, April 1976, pp. 137--162.
[9]
F. Jelinek, "Continuous Speech Recognition by Statistical Methods", IEEE Proceedings (Invited Paper), April 1976, Vol. 64, No. 4, pp. 532--556.
[10]
F. Jelinek, L. R. Bahl, and R. L. Mercer, "Design of a Linguistic Statistical Decoder for the Recognition of Continuous Speech", Proceedings of IEEE Symposium on Speech Recognition, Carnegie-Mellon University, pp. 255--259. IEEE Trans. on Information Theory, Vol IT-21, No. 3, pp. 250--256, May 1975.
[11]
H. F. Silverman and N. R. Dixon, "The 1976 Modular Acoustic Processor (MAP): Diadic Segment Classification and Final Phonemic String Estimation", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. ASSP-24, No. 4, August 1976, pp. 289--295.
[12]
H. F. Silverman and N. R. Dixon, "An Objective Parallel Evaluator or Segmentation/Classification Performance for Multiple Systems", IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. ASSP-23, No. 1, RC-5086, February 1975.
[13]
H. F. Silverman and N. R. Dixon, "A Parametrically Controlled Spectral Analysis System for Speech", IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. ASSP-23, No. 5, pp. 362--381, RC-4702, October 1974.
[14]
C. C. Tappert, "Phonetic Matching Algorithms in Automatic Speech Recognition and Their Potential Impact on Phonetics and Phonology", Proc. of Eighth International Congress on Acoustics, London, 1974, p. 304, August 1974.

Cited By

View all
  • (2024)Interactive Multimedia Association-Adaptive Differential Pulse Code Modulation Codec With Gated Recurrent Unit PredictorIEEE Access10.1109/ACCESS.2024.349360412(165395-165406)Online publication date: 2024
  • (2017)Temporal probabilistic measure for link prediction in collaborative networksApplied Intelligence10.1007/s10489-016-0883-y47:1(83-95)Online publication date: 1-Jul-2017
  • (2015)Enhancement to community-based multi-relational link prediction using co-occurrence probability featureProceedings of the 2nd ACM IKDD Conference on Data Sciences10.1145/2732587.2732599(86-91)Online publication date: 18-Mar-2015

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGART Bulletin
ACM SIGART Bulletin Just Accepted
February 1977
92 pages
ISSN:0163-5719
DOI:10.1145/1045283
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 February 1977
Published in SIGAI , Issue 61

Check for updates

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)43
  • Downloads (Last 6 weeks)11
Reflects downloads up to 16 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Interactive Multimedia Association-Adaptive Differential Pulse Code Modulation Codec With Gated Recurrent Unit PredictorIEEE Access10.1109/ACCESS.2024.349360412(165395-165406)Online publication date: 2024
  • (2017)Temporal probabilistic measure for link prediction in collaborative networksApplied Intelligence10.1007/s10489-016-0883-y47:1(83-95)Online publication date: 1-Jul-2017
  • (2015)Enhancement to community-based multi-relational link prediction using co-occurrence probability featureProceedings of the 2nd ACM IKDD Conference on Data Sciences10.1145/2732587.2732599(86-91)Online publication date: 18-Mar-2015

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media