Paper
22 October 1993 Audio-visual speech recognition for a vowel discrimination task
Peter L. Silsbee, Alan Conrad Bovik
Author Affiliations +
Proceedings Volume 2094, Visual Communications and Image Processing '93; (1993) https://doi.org/10.1117/12.157855
Event: Visual Communications and Image Processing '93, 1993, Cambridge, MA, United States
Abstract
Among the various methods which have been proposed to improve the robustness and accuracy of automatic speech recognition (ASR) systems, lipreading has received very little attention. In this paper, we provide motivation for the use of lipreading. A novel speaker dependent lipreading system is developed, which uses hidden Markov modeling, a well known and highly successful technique for audio-based ASR. It is used in conjunction with an audio ASR system in order to improve the accuracy of the latter, especially under degraded acoustical conditions. Reductions in error of 30 to over 60% result.
© (1993) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Peter L. Silsbee and Alan Conrad Bovik "Audio-visual speech recognition for a vowel discrimination task", Proc. SPIE 2094, Visual Communications and Image Processing '93, (22 October 1993); https://doi.org/10.1117/12.157855
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Information visualization

Laser induced plasma spectroscopy

Mouth

Speech recognition

Acoustics

Signal processing

Back to Top