Conferences >2003 IEEE International Confe...

Using speech/non-speech detection to bias recognition search on noisy data

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This paper focuses on the recognition of noisy speech. We show that the decoding of a noisy speech waveform can be facilitated if the recognizer has explicit knowledge of...Show More

Metadata

Abstract:

This paper focuses on the recognition of noisy speech. We show that the decoding of a noisy speech waveform can be facilitated if the recognizer has explicit knowledge of where it should hypothesize speech phones, and where it should map the acoustics to non-speech phones. We build a speech/non-speech detector and use its output as an additional front-end feature. We show that by appropriately weighting the contribution of this feature in the decoder and by modifying the acoustic models accordingly, we can penalize speech/non-speech confusions and consequently reduce the recognition error rate. This approach gives a 12% overall error rate reduction on a wide variety of recognition tasks and noise characteristics without degrading performance on clean test data. A simple extension of the approach boosts recognition improvements on noisy test sets to 14% overall.

Published in: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

Date of Conference: 06-10 April 2003

Date Added to IEEE Xplore: 21 May 2003

Print ISBN:0-7803-7663-3

Print ISSN: 1520-6149

DOI: 10.1109/ICASSP.2003.1198808

Conference Location: Hong Kong, China

Contents

References is not available for this document.

Using speech/non-speech detection to bias recognition search on noisy data

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Using speech/non-speech detection to bias recognition search on noisy data

Alerts

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?