Speech recognition in unseen and noisy channel conditions


Abstract:

Speech recognition in varying background conditions is a challenging problem. Acoustic condition mismatch between training and evaluation data can significantly reduce recognition performance. For mismatched conditions, data-adaptation techniques are typically found to be useful, as they expose the acoustic model to the new data condition(s). Supervised adaptation techniques usually provide substantial performance improvement, but such gain is contingent on having labeled or transcribed data, which is often unavailable. The alternative is unsupervised adaptation, where feature-transform methods and model-adaptation techniques are typically explored. This work investigates robust features, feature-space maximum likelihood linear regression (fMLLR) transform, and deep convolutional nets to address the problem of unseen channel and noise conditions. In addition, the work investigates bottleneck (BN) features extracted from deep autoencoder (DAE) networks trained by using acoustic features extracted from the speech signal. We demonstrate that such representations not only produce robust systems but also that they can be used to perform data selection for unsupervised model adaptation. Our results indicate that the techniques presented in this paper significantly improve performance of speech recognition systems in unseen channel and noise conditions.
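To make the bottleneck (BN) idea concrete, here is a minimal sketch of a deep-autoencoder-style feature extractor. This is an illustrative assumption, not the paper's actual architecture: a single-hidden-layer autoencoder (the hidden layer acting as the bottleneck) trained by gradient descent on toy "acoustic feature" frames; after training, the bottleneck activations are used as BN features.

```python
import numpy as np

def train_dae(X, bn_dim=8, epochs=200, lr=0.05, seed=0):
    """Train a tiny autoencoder on feature frames X (n_frames x dim).

    The hidden layer of width bn_dim is the bottleneck; the function
    returns the encoder parameters needed to extract BN features.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0, 0.1, (d, bn_dim)); b1 = np.zeros(bn_dim)   # encoder
    W2 = rng.normal(0, 0.1, (bn_dim, d)); b2 = np.zeros(d)        # decoder
    for _ in range(epochs):
        H = np.tanh(X @ W1 + b1)      # bottleneck activations
        Y = H @ W2 + b2               # linear reconstruction
        E = Y - X                     # reconstruction error (MSE gradient)
        # Backpropagate through the linear decoder and tanh encoder
        gW2 = H.T @ E / n; gb2 = E.mean(axis=0)
        dH = (E @ W2.T) * (1.0 - H ** 2)
        gW1 = X.T @ dH / n; gb1 = dH.mean(axis=0)
        W1 -= lr * gW1; b1 -= lr * gb1
        W2 -= lr * gW2; b2 -= lr * gb2
    return W1, b1

def bottleneck_features(X, W1, b1):
    """Extract BN features: the trained encoder's hidden activations."""
    return np.tanh(X @ W1 + b1)

# Toy stand-in for acoustic features: 200 frames of 20-dim vectors
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 20))
W1, b1 = train_dae(X, bn_dim=8)
F = bottleneck_features(X, W1, b1)
print(F.shape)  # (200, 8)
```

In the paper's setting the input frames would be real acoustic features (e.g. filterbank or MFCC vectors) and the network deeper; the key point illustrated here is that the low-dimensional bottleneck layer, not the reconstruction, is kept as the feature representation.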
Date of Conference: 05-09 March 2017
Date Added to IEEE Xplore: 19 June 2017
Electronic ISSN: 2379-190X
Conference Location: New Orleans, LA, USA
