Journals & Magazines >IEEE/ACM Transactions on Audi... >Volume: 25 Issue: 5

Features for Masking-Based Monaural Speech Separation in Reverberant Conditions

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Monaural speech separation is a fundamental problem in speech and signal processing. This problem can be approached from a supervised learning perspective by predicting a...Show More

Metadata

Abstract:

Monaural speech separation is a fundamental problem in speech and signal processing. This problem can be approached from a supervised learning perspective by predicting an ideal time-frequency mask from features of noisy speech. In reverberant conditions at low signal-to-noise ratios (SNRs), accurate mask prediction is challenging and can benefit from effective features. In this paper, we investigate an extensive set of acoustic-phonetic features extracted in adverse conditions. Deep neural networks are used as the learning machine, and separation performance is evaluated using standard objective speech intelligibility metrics. Separation performance is systematically evaluated in both nonspeech and speech interference, in a variety of SNRs, reverberation times, and direct-to-reverberant energy ratios. Considerable performance improvement is observed by using contextual information, likely due to temporal effects of room reverberation. In addition, we construct feature combination sets using a sequential floating forward selection algorithm, and combined features outperform individual ones. We also find that optimal feature sets in anechoic conditions are different from those in reverberant conditions.

Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing ( Volume: 25, Issue: 5, May 2017)

Page(s): 1085 - 1094

Date of Publication: 27 March 2017

ISSN Information:

DOI: 10.1109/TASLP.2017.2687829

Funding Agency:

Contents

References is not available for this document.

Features for Masking-Based Monaural Speech Separation in Reverberant Conditions

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Features for Masking-Based Monaural Speech Separation in Reverberant Conditions

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?