Journals & Magazines >IEEE/ACM Transactions on Audi... >Volume: 32

Dual-Branch Modeling Based on State-Space Model for Speech Enhancement

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Traditional time-frequency domain speech enhancement methods either only enhance the amplitude spectral features without changing the phase that contributes to the natura...Show More

Metadata

Abstract:

Traditional time-frequency domain speech enhancement methods either only enhance the amplitude spectral features without changing the phase that contributes to the naturalness, intelligibility and harmonic structure, or improve the estimation of the complex spectral features including the real and imaginary components, which limits the accuracy of amplitude and phase estimation. To address this issue, we propose a joint dual-branch structured state-space model that leverages the strengths of both branches while keeping computational complexity low. Specifically, we introduce interaction modules between the two branches to facilitate information exchange, enabling features learned from one branch to compensate for missing parts in the other. Furthermore, to reduce model complexity, we introduce the diagonal version of structured state-space sequence (S4D) model for speech feature sequence denoising in both branches. Experimental results show that our low-complexity model achieves significant improvements over previous advanced systems on VoiceBank+DEMAND and TIMIT+NOISE92 datasets.

Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing ( Volume: 32)

Page(s): 1457 - 1467

Date of Publication: 06 February 2024

ISSN Information:

DOI: 10.1109/TASLP.2024.3362691

Funding Agency:

Contents

References is not available for this document.

Dual-Branch Modeling Based on State-Space Model for Speech Enhancement

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Dual-Branch Modeling Based on State-Space Model for Speech Enhancement

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?