CAT-DUnet: Enhancing Speech Dereverberation via Feature Fusion and Structural Similarity Loss

Abstract:

Reverberation significantly degrades speech intelligibility, posing a substantial challenge in speech processing. While advances in deep learning offer promising solutions, current methods often neglect to integrate low-level and high-level feature representations effectively, which degrades overall performance. In addition, prior approaches rely heavily on loss functions grounded in quantitative error metrics, which may not fully capture the perceptual characteristics of speech signals. To address these concerns, we introduce CAT-DUnet, a Unet architecture that integrates channel attention, time-frequency attention, and dilated convolution blocks to enhance feature fusion. We use structural similarity as the training objective to align more closely with human perception, and investigate how applying various transformations to spectrograms affects the performance of the loss function. Extensive ablation experiments demonstrate the effectiveness of the proposed enhancements. Our model outperforms state-of-the-art models on 6 out of 7 metrics.

Published in: IEEE Signal Processing Letters (Volume: 31)

Page(s): 456 - 460

Date of Publication: 19 January 2024

DOI: 10.1109/LSP.2024.3356420
