DOI: 10.1145/3488933.3488984
research-article

Single-Channel Speech Enhancement Using Amplitude Estimation and Phase Reconstruction

Published: 25 February 2022

Abstract

Traditional speech enhancement algorithms modify only the amplitude of the noisy spectrum, while the phase and the estimated noise power spectral density remain inaccurate, which seriously distorts the synthesized speech signal. To address this problem, this paper proposes a speech enhancement algorithm that combines amplitude estimation with phase reconstruction. First, the minima controlled recursive averaging (MCRA) method is used to estimate the noise power spectrum, rather than relying only on the initial silent frames, and this estimate is used to obtain the amplitude of the enhanced signal. Second, the phase spectrum of the noisy speech signal is reconstructed. Finally, the reconstructed phase spectrum is combined with the estimated speech amplitude spectrum to generate the clean speech estimate. The proposed algorithm is evaluated in simulation on the NOIZEUS data set. Objective results in terms of perceptual evaluation of speech quality (PESQ) and the normalized covariance metric (NCM), together with spectrogram analysis, show that the proposed method substantially suppresses noise, reduces signal distortion at low signal-to-noise ratio (SNR), and improves speech quality and intelligibility.
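
The three steps in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no equations, so the tracker below is a simplified MCRA-style recursion (no frequency smoothing), the amplitude estimator is plain power spectral subtraction with a spectral floor, and the phase reconstruction step is left as an optional `phase` argument that falls back to the noisy phase. All function names, parameter names, and default values (`alpha_s`, `alpha_d`, `alpha_p`, `delta`, `window`, `floor`) are assumptions chosen for illustration.

```python
import cmath
import math


def mcra_noise_psd(power_frames, alpha_s=0.8, alpha_d=0.95, alpha_p=0.2,
                   delta=5.0, window=100):
    """Simplified MCRA-style noise PSD tracker (illustrative assumption).

    power_frames: list of frames, each a list of noisy powers |Y(k)|^2.
    Returns one noise PSD estimate (list per bin) for every frame.
    """
    first = power_frames[0]
    smoothed = list(first)        # recursively smoothed noisy power
    s_min = list(first)           # running minimum of the smoothed power
    s_tmp = list(first)           # temporary minimum for the current window
    presence = [0.0] * len(first)  # smoothed speech-presence probability
    noise = list(first)           # noise PSD, initialised from frame 0
    estimates = []
    for l, frame in enumerate(power_frames):
        for k, y2 in enumerate(frame):
            if l > 0:
                smoothed[k] = alpha_s * smoothed[k] + (1 - alpha_s) * y2
                if l % window == 0:
                    # Restart the minimum search at each window boundary.
                    s_min[k] = min(s_tmp[k], smoothed[k])
                    s_tmp[k] = smoothed[k]
                else:
                    s_min[k] = min(s_min[k], smoothed[k])
                    s_tmp[k] = min(s_tmp[k], smoothed[k])
            # Speech is assumed present when power is well above its minimum.
            indicator = 1.0 if smoothed[k] > delta * s_min[k] else 0.0
            presence[k] = alpha_p * presence[k] + (1 - alpha_p) * indicator
            # Slow the noise update down while speech is likely present.
            a = alpha_d + (1 - alpha_d) * presence[k]
            noise[k] = a * noise[k] + (1 - a) * y2
        estimates.append(list(noise))
    return estimates


def enhance_frame(noisy_spectrum, noise_psd, phase=None, floor=0.05):
    """Combine an estimated amplitude with a (reconstructed) phase.

    The amplitude comes from power spectral subtraction with a floor, a
    stand-in for the paper's estimator. `phase` is a hypothetical hook for
    a reconstructed phase spectrum; the noisy phase is used if it is None.
    """
    enhanced = []
    for k, y in enumerate(noisy_spectrum):
        power = abs(y) ** 2
        clean_power = max(power - noise_psd[k], (floor ** 2) * power)
        theta = cmath.phase(y) if phase is None else phase[k]
        enhanced.append(cmath.rect(math.sqrt(clean_power), theta))
    return enhanced
```

In use, each short-time Fourier transform frame of the noisy signal would be passed through `mcra_noise_psd` (as squared magnitudes) and then through `enhance_frame`, with an inverse transform and overlap-add producing the time-domain output. Unlike an estimate taken only from initial silent frames, the recursive tracker keeps updating during the utterance while holding the estimate nearly constant during speech activity.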

Cited By

  • (2023) The speech signal enhancement approach with multiple sub-frames analysis for complex magnitude and phase spectrum recompense. Expert Systems with Applications 232, 120746. DOI: 10.1016/j.eswa.2023.120746. Online publication date: Dec-2023.

        Published In

        AIPR '21: Proceedings of the 2021 4th International Conference on Artificial Intelligence and Pattern Recognition
        September 2021
        715 pages
        ISBN:9781450384087
        DOI:10.1145/3488933

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Author Tags

        1. Noise estimation
        2. Phase reconstruction
        3. Speech enhancement

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Funding Sources

        • Special Project of the Science and Technology Foundation of the Ministry of Public Security, Grant No. 2019GABJC41

        Conference

        AIPR 2021
