skip to main content
10.1145/3408127.3408134acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicdspConference Proceedingsconference-collections
research-article

Speech Enhancement Based on Adaptive Harmonic Model Using Maximum Likelihood Method

Published: 10 September 2020 Publication History

Abstract

Speech enhancement is a hot topic in the modern society due to its extensive applications such as automatic speech recognition, mobile communication, etc. Spectral subtraction is a very valid and direct denoising algorithm, but it is still needed to be further developed due to complex application. In this paper, we firstly assume adaptive harmonic model to model the speech signal. Speech enhancement is then achieved by spectral subtraction. To further enhance the speech intelligibility, we attempt to estimate the harmonic model. Maximum likelihood method is considered to derive the phase and amplitude update formulae of the modelled harmonic signal, which are aimed to depress the distortion due to spectral subtraction. Different from convention spectral subtraction, both the amplitude and phase parameters of adaptive harmonic model are combined to be updated. We assume the additive noise is correlated along the time sequence. By the optimal solution of maximum likelihood method, we obtain the updated version of speech enhancement. Simulation results show the effectiveness of the new algorithm, and further improvement of spectral subtraction.

References

[1]
Bahja F, Martino J D, Elhaj E I, Aboutajdine D (2015) An overview of the CATE algorithms for real-time pitch determination. Signal Image & Video Processing, 9 (3): 589--599.
[2]
Boll S F (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Transaction on Acoustics, Speech and Signal Processing, vol. 27, pp: 113--130.
[3]
Gold B, Morgan N, Ellis D (2011) Speech and Audio Signal Processing: Processing and Perception of Speech and Music. John Wiley & Sons.
[4]
Kamath S, Loizou P C (2002) A multi-band spectral subtraction method for enhancing speech corrupted by colored noise[C]. IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, USA: IV-4164--IV-4164.
[5]
Kay S M (1993) Fundamentals of Statistical Signal Processing, Volume 1: Estimation Theory. Prentice Hall.
[6]
Kulmer J, Mowlaee P, Watanabe M K (2014) A probabilistic approach for phase estimation in single-channel speech enhancement using von mises phase priors. IEEE Workshop on Machine Learning for Signal Processing, Sept. 2014.
[7]
Loizou P C, Ma J (2011) Extending the articulation index to account for non-linear distortions introduced by noise suppression algorithms. Journal of the Acoustical Society of America, 130(2), pp: 986--995.
[8]
Martin R (2001) Noise Power Spectral Density Estimation Based on Optimal Smoothing and Minimum Statistics. IEEE Transactions on Speech and Audio Processing 9(5):504--512.
[9]
Mowlaee P, Kulmer J (2015) Harmonic Phase Estimation in Single-Channel Speech Enhancement Using Phase Decomposition and SNR Information. IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 23, No. 9, pp: 1521--1532.
[10]
Schonhoff T, Giordano A A (2007) Detection and Estimation Theory and Its Aplications. Prentice Hall.
[11]
Sugiyama A, Miyahara R (2013) Phase randomization -- a new paradigm for single-channel signal enhancement. ICASSP 2013, pp: 7487--7491.
[12]
Wang D, Lim J (1982) The unimportance of phase in speech enhancement. IEEE Transaction on Acoustics, Speech and Signal Processing, vol. 30, no. 4, pp: 679--681.

Index Terms

  1. Speech Enhancement Based on Adaptive Harmonic Model Using Maximum Likelihood Method

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICDSP '20: Proceedings of the 2020 4th International Conference on Digital Signal Processing
    June 2020
    383 pages
    ISBN:9781450376877
    DOI:10.1145/3408127
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • University of Electronic Science and Technology of China: University of Electronic Science and Technology of China

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 September 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. harmonic model
    2. maximum likelihood
    3. noise
    4. spectral subtraction
    5. speech enhancement

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICDSP 2020

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 34
      Total Downloads
    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 08 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media