Speech Enhancement Using Mixtures of Gaussians for Speech and Noise

  • Conference paper
Text, Speech and Dialogue (TSD 2002)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2448)

Abstract

In this article we approximate the clean speech spectral magnitude, as well as the noise spectral magnitude, with a mixture of Gaussian pdfs fitted by the Expectation-Maximization (EM) algorithm. Subsequently, we apply the Bayesian inference framework to the degraded spectral coefficients and, by employing Minimum Mean Square Error (MMSE) estimation, derive a closed-form solution for the spectral magnitude estimation task that is adapted to the spectral characteristics and noise variance of each band. We evaluate our algorithm using true, coloured, slowly and quickly varying noise types (factory and aircraft noise) and demonstrate its robustness at very low SNRs.
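The two steps named in the abstract, fitting Gaussian mixtures with EM and then forming an MMSE estimate, can be illustrated with a minimal sketch. It is not the paper's estimator: the paper's closed form for spectral magnitudes is not given in the abstract, so the sketch assumes a simplified scalar model y = x + n in a single band, with unconstrained one-dimensional Gaussian mixtures for x (speech) and n (noise). The function names `fit_gmm_1d` and `mmse_estimate` are hypothetical, chosen only for this illustration.

```python
import numpy as np


def normal_pdf(x, mean, var):
    """Density of a univariate Gaussian with the given mean and variance."""
    return np.exp(-0.5 * (x - mean) ** 2 / var) / np.sqrt(2.0 * np.pi * var)


def fit_gmm_1d(data, n_components, n_iter=100):
    """Fit a 1-D Gaussian mixture to `data` with plain EM (illustrative only)."""
    data = np.asarray(data, dtype=float)
    weights = np.full(n_components, 1.0 / n_components)
    # Assumption: initialise the means at evenly spaced quantiles of the data.
    means = np.quantile(data, (np.arange(n_components) + 0.5) / n_components)
    variances = np.full(n_components, data.var() + 1e-6)
    for _ in range(n_iter):
        # E-step: responsibility of each component for each sample, shape (N, K).
        resp = weights * normal_pdf(data[:, None], means, variances)
        resp /= resp.sum(axis=1, keepdims=True) + 1e-300
        # M-step: re-estimate weights, means and variances.
        nk = resp.sum(axis=0) + 1e-12
        weights = nk / nk.sum()
        means = (resp * data[:, None]).sum(axis=0) / nk
        variances = (resp * (data[:, None] - means) ** 2).sum(axis=0) / nk + 1e-6
    return weights, means, variances


def mmse_estimate(y, speech, noise):
    """Posterior mean E[x | y] for y = x + n, with x and n independent 1-D GMMs.

    `speech` and `noise` are (weights, means, variances) triples.
    """
    ws, ms, vs = (np.asarray(a, dtype=float) for a in speech)
    wn, mn, vn = (np.asarray(a, dtype=float) for a in noise)
    y = np.asarray(y, dtype=float)[..., None, None]
    # For a component pair (i, j), y is Gaussian with summed mean and variance.
    sum_mean = ms[:, None] + mn[None, :]
    sum_var = vs[:, None] + vn[None, :]
    # Posterior probability of each (speech, noise) component pair given y.
    post = ws[:, None] * wn[None, :] * normal_pdf(y, sum_mean, sum_var)
    post /= post.sum(axis=(-2, -1), keepdims=True)
    # Per-pair conditional mean: a Wiener-like gain applied to the residual.
    gain = vs[:, None] / sum_var
    cond_mean = ms[:, None] + gain * (y - sum_mean)
    return (post * cond_mean).sum(axis=(-2, -1))
```

In this simplified model each pair of speech and noise components contributes a linear estimate, and the mixture posterior blends them, so the effective gain adapts to the observation. Applying the same idea per frequency band would require separate mixtures for each band, as the abstract suggests.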

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Potamitis, I., Fakotakis, N., Liolios, N., Kokkinakis, G. (2002). Speech Enhancement Using Mixtures of Gaussians for Speech and Noise. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science, vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_48

  • DOI: https://doi.org/10.1007/3-540-46154-X_48

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44129-8

  • Online ISBN: 978-3-540-46154-8

  • eBook Packages: Springer Book Archive
