Speech Enhancement Using Mixtures of Gaussians for Speech and Noise

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2448)

Abstract

In this article we approximate the clean speech spectral magnitude, as well as the noise spectral magnitude, with mixtures of Gaussian pdfs fitted by the Expectation-Maximization (EM) algorithm. Subsequently, we apply the Bayesian inference framework to the degraded spectral coefficients and, by employing Minimum Mean Square Error (MMSE) estimation, derive a closed-form solution for the spectral magnitude estimation task that is adapted to the spectral characteristics and noise variance of each band. We evaluate our algorithm using true, coloured, slowly and quickly varying noise types (factory and aircraft noise) and demonstrate its robustness at very low SNRs.
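
As a rough illustration of the two ingredients named in the abstract (EM-fitted Gaussian mixtures and an MMSE estimate under those mixtures), the sketch below uses a deliberately simplified scalar model: an observation y = x + n, with independent one-dimensional Gaussian mixture priors on x and n. This is an assumption made for illustration only; it ignores the non-negativity of spectral magnitudes and the per-band treatment, and it is not the closed-form solution derived in the paper. The function names `fit_gmm_1d` and `mmse_estimate` are hypothetical.

```python
import numpy as np

def fit_gmm_1d(x, k, n_iter=100, seed=0):
    """Fit a k-component 1-D Gaussian mixture to samples x with plain EM."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    w = np.full(k, 1.0 / k)                     # mixture weights
    mu = rng.choice(x, size=k, replace=False)   # initial means drawn from the data
    var = np.full(k, x.var() + 1e-6)            # initial variances
    for _ in range(n_iter):
        # E-step: responsibilities r[n, i] = p(component i | x[n])
        log_p = (np.log(w) - 0.5 * np.log(2 * np.pi * var)
                 - 0.5 * (x[:, None] - mu) ** 2 / var)
        log_p -= log_p.max(axis=1, keepdims=True)   # numerical stability
        r = np.exp(log_p)
        r /= r.sum(axis=1, keepdims=True)
        # M-step: re-estimate weights, means and variances
        nk = r.sum(axis=0) + 1e-12
        w = nk / nk.sum()
        mu = (r * x[:, None]).sum(axis=0) / nk
        var = (r * (x[:, None] - mu) ** 2).sum(axis=0) / nk + 1e-6
    return w, mu, var

def mmse_estimate(y, speech, noise):
    """Posterior mean E[x | y] for y = x + n (assumed additive scalar model),
    with independent Gaussian mixture priors on x (speech) and n (noise)."""
    ws, mus, vs = (np.asarray(a, dtype=float) for a in speech)
    wn, mun, vn = (np.asarray(a, dtype=float) for a in noise)
    y = np.atleast_1d(np.asarray(y, dtype=float))[:, None, None]
    m = mus[:, None] + mun[None, :]     # mean of y under component pair (i, j)
    v = vs[:, None] + vn[None, :]       # variance of y under pair (i, j)
    # Posterior probability of each component pair given y
    log_post = (np.log(ws)[:, None] + np.log(wn)[None, :]
                - 0.5 * np.log(2 * np.pi * v) - 0.5 * (y - m) ** 2 / v)
    log_post -= log_post.max(axis=(1, 2), keepdims=True)
    post = np.exp(log_post)
    post /= post.sum(axis=(1, 2), keepdims=True)
    # Conditional mean of x within each pair: prior mean plus a Wiener-style gain
    cond_mean = mus[:, None] + (vs[:, None] / v) * (y - m)
    return (post * cond_mean).sum(axis=(1, 2))
```

With a single component in each mixture, the estimate reduces to the familiar gain form mu_x + s_x / (s_x + s_n) * (y - mu_x - mu_n); with several components it becomes a posterior-weighted average of such terms, one per pair of speech and noise components.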

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Potamitis, I., Fakotakis, N., Liolios, N., Kokkinakis, G. (2002). Speech Enhancement Using Mixtures of Gaussians for Speech and Noise. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science, vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_48

  • DOI: https://doi.org/10.1007/3-540-46154-X_48

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44129-8

  • Online ISBN: 978-3-540-46154-8

  • eBook Packages: Springer Book Archive
