Skip to main content

Perceptually Motivated Generalized Spectral Subtraction for Speech Enhancement

  • Conference paper
Advances in Nonlinear Speech Processing (NOLISP 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5933))

Included in the following conference series:

  • 577 Accesses

Abstract

This paper addresses the problem of single speech enhancement in adverse environment. The common noise reduction techniques are limited by a tradeoff between an efficient noise reduction, a minimum of speech distortion and musical noise.in this work, we propose a new speech enhancement approach based on non-uniform multi-band analysis. The noisy signal is divided into a number of sub-bands using a gammachirp filter bank with non-linear ERB resolution, and the sub-bands signals are individually weighted according the generalized spectral subtraction technique. For evaluating the performance of the proposed speech enhancement, we use the perceptual evaluation measure of speech quality PESQ and the subjective quality rating designed to evaluate speech quality along three dimensions: signal distortion, noise distortion and overall quality. Subjective evaluation tests demonstrate significant improvements results over classical subtractive type algorithms, when tested with speech signal corrupted by various noises at different signal to noise ratios.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berouti, M., Schwartz, R., Makhoul, J.: Enhancement of speech corrupted by acoustic noise. In: Proc. Int. Conf. on Acoustics, Speech, Signal Processing, April 1979, pp. 208–211 (1979)

    Google Scholar 

  2. Ephraim, Y., Malah, D.: Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE transaction on acoustics speech and signal processing assp-33(2) (December 1985)

    Google Scholar 

  3. Tsoukalas, D.E., Mourjopoulos, J.N., Kokkinakis, G.: Speech enhancement based on audible noise suppression. IEEE Trans. Speech and Audio Processing 5, 497–514 (1997)

    Article  Google Scholar 

  4. Virag, N.: Single channel speech enhancement based on masking properties of the human auditory system. IEEE Trans. Speech and Audio Processing 7, 126–137 (1999)

    Article  Google Scholar 

  5. Hohmann, V.: Frequency analysis and synthesis using a Gammatone filterbank. Acta Acustica united with Acustica 88(3), 433–442 (2002)

    Google Scholar 

  6. ITU-T P.862, Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs, International Telecommunication Union, Geneva (2001)

    Google Scholar 

  7. Loizou, P.: Speech Enhancement: Theory and Practice. CRC Press, Boca Raton

    Google Scholar 

  8. Martin, R.: Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Trans. on Speech and Audio Processing 9(5), 504–512 (2001)

    Article  Google Scholar 

  9. Cohen, I., Berdugo, B.: Noise estimation by minima controlled recursive averaging for robust speech enhancement. IEEE Signal Proc. Letters 9(1), 12–15 (2002)

    Article  Google Scholar 

  10. Cohen, I.: Noise Spectrum estimation in adverse environnements: improved minima controlled recursive averaging. IEEE Trans. Speech Audio Process. 11(5), 466–475 (2003)

    Article  Google Scholar 

  11. Rangachari, S.,, P.: A noise estimation algorithm with rapid adaptation for highly non-stationary environments. In: IEEE Int. Conf. on Acoustics, Speech, signal processing, May 17-21, pp. I-305–I-308 (2004)

    Google Scholar 

  12. Irino, T., Unoki, M.: An analysis/synthesis auditory filterbank based on an IIR gammachrp filter. In: Greenberg, S., Slaney, M. (eds.) Computational models of Auditory Function. NATO ASI series, vol. 312. IOS Press, Amsterdam (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zoghlami, N., Lachiri, Z., Ellouze, N. (2010). Perceptually Motivated Generalized Spectral Subtraction for Speech Enhancement. In: Solé-Casals, J., Zaiats, V. (eds) Advances in Nonlinear Speech Processing. NOLISP 2009. Lecture Notes in Computer Science(), vol 5933. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11509-7_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-11509-7_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-11508-0

  • Online ISBN: 978-3-642-11509-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics