Abstract
The ubiquitous noise reduction / speech enhancement problem has attracted increasing interest in recent years, owing both to progress in microphone-array systems and to the successful introduction of perceptual models. In the last decade, several methods incorporating psychoacoustic criteria in single-channel speech enhancement systems have been proposed; however, very few works exploit these features in the multichannel case. In this paper we present a novel psychoacoustically motivated multichannel speech enhancement system that exploits both spatial information and psychoacoustic concepts. The proposed framework offers enhanced flexibility, allowing for a multitude of perceptually based post-filtering solutions. Moreover, the system has been devised on a frame-by-frame basis to facilitate real-time implementation. Objective performance measures and informal subjective listening tests on speech signals corrupted with real car and F-16 cockpit noise demonstrate that the proposed system reduces musical residual noise more effectively than conventional multichannel techniques.
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hussain, A., Cifani, S., Squartini, S., Piazza, F., Durrani, T. (2007). A Novel Psychoacoustically Motivated Multichannel Speech Enhancement System. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds) Verbal and Nonverbal Communication Behaviours. Lecture Notes in Computer Science, vol 4775. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76442-7_17
DOI: https://doi.org/10.1007/978-3-540-76442-7_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76441-0
Online ISBN: 978-3-540-76442-7
eBook Packages: Computer Science, Computer Science (R0)