Speech Enhancement in Short-Wave Channel Based on Empirical Mode Decomposition

Shen, Li-Ran; Yin, Qing-Bo; Li, Xue-Yao; Wang, Hui-Qiang

doi:10.1007/11753728_59

Li-Ran Shen¹⁹,
Qing-Bo Yin¹⁹,
Xue-Yao Li¹⁹ &
…
Hui-Qiang Wang¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3967))

Included in the following conference series:

International Computer Science Symposium in Russia

959 Accesses

Abstract

A novel speech enhancement method based on empirical mode decomposition is proposed. The method is a fully data driven approach. Noisy speech signal is decomposed adaptively into oscillatory components called Intrinsic Mode Functions (IMFs) using a process called sifting. The empirical mode decomposition denoising involves thresholding each IMFs. A nonlinear function is introduced for amplitude thresholding. And then reconstructs the estimated speech signal using the processed IMFs. The experimental results show significant improvement in output SNR and quality as compared to recently reported results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Martin, R.: Noise Power Spectral Density Estimation Based on Optimal Smoothing and Minimum Statistics. IEEE Trans. on Speech and Audio Processing 9, 504–512 (2001)
Article Google Scholar
Ephraim, Y., Malah, D.: Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech and Signal Processing 32, 1109–1121 (1984)
Article Google Scholar
Zheng, W.T., Cao, Z.H.: Speech enhancement based on MMSE-STSA estimation and residual noise reduction. In: 1991 IEEE Region 10 International Conference on EC3-Energy, Computer, Communication and Control Systems, vol. 3, pp. 265–268 (1991)
Google Scholar
Zhibin, L., Naiping, X.: Speech enhancement based on minimum mean-square error short-time spectral estimation and its realization. In: IEEE International conference on intelligent processing system, vol. 28, pp. 1794–1797 (1997)
Google Scholar
Lim, J.S., Oppenheim, A.V.: Enhancement and bandwidth compression of noisy speech. Proc. of the IEEE 67, 1586–1604 (1979)
Article Google Scholar
Goh, Z., Tan, K., Tan, T.: Postprocessing method for suppressing musical noise generated by spectral subtraction. IEEE Trans. Speech Audio Procs 6, 287–292 (1998)
Article Google Scholar
He, C., Zweig, Z.: Adaptive two-band spectral subtraction with multi-window spectral estimation. In: ICASSP, vol. 2, pp. 793–796 (1999)
Google Scholar
Huang, N.E.: The Empirical Mode Decomposition and the Hilbert Spectrum for Nonlinear and Non-stationary Time Series Analysis. J. Proc. R. Soc. Lond. A 454, 903–995 (1998)
Article MathSciNet MATH Google Scholar
Huang, W., Shen, Z., Huang, N.E., Fung, Y.C.: Engineering Analysis of Biological Variables: an Example of Blood Pressure over 1 Day. Proc. Natl. Acad. Sci. USA 95, 4816–4821 (1998)
Article Google Scholar
Huang, W., Shen, Z., Huang, N.E., Fung: Nonlinear Indicial Response of Complex Nonstationary Oscillations as Pulmonary Pretension Responding to Step Hypoxia. Proc. Natl. Acad. Sci, USA 96, 1833–1839 (1999)
Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

The College Of Computer Science and Technology, Harbin Engineering University, NO.145 Nantong Street, Nangang District, Harbin, China
Li-Ran Shen, Qing-Bo Yin, Xue-Yao Li & Hui-Qiang Wang

Authors

Li-Ran Shen
View author publications
You can also search for this author in PubMed Google Scholar
Qing-Bo Yin
View author publications
You can also search for this author in PubMed Google Scholar
Xue-Yao Li
View author publications
You can also search for this author in PubMed Google Scholar
Hui-Qiang Wang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

IRMAR, Université de Rennes, Campus de Beaulieu, 35042, Rennes Cedex, France
Dima Grigoriev
Intel Corporation, JF1-13, 2111 NE 25th Avenue, 97124, Hillsboro, OR, USA
John Harrison
Steklov Institute of Mathematics at St. Petersburg, 27 Fontanka, St., 191023, Petersburg, Russia
Edward A. Hirsch

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shen, LR., Yin, QB., Li, XY., Wang, HQ. (2006). Speech Enhancement in Short-Wave Channel Based on Empirical Mode Decomposition. In: Grigoriev, D., Harrison, J., Hirsch, E.A. (eds) Computer Science – Theory and Applications. CSR 2006. Lecture Notes in Computer Science, vol 3967. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11753728_59

Download citation

DOI: https://doi.org/10.1007/11753728_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34166-6
Online ISBN: 978-3-540-34168-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics