Single Channel Speech Enhancement for Mixed Non-stationary Noise Environments

Singh, Sachin; Tripathy, Manoj; Anand, R. S.

doi:10.1007/978-3-319-04960-1_47

Sachin Singh⁵,
Manoj Tripathy⁵ &
R. S. Anand⁵

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 264))

2776 Accesses
3 Citations

Abstract

Speech enhancement is very important step for improving quality and intelligibility of noisy speech signal. In practical environment more than one noise sources are present, hence it is necessary to design a technique/ algorithm that can remove mixed noises or more than one noises from single-channel speech signals. In this paper, a single channel speech enhancement method is proposed for reduction of mixed non-stationary noises. The proposed method is based on wavelet packet and ideal binary mask thresholding function for speech enhancement. Db10 mother wavelet packet transform is used for decomposition of speech signal in three levels. After decomposition of speech signal a binary mask threshold function is used to threshold the noisy coefficients from the noisy speech signal coefficients. The performance of the proposed wavelet with ideal mask method is compared with Wiener, Spectral Subtraction, p-MMSE, log-MMSE, Ideal channel selection, Ideal binary mask, hard and soft wavelet thresholding function in terms of PESQ, SNR improvement, Cepstral Distance, and frequency weighted segmental SNR. The proposed method has shown improved performance over conventional speech enhancement methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Loizou, P.C.: Speech enhancement theory and practice. CRC Press, USA (2007)
Google Scholar
Boll, S.F.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoustic, Speech, Signal Processing 113-120 (1979)
Google Scholar
Krishnamurthy, P., Prasanna, S.R.M.: Modified spectral subtraction method for enhancement of noisy speech. In: Proc. 3rd International Conference on Intelligent Sensing and Information Processing, Bangalore, India, pp. 146–150 (2005)
Google Scholar
Scalart, P., Filho, J.: Speech enhancement based on a priori signal to noise estimation. In: Proc. IEEE Int. Conf. on Acoust, Speech, Signal Processing, Atlanta, pp. 629–632 (1996)
Google Scholar
Ephraim, Y., Malah, D.: Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Trans. Audio, Speech, and Language Processing, 1109–1121 (1984)
Google Scholar
Ephraim, Y., Malah, D.: Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Trans. Audio, Speech, and Language Processing, 443–445 (1995)
Google Scholar
Loizou, P.C.: Speech enhancement based on perceptually motivated Bayesian estimators of the magnitude spectrum. IEEE Trans. Audio, Speech, and Language Processing, 857-869 (2005)
Google Scholar
Dubbelboer, F., Houtgast, T.: The concept of signal-to-noise ratio in the modulation domain and speech intelligibility. J. Acoust. Sociaty America, 3937-3947 (2008)
Google Scholar
Jorgensen, S., Dau, T.: Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing. J. Acoust. Sociaty America 1475-1487 (2011)
Google Scholar
Paliwal, K., Schwerin, B., Wojcicki, K.: Role of modulation magnitude and phase spectrum towards speech intelligibility. Speech Communication, 327-339 (2011)
Google Scholar
Wojcicki, K., Loizou, P.C.: Channel selection in the modulation domain for improved speech intelligibility in noise. J. Acoust. Sociaty America, 2904-2913 (2012)
Google Scholar
Guoshen, Y., Bacry, E., Mallat, S.: Audio signal denoising with complex wavelets and adaptive block attenuation. In: Proc. IEEE Int. Conf. Acoustic, Speech Signal Processing (ICASSP), vol. 3, pp. 869–872 (2007)
Google Scholar
Zhou, B., et al.: An improved wavelet-based speech enhancement method using adaptive block thresholding. In: IEEE Conference (2010)
Google Scholar
Sanam, T.F., Shahnaz, C.: Enhancement of noisy speech based on a custom thresholding function with a statistically determined threshold. Int. J. Speech Technology (April 2012)
Google Scholar
Donoho, D.L.: De-noising by soft thresholding. IEEE Trans. Inform. Theory 41, 613–627 (1995)
Article MATH MathSciNet Google Scholar
Donoho, D.L., Johnstone, I.M.: Ideal spatial adaptation by wavelet shrinkage. Biometrika 81, 425–455 (1994)
Article MATH MathSciNet Google Scholar
Yasser, G., Karami, M.R.: A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets. Speech Communication 48, 927–940 (2006)
Article Google Scholar
Rangachari, S., Loizou, P.C.: A noise-estimation algorithm for highly non-stationary environments. Speech Communication 48, 220–231 (2006)
Google Scholar
Prahallad, K., Kumar, E.N., Keri, V.: The IIIT-H Indic Speech Databases. In: Proceedings of Interspeech, Portland, Oregon, USA (2012), http://speech.iiit.ac.in/index.php/research-svl/69.html
Varga, P., Steeneken, H.J.M.: Technical report, DRA Speech Research Unit) (1992), http://www.speech.cs.cmu.edu/comp.speech/Sect-ion1/Data/noisex.html

Download references

Author information

Authors and Affiliations

Indian Institute of Technology Roorkee, Roorkee, India
Sachin Singh, Manoj Tripathy & R. S. Anand

Authors

Sachin Singh
View author publications
You can also search for this author in PubMed Google Scholar
Manoj Tripathy
View author publications
You can also search for this author in PubMed Google Scholar
R. S. Anand
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sachin Singh .

Editor information

Editors and Affiliations

Technopark Campus, Indian Institute of Information Technology and Management – Kerala (IIITM-K), Trivandrum, Kerala, India
Sabu M. Thampi
Center for Computing Research, National Polytechnic Institute, Mexico City, Mexico
Alexander Gelbukh
Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur, India
Jayanta Mukhopadhyay

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Singh, S., Tripathy, M., Anand, R.S. (2014). Single Channel Speech Enhancement for Mixed Non-stationary Noise Environments. In: Thampi, S., Gelbukh, A., Mukhopadhyay, J. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 264. Springer, Cham. https://doi.org/10.1007/978-3-319-04960-1_47

Download citation

DOI: https://doi.org/10.1007/978-3-319-04960-1_47
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04959-5
Online ISBN: 978-3-319-04960-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics