Abstract
This paper focuses on the development of a single-channel speech enhancement method. Conventional filtering-based noise reduction methods, such as Wiener filtering and masking, use spectral magnitudes obtained from a time-frequency representation of the noisy speech signal. Here, the speech signal is analyzed using sinusoidal modelling, and a filter gain for masking the background noise is developed from the sinusoidal components. The performance of the developed system is evaluated using Perceptual Evaluation of Speech Quality (PESQ). Experiments show that the proposed approach performs better than existing approaches.
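The abstract does not give the gain function itself, so the following is only a minimal sketch of the general idea of masking based on sinusoidal components, not the authors' method. It assumes that sinusoidal components are taken as spectral peaks of a single Hann-windowed frame, that the noise magnitude per bin is already known, and that a Wiener-like gain is applied around each peak. The names `dft_magnitudes`, `pick_peaks`, `sinusoidal_mask`, and the parameters `floor` and `width` are hypothetical.

```python
import cmath
import math


def dft_magnitudes(frame):
    """Magnitude spectrum (bins 0..N/2) of a Hann-windowed frame, via a plain DFT."""
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
                for i, x in enumerate(frame)]
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                for i, x in enumerate(windowed)))
        for k in range(n // 2 + 1)
    ]


def pick_peaks(mag, noise_mag):
    """Bins that are local maxima and exceed the (assumed known) noise magnitude."""
    return [
        k for k in range(1, len(mag) - 1)
        if mag[k] > mag[k - 1] and mag[k] >= mag[k + 1] and mag[k] > noise_mag[k]
    ]


def sinusoidal_mask(mag, noise_mag, floor=0.1, width=1):
    """Per-bin gain: Wiener-like gain around each peak, `floor` everywhere else.

    Assumption: this gain rule is illustrative and is not taken from the paper.
    """
    gains = [floor] * len(mag)
    for p in pick_peaks(mag, noise_mag):
        for k in range(max(0, p - width), min(len(mag), p + width + 1)):
            power = mag[k] ** 2
            wiener = max(power - noise_mag[k] ** 2, 0.0) / (power + 1e-12)
            gains[k] = max(gains[k], wiener)
    return gains


# Usage: one 64-sample frame holding a single tone centred on bin 8.
n = 64
frame = [math.cos(2 * math.pi * 8 * i / n) for i in range(n)]
mag = dft_magnitudes(frame)
noise_mag = [1.0] * len(mag)  # assumed flat noise magnitude estimate
gains = sinusoidal_mask(mag, noise_mag)
```

In this toy frame the only detected component is the tone itself, so the mask passes the bins around it almost unchanged and attenuates the remaining bins to `floor`. A real system would process overlapping frames, estimate the noise magnitude from the signal, and resynthesize the enhanced speech.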
Acknowledgement
This work is supported by the Science and Engineering Research Board (SERB), Department of Science and Technology (DST), Government of India, under file no. EEQ/2018/001338, dated 27 February 2019.
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Buragohain, R., Reddy, R.A., Venkatesh, Y., Prabhakar, G.A., Rao, C.V.R. (2022). Single Channel Speech Enhancement Using Masking Based on Sinusoidal Modeling. In: Santosh, K., Hegadi, R., Pal, U. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2021. Communications in Computer and Information Science, vol 1576. Springer, Cham. https://doi.org/10.1007/978-3-031-07005-1_28
DOI: https://doi.org/10.1007/978-3-031-07005-1_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-07004-4
Online ISBN: 978-3-031-07005-1
eBook Packages: Computer Science, Computer Science (R0)