Single Channel Speech Enhancement Using Masking Based on Sinusoidal Modeling

Buragohain, Rantu; Reddy, R. Aditya; Venkatesh, Yenduri; Prabhakar, Gudmalwar Ashishkumar; Rao, Ch. V. Rama

doi:10.1007/978-3-031-07005-1_28

Rantu Buragohain⁸,
R. Aditya Reddy⁸,
Yenduri Venkatesh⁸,
Gudmalwar Ashishkumar Prabhakar⁸ &
…
Ch. V. Rama Rao⁸

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1576))

Included in the following conference series:

International Conference on Recent Trends in Image Processing and Pattern Recognition

384 Accesses

Abstract

This paper focused on development of single channel speech enhancement method. Conventional noise reduction methods based on filtering like Wiener filtering and masking uses spectral magnitudes. These magnitudes are obtained from time-frequency representation of noisy speech signals. Here, speech signal is analyzed using sinusoidal modelling. Filter gain is developed for masking of the background noise based on sinusoidal components. The developed system’s performance is evaluated using Perceptual Evaluation of Speech Quality (PESQ). It is evident from experiments that proposed approach displaying better performance compared to existing approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Benesty, J., Makino, S., Chen, J.: Speech Enhancement. Springer, Cham (2005)
Google Scholar
Boll, S.F.: Supression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. ASSP 27, 113–120 (1979)
Article Google Scholar
Ephraim, Y., Malah, D.: Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Trans. Acoust. Speech Signal Process. 32(6), 1109–1121 (1984)
Article Google Scholar
Ephraim, Y., Malah, D.: Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Trans. Acoust. Speech Signal Process. 33(1), 443–445 (1985)
Article Google Scholar
Lim, J., Oppenheim, A.: All-pole modeling of degraded speech. IEEE Trans. Acoust. Speech Signal Process. ASSP 26(3), 197–210 (1978)
Article Google Scholar
Cohen, I.: Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging. IEEE Trans. Speech Audio Prcoess. 9(1), 12–15 (2002)
Google Scholar
Cohen, I.: Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging. IEEE Trans. Speech Audio Prcoess. 11(5), 466–475 (2003)
Article Google Scholar
McAulay, R.J., Quatieri, T.F.: Speech analysis-synthesis based on a sinusoidal representation. IEEE Trans. Acoust. Speech Signal Process. ASSP 34, pp. 744–754, Lincoln Lab., M.I.T (1986)
Google Scholar
Rao, C.V.R., Murthy, M.R., Rao, K.S.: Speech enhancement using sub-band cross-correlation compensated Wiener filter combined with harmonic regeneration. Int. J. Electron. Commun. 66(6), 459–464 (2012)
Article Google Scholar
Lim, J.S., Oppenheim, A.V.: Enhancement and bandwidth compression of noisy speech. Proc. IEEE 67(12), 1586–604 (1979)
Article Google Scholar
McAulay, R.J., Quatieri, T.F.: Magnitude only reconstruction using a sinusoidal speech model. In: Proceedings International Conference on Acoustics, Speech, and Signal Processing, pp. 27.6.1-27.6.4 (1985)
Google Scholar
http://www.utdallas.edu/loizou/speech/oizeus/
Rix, A., Beerends, J., Hollier, M., Hekstra, A.: Perceptual evaluation of speech quality (PESQ) - a new method for speech quality assessment of telephone networks and codecs. In: Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing, vol 2, pp. 749–52 (2001)
Google Scholar
Berouti, M., Schwartz, R., Makhoul, J.: Enhancement of speech corrupted by acoustic noise. In: Proceedings International Conference on Acoustics, Speech, and Signal Processing, pp. 208–211 (1979)
Google Scholar

Download references

Acknowledgement

This work is supported by the Science and Engineering Research Board (SERB), Department of Science and Technology (DST), Government of India and, file no. is EEQ/2018/001338, dated 27th February 2019.

Author information

Authors and Affiliations

National Institute of Technology Meghalaya, Shillong, 793003, India
Rantu Buragohain, R. Aditya Reddy, Yenduri Venkatesh, Gudmalwar Ashishkumar Prabhakar & Ch. V. Rama Rao

Authors

Rantu Buragohain
View author publications
You can also search for this author in PubMed Google Scholar
R. Aditya Reddy
View author publications
You can also search for this author in PubMed Google Scholar
Yenduri Venkatesh
View author publications
You can also search for this author in PubMed Google Scholar
Gudmalwar Ashishkumar Prabhakar
View author publications
You can also search for this author in PubMed Google Scholar
Ch. V. Rama Rao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ch. V. Rama Rao .

Editor information

Editors and Affiliations

University of South Dakota, Vermillion, SD, USA
KC Santosh
Central University of Karnataka, Gulbarga, India
Ravindra Hegadi
Indian Statistical Institute, Kolkata, India
Umapada Pal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Buragohain, R., Reddy, R.A., Venkatesh, Y., Prabhakar, G.A., Rao, C.V.R. (2022). Single Channel Speech Enhancement Using Masking Based on Sinusoidal Modeling. In: Santosh, K., Hegadi, R., Pal, U. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2021. Communications in Computer and Information Science, vol 1576. Springer, Cham. https://doi.org/10.1007/978-3-031-07005-1_28

Download citation

DOI: https://doi.org/10.1007/978-3-031-07005-1_28
Published: 22 May 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-07004-4
Online ISBN: 978-3-031-07005-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Single Channel Speech Enhancement Using Masking Based on Sinusoidal Modeling