Skip to main content

Single Channel Speech Enhancement Using Masking Based on Sinusoidal Modeling

  • Conference paper
  • First Online:
Recent Trends in Image Processing and Pattern Recognition (RTIP2R 2021)

Abstract

This paper focused on development of single channel speech enhancement method. Conventional noise reduction methods based on filtering like Wiener filtering and masking uses spectral magnitudes. These magnitudes are obtained from time-frequency representation of noisy speech signals. Here, speech signal is analyzed using sinusoidal modelling. Filter gain is developed for masking of the background noise based on sinusoidal components. The developed system’s performance is evaluated using Perceptual Evaluation of Speech Quality (PESQ). It is evident from experiments that proposed approach displaying better performance compared to existing approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Benesty, J., Makino, S., Chen, J.: Speech Enhancement. Springer, Cham (2005)

    Google Scholar 

  2. Boll, S.F.: Supression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. ASSP 27, 113–120 (1979)

    Article  Google Scholar 

  3. Ephraim, Y., Malah, D.: Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Trans. Acoust. Speech Signal Process. 32(6), 1109–1121 (1984)

    Article  Google Scholar 

  4. Ephraim, Y., Malah, D.: Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Trans. Acoust. Speech Signal Process. 33(1), 443–445 (1985)

    Article  Google Scholar 

  5. Lim, J., Oppenheim, A.: All-pole modeling of degraded speech. IEEE Trans. Acoust. Speech Signal Process. ASSP 26(3), 197–210 (1978)

    Article  Google Scholar 

  6. Cohen, I.: Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging. IEEE Trans. Speech Audio Prcoess. 9(1), 12–15 (2002)

    Google Scholar 

  7. Cohen, I.: Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging. IEEE Trans. Speech Audio Prcoess. 11(5), 466–475 (2003)

    Article  Google Scholar 

  8. McAulay, R.J., Quatieri, T.F.: Speech analysis-synthesis based on a sinusoidal representation. IEEE Trans. Acoust. Speech Signal Process. ASSP 34, pp. 744–754, Lincoln Lab., M.I.T (1986)

    Google Scholar 

  9. Rao, C.V.R., Murthy, M.R., Rao, K.S.: Speech enhancement using sub-band cross-correlation compensated Wiener filter combined with harmonic regeneration. Int. J. Electron. Commun. 66(6), 459–464 (2012)

    Article  Google Scholar 

  10. Lim, J.S., Oppenheim, A.V.: Enhancement and bandwidth compression of noisy speech. Proc. IEEE 67(12), 1586–604 (1979)

    Article  Google Scholar 

  11. McAulay, R.J., Quatieri, T.F.: Magnitude only reconstruction using a sinusoidal speech model. In: Proceedings International Conference on Acoustics, Speech, and Signal Processing, pp. 27.6.1-27.6.4 (1985)

    Google Scholar 

  12. http://www.utdallas.edu/loizou/speech/oizeus/

  13. Rix, A., Beerends, J., Hollier, M., Hekstra, A.: Perceptual evaluation of speech quality (PESQ) - a new method for speech quality assessment of telephone networks and codecs. In: Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing, vol 2, pp. 749–52 (2001)

    Google Scholar 

  14. Berouti, M., Schwartz, R., Makhoul, J.: Enhancement of speech corrupted by acoustic noise. In: Proceedings International Conference on Acoustics, Speech, and Signal Processing, pp. 208–211 (1979)

    Google Scholar 

Download references

Acknowledgement

This work is supported by the Science and Engineering Research Board (SERB), Department of Science and Technology (DST), Government of India and, file no. is EEQ/2018/001338, dated 27th February 2019.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ch. V. Rama Rao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Buragohain, R., Reddy, R.A., Venkatesh, Y., Prabhakar, G.A., Rao, C.V.R. (2022). Single Channel Speech Enhancement Using Masking Based on Sinusoidal Modeling. In: Santosh, K., Hegadi, R., Pal, U. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2021. Communications in Computer and Information Science, vol 1576. Springer, Cham. https://doi.org/10.1007/978-3-031-07005-1_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-07005-1_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-07004-4

  • Online ISBN: 978-3-031-07005-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics