Skip to main content

Modulation Spectral Features for Intrusive Measurement of Reverberant Speech Quality

  • Conference paper
  • First Online:
Book cover Digital TV and Multimedia Communication (IFTC 2018)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1009))

  • 864 Accesses

Abstract

Temporal fine structure includes temporal envelope and fine structure which also called carrier, and instantaneous frequency is the partial derivative of the carrier. An intrusive reverberant speech quality measurement is investigated with the representation of corresponding instantaneous frequency. A Gammatone filterbank is used to simulate auditory mechanism and a modulation filterbank is used to improve frequency resolution. The mean mutual information between reference and reverberant modulation spectral instantaneous frequency probability distribution is taken as the final measurement score. Experimental results show the proposed method outperforming two benchmark algorithms in some practical application conditions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kokkinakis, K., Loizou, P.C.: The impact of reverberant self-masking and overlap-masking effects on speech intelligibility by cochlear implant listeners (L). J. Acoust. Soc. Am. 130(3), 1099–1102 (2011)

    Article  Google Scholar 

  2. Kinoshita, K., Delcroix, M., Yoshioka, T., et al.: The REVERB challenge: a common evaluation framework for dereverberation and recognition of reverberant speech. In: IEEE WASPAA, pp. 1–4 (2013)

    Google Scholar 

  3. Hazrati, O., Loizou, P.C.: The combined effects of reverberation and noise on speech intelligibility by cochlear implant listeners. Int. J. Audiol. 51(6), 437–443 (2012)

    Article  Google Scholar 

  4. ITU-T P. 862 Perceptual Evaluation of Speech Quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs (2000)

    Google Scholar 

  5. Falk, T.H., Chan, W.-Y.: A non-intrusive quality measure of dereverberated speech. In: Proceedings of International Workshop on Acoustic Echo and Noise Control (2008)

    Google Scholar 

  6. Falk, T.H., Chan, W.-Y.: Temporal dynamic for blind measurement of room acoustical parameters. IEEE Trans. Instrum. Meas. 59(4), 978–989 (2010)

    Article  Google Scholar 

  7. Kokkinakis, K., Loizou, P.C.: Evaluation of objective measures for quality assessment of reverberant speech. In: IEEE ICASSP, pp. 2420–2423 (2011)

    Google Scholar 

  8. Ma, S., Xie, X.: Blind estimation of spectral standard deviation from room impulse response for reverberation level recognition based on linear prediction. Commun. Comput. Inform. Sci. 685, 231–241 (2017)

    Article  Google Scholar 

  9. Ma, S., Li, H., Zhang, H., et al.: Reverberation level recognition by formants based on 10-fold cross validation of GMM. Commun. Comput. Inform. Sci. 815, 161–171 (2018)

    Article  Google Scholar 

  10. Moon, I.J., Hong, S.H.: What is temporal fine structure and why is it important? Korean J. Audiol. 18(1), 1–7 (2014)

    Article  Google Scholar 

  11. Vijayan, K., Reddy, P.R., Murty, K.S.R.: Significance of analytic phase of speech signals in speaker verification. Speech Commun. 81, 54–71 (2016)

    Article  Google Scholar 

  12. Tu, A.: Reverberation simulation from impulse response using the image source model (2014)

    Google Scholar 

  13. Habets, E.A.P., Gannot, S., Cohen, I.: Late reverberation spectral variance estimation based on a statistical model. IEEE Signal Process. Lett. 16(9), 770–773 (2009)

    Article  Google Scholar 

  14. Kuttruff, H.: Room Acoustics, 4th edn. Elsevier, London (2000)

    Google Scholar 

  15. Schroeder, M.: New method of measuring reverberation time. J. Acoust. Soc. Am. 37(3), 409–412 (1965)

    Article  Google Scholar 

  16. Del Vallado, J.M.F., De Lima, A.A., Prego, T.D.M., et al.: Feature analysis for the reverberation perception in speech signals. In: IEEE ICASSP, 8169–8173 (2013)

    Google Scholar 

  17. ISO 3382–1 Acoustics C Measurement of Room Acoustic Parameters C Part 1: Performances Spaces (2009)

    Google Scholar 

  18. Allen, J.B., Berkley, D.A.: Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 65(4), 943–950 (1979)

    Article  Google Scholar 

  19. Patterson, R.D., Robinson, K., Holdsworth, J., et al.: Complex sounds and auditory images. In: Auditory Physiology & Perception, pp. 429–446 (1992)

    Google Scholar 

  20. Slaney, M.: An efficient implementation of the Patterson-Holdsworth auditory filter bank. Apple Computer Technical report #35, Perception Group C Advanced Technology Group (1993)

    Google Scholar 

  21. Schimmel, S.M., Atlas, L.E., Nie, K.: Feasibility of single channel speaker separation based on modulation frequency analysis. In: IEEE ICASSP, vol. 4, pp. 605–608 (2007)

    Google Scholar 

Download references

Acknowledgments

This research is supported by the Fundamental Research Funds for the Central Universities (2018XNG1810). The authors thank Prof. W.-Y. Chan for his guidance for this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sai Ma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ma, S., Zhang, H., Xie, L., Xie, X. (2019). Modulation Spectral Features for Intrusive Measurement of Reverberant Speech Quality. In: Zhai, G., Zhou, J., An, P., Yang, X. (eds) Digital TV and Multimedia Communication. IFTC 2018. Communications in Computer and Information Science, vol 1009. Springer, Singapore. https://doi.org/10.1007/978-981-13-8138-6_24

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-8138-6_24

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-8137-9

  • Online ISBN: 978-981-13-8138-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics