Modulation Spectral Features for Intrusive Measurement of Reverberant Speech Quality

Ma, Sai; Zhang, Hui; Xie, Lingyun; Xie, Xi

doi:10.1007/978-981-13-8138-6_24

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1009)

Included in the following conference series:

International Forum on Digital TV and Wireless Multimedia Communications

Abstract

The temporal structure of a band-limited speech signal comprises a temporal envelope and a temporal fine structure, the latter also called the carrier; the instantaneous frequency is the time derivative of the carrier phase. This paper investigates an intrusive measure of reverberant speech quality based on the instantaneous frequency. A Gammatone filterbank simulates the auditory mechanism, and a modulation filterbank improves frequency resolution. The final score is the mean mutual information between the probability distributions of the reference and reverberant modulation-spectral instantaneous frequency. Experimental results show that the proposed method outperforms two benchmark algorithms under some practical application conditions.
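
The two core quantities named in the abstract, instantaneous frequency and mutual information, can be illustrated with a minimal sketch. It is not the authors' implementation: the Gammatone and modulation filterbanks are omitted, the function names (`analytic_signal`, `instantaneous_frequency`, `mutual_information`) are hypothetical, and the histogram-based estimator with equal-width bins is an assumption, since the abstract does not say how the probability distributions are estimated.

```python
import cmath
import math
from collections import Counter


def analytic_signal(x):
    """Analytic signal via a naive O(N^2) DFT (adequate for short frames)."""
    n = len(x)
    spectrum = [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
                for k in range(n)]
    # Keep DC (and Nyquist for even n), double positive bins, zero negative bins.
    for k in range(n):
        if k == 0 or (n % 2 == 0 and k == n // 2):
            continue
        spectrum[k] *= 2.0 if k < (n + 1) // 2 else 0.0
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def instantaneous_frequency(x, fs):
    """Instantaneous frequency in Hz as the discrete derivative of the carrier phase."""
    z = analytic_signal(x)
    # Phase increment between consecutive samples, wrapped to (-pi, pi].
    return [cmath.phase(z[t + 1] * z[t].conjugate()) * fs / (2 * math.pi)
            for t in range(len(z) - 1)]


def mutual_information(a, b, bins=16):
    """Mutual information (bits) of two equal-length sequences from a joint histogram.

    Assumption: equal-width bins over the common value range of a and b.
    """
    lo, hi = min(min(a), min(b)), max(max(a), max(b))
    width = (hi - lo) / bins or 1.0  # guard against constant input

    def quantize(v):
        return min(int((v - lo) / width), bins - 1)

    pairs = [(quantize(u), quantize(v)) for u, v in zip(a, b)]
    n = len(pairs)
    joint = Counter(pairs)
    marg_a = Counter(i for i, _ in pairs)
    marg_b = Counter(j for _, j in pairs)
    # I(A;B) = sum p(i,j) * log2( p(i,j) / (p(i) p(j)) )
    return sum(c / n * math.log2(c * n / (marg_a[i] * marg_b[j]))
               for (i, j), c in joint.items())
```

Under these assumptions, a score in the spirit of the abstract would apply `instantaneous_frequency` to each filterbank output of the reference and reverberant signals, compute `mutual_information` per band, and average the results across bands.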

Acknowledgments

This research is supported by the Fundamental Research Funds for the Central Universities (2018XNG1810). The authors thank Prof. W.-Y. Chan for his guidance on this research.

Author information

Authors and Affiliations

Communication University of China, Beijing, China
Sai Ma, Hui Zhang, Lingyun Xie & Xi Xie

Corresponding author

Correspondence to Sai Ma.

Editor information

Editors and Affiliations

Shanghai Jiao Tong University, Shanghai, China
Guangtao Zhai
Shanghai Jiao Tong University, Shanghai, China
Jun Zhou
Shanghai University, Shanghai, China
Ping An
Shanghai Jiao Tong University, Shanghai, China
Xiaokang Yang

About this paper

Cite this paper

Ma, S., Zhang, H., Xie, L., Xie, X. (2019). Modulation Spectral Features for Intrusive Measurement of Reverberant Speech Quality. In: Zhai, G., Zhou, J., An, P., Yang, X. (eds) Digital TV and Multimedia Communication. IFTC 2018. Communications in Computer and Information Science, vol 1009. Springer, Singapore. https://doi.org/10.1007/978-981-13-8138-6_24

DOI: https://doi.org/10.1007/978-981-13-8138-6_24
Published: 11 May 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8137-9
Online ISBN: 978-981-13-8138-6
eBook Packages: Computer Science, Computer Science (R0)
