Abstract
Temporal fine structure includes temporal envelope and fine structure which also called carrier, and instantaneous frequency is the partial derivative of the carrier. An intrusive reverberant speech quality measurement is investigated with the representation of corresponding instantaneous frequency. A Gammatone filterbank is used to simulate auditory mechanism and a modulation filterbank is used to improve frequency resolution. The mean mutual information between reference and reverberant modulation spectral instantaneous frequency probability distribution is taken as the final measurement score. Experimental results show the proposed method outperforming two benchmark algorithms in some practical application conditions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kokkinakis, K., Loizou, P.C.: The impact of reverberant self-masking and overlap-masking effects on speech intelligibility by cochlear implant listeners (L). J. Acoust. Soc. Am. 130(3), 1099–1102 (2011)
Kinoshita, K., Delcroix, M., Yoshioka, T., et al.: The REVERB challenge: a common evaluation framework for dereverberation and recognition of reverberant speech. In: IEEE WASPAA, pp. 1–4 (2013)
Hazrati, O., Loizou, P.C.: The combined effects of reverberation and noise on speech intelligibility by cochlear implant listeners. Int. J. Audiol. 51(6), 437–443 (2012)
ITU-T P. 862 Perceptual Evaluation of Speech Quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs (2000)
Falk, T.H., Chan, W.-Y.: A non-intrusive quality measure of dereverberated speech. In: Proceedings of International Workshop on Acoustic Echo and Noise Control (2008)
Falk, T.H., Chan, W.-Y.: Temporal dynamic for blind measurement of room acoustical parameters. IEEE Trans. Instrum. Meas. 59(4), 978–989 (2010)
Kokkinakis, K., Loizou, P.C.: Evaluation of objective measures for quality assessment of reverberant speech. In: IEEE ICASSP, pp. 2420–2423 (2011)
Ma, S., Xie, X.: Blind estimation of spectral standard deviation from room impulse response for reverberation level recognition based on linear prediction. Commun. Comput. Inform. Sci. 685, 231–241 (2017)
Ma, S., Li, H., Zhang, H., et al.: Reverberation level recognition by formants based on 10-fold cross validation of GMM. Commun. Comput. Inform. Sci. 815, 161–171 (2018)
Moon, I.J., Hong, S.H.: What is temporal fine structure and why is it important? Korean J. Audiol. 18(1), 1–7 (2014)
Vijayan, K., Reddy, P.R., Murty, K.S.R.: Significance of analytic phase of speech signals in speaker verification. Speech Commun. 81, 54–71 (2016)
Tu, A.: Reverberation simulation from impulse response using the image source model (2014)
Habets, E.A.P., Gannot, S., Cohen, I.: Late reverberation spectral variance estimation based on a statistical model. IEEE Signal Process. Lett. 16(9), 770–773 (2009)
Kuttruff, H.: Room Acoustics, 4th edn. Elsevier, London (2000)
Schroeder, M.: New method of measuring reverberation time. J. Acoust. Soc. Am. 37(3), 409–412 (1965)
Del Vallado, J.M.F., De Lima, A.A., Prego, T.D.M., et al.: Feature analysis for the reverberation perception in speech signals. In: IEEE ICASSP, 8169–8173 (2013)
ISO 3382–1 Acoustics C Measurement of Room Acoustic Parameters C Part 1: Performances Spaces (2009)
Allen, J.B., Berkley, D.A.: Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 65(4), 943–950 (1979)
Patterson, R.D., Robinson, K., Holdsworth, J., et al.: Complex sounds and auditory images. In: Auditory Physiology & Perception, pp. 429–446 (1992)
Slaney, M.: An efficient implementation of the Patterson-Holdsworth auditory filter bank. Apple Computer Technical report #35, Perception Group C Advanced Technology Group (1993)
Schimmel, S.M., Atlas, L.E., Nie, K.: Feasibility of single channel speaker separation based on modulation frequency analysis. In: IEEE ICASSP, vol. 4, pp. 605–608 (2007)
Acknowledgments
This research is supported by the Fundamental Research Funds for the Central Universities (2018XNG1810). The authors thank Prof. W.-Y. Chan for his guidance for this research.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ma, S., Zhang, H., Xie, L., Xie, X. (2019). Modulation Spectral Features for Intrusive Measurement of Reverberant Speech Quality. In: Zhai, G., Zhou, J., An, P., Yang, X. (eds) Digital TV and Multimedia Communication. IFTC 2018. Communications in Computer and Information Science, vol 1009. Springer, Singapore. https://doi.org/10.1007/978-981-13-8138-6_24
Download citation
DOI: https://doi.org/10.1007/978-981-13-8138-6_24
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8137-9
Online ISBN: 978-981-13-8138-6
eBook Packages: Computer ScienceComputer Science (R0)