Abstract
This paper addresses the detection of glottal closure instants (GCIs) using the multi-scale product (MP) of the derivative glottal waveform signal. Following the source-filter model, the derivative glottal waveform signal is estimated by inverse filtering the non-pre-emphasized speech signal with its linear prediction (LP) coefficients. This signal represents the actual excitation of the vocal tract and exhibits discontinuities at the GCIs, which the MP, acting as a discontinuity detector, brings out. A preprocessing step is added to improve GCI detection. The method is evaluated on the Keele University database and compared with the MP applied directly to the speech signal. With the preprocessing step, the MP applied to the derivative glottal waveform signal achieves an identification rate of 99.21 % and an accuracy to within \(\pm \)0.25 ms of 87.32 %, versus an identification rate of 99.15 % and an accuracy to within \(\pm \)0.25 ms of 75.78 % for the MP applied directly to the speech signal.
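The two stages named in the abstract, LP inverse filtering followed by a multi-scale product, can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' implementation: the function names are hypothetical, the LP coefficients are computed once over the whole signal by the autocorrelation method (a practical system would typically work frame by frame), the wavelet is a sampled first derivative of a Gaussian, and the scales 2, 4 and 8 are arbitrary choices. The preprocessing step and the final GCI picking are not reproduced.

```python
import math

def lp_coefficients(signal, order):
    """Autocorrelation-method LP via Levinson-Durbin.
    Returns the prediction-error filter [1, a1, ..., ap]."""
    n = len(signal)
    r = [sum(signal[i] * signal[i + lag] for i in range(n - lag))
         for lag in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        if err <= 0.0:  # degenerate (e.g. all-zero) input: stop early
            break
        acc = r[m] + sum(a[k] * r[m - k] for k in range(1, m))
        k_m = -acc / err  # reflection coefficient
        prev = a[:]
        for k in range(1, m):
            a[k] = prev[k] + k_m * prev[m - k]
        a[m] = k_m
        err *= 1.0 - k_m * k_m
    return a

def inverse_filter(signal, a):
    """Apply the FIR filter A(z) to the signal (no pre-emphasis),
    giving the LP residual used here as the excitation estimate."""
    return [sum(a[k] * signal[n - k] for k in range(len(a)) if n - k >= 0)
            for n in range(len(signal))]

def dog_kernel(scale):
    """Sampled first derivative of a Gaussian with sigma = scale
    (assumed wavelet; the source does not specify one)."""
    half = 4 * scale
    return [-t / (scale ** 2) * math.exp(-t * t / (2.0 * scale * scale))
            for t in range(-half, half + 1)]

def convolve_same(x, h):
    """Zero-padded convolution y[i] = sum_t h(t) x[i - t], same length as x."""
    half = len(h) // 2
    n = len(x)
    y = [0.0] * n
    for i in range(n):
        total = 0.0
        for j, hj in enumerate(h):
            idx = i - (j - half)
            if 0 <= idx < n:
                total += hj * x[idx]
        y[i] = total
    return y

def multiscale_product(x, scales=(2, 4, 8)):
    """Pointwise product of the wavelet responses at several scales.
    Responses to a discontinuity line up across scales and reinforce
    each other; responses to noise generally do not."""
    mp = [1.0] * len(x)
    for s in scales:
        w = convolve_same(x, dog_kernel(s))
        mp = [p * v for p, v in zip(mp, w)]
    return mp
```

For a speech signal `s`, one possible use is `multiscale_product(inverse_filter(s, lp_coefficients(s, order)))`, where `order` is left to the reader. Zero padding at the signal boundaries itself acts as a discontinuity, so values near the first and last samples should be ignored.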
© 2016 Springer International Publishing Switzerland
About this chapter
Smidi, G., Bouzid, A., Ellouze, N. (2016). Glottal Closure Instant Detection by the Multi-scale Product of the Derivative Glottal Waveform Signal. In: Esposito, A., et al. Recent Advances in Nonlinear Speech Processing. Smart Innovation, Systems and Technologies, vol 48. Springer, Cham. https://doi.org/10.1007/978-3-319-28109-4_19
DOI: https://doi.org/10.1007/978-3-319-28109-4_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28107-0
Online ISBN: 978-3-319-28109-4
eBook Packages: Engineering, Engineering (R0)