Glottal Closure Instant Detection by the Multi-scale Product of the Derivative Glottal Waveform Signal

Recent Advances in Nonlinear Speech Processing

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 48)

Abstract

This paper addresses the detection of glottal closure instants (GCIs) using the multi-scale product (MP) of the derivative glottal waveform signal. Following the source-filter model, the derivative glottal waveform is estimated by inverse filtering the non-pre-emphasized speech signal with its linear prediction (LP) coefficients. This signal represents the actual excitation of the vocal tract and exhibits discontinuities at the GCIs, and the MP acts as a discontinuity detector. A preprocessing step is added to improve GCI detection. The method is evaluated on the Keele University database and compared with the MP applied directly to the speech signal. With the preprocessing step, the MP applied to the derivative glottal waveform achieves an identification rate of 99.21 % and an accuracy to within \(\pm 0.25\) ms of 87.32 %, versus an identification rate of 99.15 % and an accuracy to within \(\pm 0.25\) ms of 75.78 % for the MP applied directly to the speech signal.
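
The two building blocks named in the abstract, LP inverse filtering and the multi-scale product, can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. All function names are hypothetical, the wavelet (first derivative of a Gaussian) and the scales (1, 2, 4 samples) are assumptions not stated in the abstract, LP analysis is run once over the whole signal rather than frame by frame, and the preprocessing step is not reproduced.

```python
import math


def lpc(signal, order):
    """Autocorrelation-method LP coefficients via Levinson-Durbin.

    Returns a = [1, a1, ..., ap] such that the prediction residual is
    e[n] = sum_k a[k] * s[n - k].
    """
    n = len(signal)
    r = [sum(signal[i] * signal[i + k] for i in range(n - k))
         for k in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # degenerate (e.g. all-zero) input: stop early
            break
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        updated = a[:]
        for j in range(1, i):
            updated[j] = a[j] + k * a[i - j]
        updated[i] = k
        a = updated
        err *= 1.0 - k * k
    return a


def inverse_filter(signal, a):
    """Apply the FIR inverse filter A(z) to obtain the LP residual."""
    return [sum(a[k] * signal[n - k] for k in range(len(a)) if n - k >= 0)
            for n in range(len(signal))]


def dog_kernel(scale):
    """First derivative of a Gaussian, sampled on [-4*scale, 4*scale] (assumed wavelet)."""
    half = int(4 * scale)
    return [-t / scale ** 2 * math.exp(-t * t / (2.0 * scale ** 2))
            for t in range(-half, half + 1)]


def convolve_same(x, h):
    """Zero-padded convolution returning an output the same length as x."""
    half = len(h) // 2
    out = []
    for n in range(len(x)):
        acc = 0.0
        for j, hj in enumerate(h):
            idx = n - (j - half)
            if 0 <= idx < len(x):
                acc += hj * x[idx]
        out.append(acc)
    return out


def multiscale_product(x, scales=(1, 2, 4)):
    """Pointwise product of the wavelet responses of x at the given scales."""
    mp = [1.0] * len(x)
    for s in scales:
        w = convolve_same(x, dog_kernel(s))
        mp = [m * v for m, v in zip(mp, w)]
    return mp


def mp_of_lp_residual(speech, order=12):
    """MP of the inverse-filtered signal (single analysis frame, for brevity)."""
    return multiscale_product(inverse_filter(speech, lpc(speech, order)))
```

Multiplying the responses across scales reinforces extrema that persist from scale to scale, which is the behaviour expected at a discontinuity, and attenuates responses that appear at only one scale. With an odd number of scales the sign of the discontinuity is preserved in the product.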

Corresponding author

Correspondence to Ghaya Smidi.

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Smidi, G., Bouzid, A., Ellouze, N. (2016). Glottal Closure Instant Detection by the Multi-scale Product of the Derivative Glottal Waveform Signal. In: Esposito, A., et al. Recent Advances in Nonlinear Speech Processing. Smart Innovation, Systems and Technologies, vol 48. Springer, Cham. https://doi.org/10.1007/978-3-319-28109-4_19

  • DOI: https://doi.org/10.1007/978-3-319-28109-4_19

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-28107-0

  • Online ISBN: 978-3-319-28109-4

  • eBook Packages: Engineering, Engineering (R0)