Skip to main content

Estimating the Dispersion of the Biometric Glottal Signature in Continuous Speech

  • Conference paper
Advances in Nonlinear Speech Processing (NOLISP 2007)

Abstract

The biometric voice signature may be derived from voice as a whole, or from the separate vocal tract and glottal source after inverse filtering extraction. This last approach has been used by the authors in early work, where it has been shown that the biometric signature obtained from the glottal source provides a good description of speaker’s characteristics as gender or age. In the present work more accurate estimations of the singularities in the power spectral density of the glottal source are obtained using an adaptive version of the inverse filtering to carefully follow the spectral changes in continuous speech. Therefore the resulting biometric signature gives a better description of intra-speaker variability. Typical male and female samples chosen from a database of 100 normal speakers are used to determine certain gender specific patterns useful in pathology treatment availing. The low intra-speaker variability present in the biometric signature makes it suitable for speaker identification applications as well as for pathology detection and other fields of speech characterization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gómez, P., Rodellar, V., Álvarez, A., Lázaro, J.C., Murphy, K., Díaz, F., Fernández, R.: Biometrical Speaker Description from Vocal Cord Parameterization. In: Proc. of ICASSP 2006, Toulouse, France, pp. 1036–1039 (2006)

    Google Scholar 

  2. Whiteside, S.P.: Sex-specific fundamental and formant frequency patterns in a cross-sectional study. J. Acoust. Soc. Am. 110(1), 464–478 (2001)

    Article  Google Scholar 

  3. Godino, J.I., Gomez, P.: Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors. IEEE Trans Biomed. Eng. 51, 380–384 (2004)

    Article  Google Scholar 

  4. Gómez, P., Godino, J.I., Díaz, F., Álvarez, A., Martínez, R., Rodellar, V.: Biomechanical Parameter Fingerprint in the Mucosal Wave Power Spectral Density. In: Proc. of the ICSLP 2004, pp. 842–845 (2004)

    Google Scholar 

  5. Gómez, P., Martínez, R., Díaz, F., Lázaro, C., Álvarez, A., Rodellar, V., Nieto, V.: Estimation of vocal cord biomechanical parameters by non-linear inverse filtering of voice. In: Faundez-Zanuy, M., Janer, L., Esposito, A., Satue-Villar, A., Roure, J., Espinosa-Duro, V. (eds.) NOLISP 2005. LNCS (LNAI), vol. 3817, pp. 174–183. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  6. Alku, P.: An Automatic Method to Estimate the Time-Based Parameters of the Glottal Pulseform. In: Proc. of the ICASSP 1992, pp. II/29-32 (1992)

    Google Scholar 

  7. Akande, O.O., Murphy, P.J.: Estimation of the vocal tract transfer function with application to glottal wave analysis. Speech Communication 46(1), 1–13 (2005)

    Google Scholar 

  8. Nickel, R.M.: Automatic Speech Character Identification. IEEE Circuits and Systems Magazine 6(4), 8–29 (2006)

    Article  Google Scholar 

  9. Haykin, S.: Adaptive Filter Theory, 4th edn. Prentice-Hall, Upper Saddle River, NJ (2001)

    Google Scholar 

  10. Fant, G., Liljentcrants, J., Lin, Q.: A four-parameter model of glottal flow. STL-QSPR 4, 1–13 (1985), Reprinted in Speech Acoustics and Phonetics: Selected Writings, G. Fant, pp. 95–108. Kluwer Academic Publishers, Dordrecht (2004)

    Google Scholar 

  11. Hirano, M., Hibi, S., Yoshida, T., Hirade, Y., Kasuya, H., Kikuchi, Y.: Acoustic analysis of pathological voice. Some results of clinical application. Acta Otolaryngologica 105(5-6), 432–438 (1988)

    Google Scholar 

  12. Berry, D.A.: Mechanisms of modal and non-modal phonation. J. Phonetics 29, 431–450 (2001)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Mohamed Chetouani Amir Hussain Bruno Gas Maurice Milgram Jean-Luc Zarader

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gómez, P. et al. (2007). Estimating the Dispersion of the Biometric Glottal Signature in Continuous Speech. In: Chetouani, M., Hussain, A., Gas, B., Milgram, M., Zarader, JL. (eds) Advances in Nonlinear Speech Processing. NOLISP 2007. Lecture Notes in Computer Science(), vol 4885. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77347-4_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77347-4_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77346-7

  • Online ISBN: 978-3-540-77347-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics