Estimating the Dispersion of the Biometric Glottal Signature in Continuous Speech

Gómez, Pedro; Álvarez, Agustín; Mazaira, Luis Miguel; Fernández, Roberto; Rodellar, Victoria; Martínez, Rafael; Muñoz, Cristina

doi:10.1007/978-3-540-77347-4_22

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4885)

Included in the following conference series:

International Conference on Nonlinear Speech Processing

Abstract

The biometric voice signature may be derived from the voice as a whole, or from the vocal tract and the glottal source separately, after extraction by inverse filtering. The authors used the latter approach in earlier work, which showed that the biometric signature obtained from the glottal source gives a good description of speaker characteristics such as gender or age. In the present work, more accurate estimates of the singularities in the power spectral density of the glottal source are obtained using an adaptive version of inverse filtering that closely follows the spectral changes in continuous speech. As a result, the biometric signature gives a better description of intra-speaker variability. Typical male and female samples chosen from a database of 100 normal speakers are used to determine certain gender-specific patterns useful in evaluating pathology treatment. The low intra-speaker variability of the biometric signature makes it suitable for speaker identification as well as for pathology detection and other fields of speech characterization.
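
The abstract does not say which adaptive algorithm is used for the inverse filtering. As a rough illustration only, the sketch below assumes a normalized LMS (NLMS) linear predictor, a standard adaptive filter, and takes its prediction residual as a crude stand-in for the source signal before estimating a power spectral density. The function names, the filter order and the step size are all hypothetical choices, not values from the paper, and a real glottal inverse filter would also need lip-radiation compensation and a separate glottal model.

```python
import numpy as np


def nlms_inverse_filter(x, order=12, mu=0.1, eps=1e-8):
    """Adaptive linear-prediction inverse filter (assumed NLMS variant).

    Predicts x[n] from the previous `order` samples and returns the
    prediction residual e[n]. The weights are updated at every sample,
    so the filter can track slow spectral changes in the input.
    """
    x = np.asarray(x, dtype=float)
    w = np.zeros(order)          # predictor weights, adapted over time
    e = np.zeros(len(x))         # prediction residual
    for n in range(len(x)):
        # Most recent sample first; zero-pad at the start of the signal.
        past = x[max(0, n - order):n][::-1]
        u = np.zeros(order)
        u[:len(past)] = past
        e[n] = x[n] - w @ u
        # Normalized LMS update; eps avoids division by zero.
        w += mu * e[n] * u / (eps + u @ u)
    return e


def periodogram_db(x, fs):
    """Hann-windowed periodogram in dB, with the frequency axis in Hz."""
    x = np.asarray(x, dtype=float)
    win = np.hanning(len(x))
    spec = np.fft.rfft(x * win)
    psd = np.abs(spec) ** 2 / (fs * np.sum(win ** 2))
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    return freqs, 10.0 * np.log10(psd + 1e-20)


if __name__ == "__main__":
    # Synthetic test input (not speech): white noise through a resonance.
    rng = np.random.default_rng(0)
    noise = rng.standard_normal(4000)
    signal = np.zeros_like(noise)
    for n in range(2, len(signal)):
        signal[n] = 1.6 * signal[n - 1] - 0.8 * signal[n - 2] + noise[n]

    residual = nlms_inverse_filter(signal, order=4)
    freqs, psd_db = periodogram_db(residual, fs=8000.0)
    print("input variance:   ", np.var(signal[1000:]))
    print("residual variance:", np.var(residual[1000:]))
```

On the synthetic input, the residual variance should fall well below the input variance once the weights have adapted, which indicates that the predictor has removed most of the resonance. Applied frame by frame to speech, the same residual spectrum is where peaks and troughs of the kind the abstract calls singularities would be looked for.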

Author information

Authors and Affiliations

Facultad de Informática, Campus de Montegancedo, s/n, E-28660, Boadilla del Monte, Madrid, Spain
Pedro Gómez, Agustín Álvarez, Luis Miguel Mazaira, Roberto Fernández, Victoria Rodellar, Rafael Martínez & Cristina Muñoz

Editor information

Mohamed Chetouani, Amir Hussain, Bruno Gas, Maurice Milgram, Jean-Luc Zarader

About this paper

Cite this paper

Gómez, P. et al. (2007). Estimating the Dispersion of the Biometric Glottal Signature in Continuous Speech. In: Chetouani, M., Hussain, A., Gas, B., Milgram, M., Zarader, JL. (eds) Advances in Nonlinear Speech Processing. NOLISP 2007. Lecture Notes in Computer Science, vol 4885. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77347-4_22

DOI: https://doi.org/10.1007/978-3-540-77347-4_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77346-7
Online ISBN: 978-3-540-77347-4
eBook Packages: Computer Science, Computer Science (R0)
