Classification of Voice Aging Using Parameters Extracted from the Glottal Signal

Forero Mendoza, Leonardo Alfredo; Cataldo, Edson; Vellasco, Marley; Silva, Marco

doi:10.1007/978-3-642-15825-4_20

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 6354)

Included in the following conference series:

International Conference on Artificial Neural Networks

Abstract

Classification of voice aging has many applications in health and geriatrics. This work identifies the most significant parameters extracted from the glottal signal for characterizing the voice aging process in men and women. The parameters are chosen with a wrapper approach that combines a genetic algorithm (as the search algorithm) with a neural network (as the induction algorithm). The chosen parameters are then used as inputs to a neural network that classifies male and female Brazilian speakers into three age groups: young (15 to 30 years old), adult (31 to 60 years old), and senior (61 to 90 years old). The voice database comprises 120 Brazilian speakers (male and female) of different ages. We use the largest database for age classification among comparable works, and the classification rate exceeds that of other studies, reaching 91.6% for male speakers and 83.33% for female speakers.
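
The wrapper idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: a nearest-centroid classifier stands in for the neural network as the induction algorithm so that the example needs only the standard library, the data are synthetic (two informative features plus four noise features), and every name and setting (`centroid_accuracy`, `ga_wrapper`, population size, mutation rate) is a hypothetical choice made for illustration.

```python
import random

def centroid_accuracy(train, val, mask):
    """Validation accuracy of a nearest-centroid classifier on the masked-in features."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    sums, counts = {}, {}
    for x, label in train:
        s = sums.setdefault(label, [0.0] * len(cols))
        for k, j in enumerate(cols):
            s[k] += x[j]
        counts[label] = counts.get(label, 0) + 1
    centroids = {c: [v / counts[c] for v in s] for c, s in sums.items()}

    def predict(x):
        # Assign the class whose centroid is closest in squared Euclidean distance.
        return min(centroids, key=lambda c: sum((x[j] - centroids[c][k]) ** 2
                                                for k, j in enumerate(cols)))

    return sum(predict(x) == label for x, label in val) / len(val)

def ga_wrapper(train, val, pop_size=20, generations=30, mutation_rate=0.1, seed=0):
    """Genetic search over boolean feature masks; fitness is the classifier's accuracy."""
    rng = random.Random(seed)
    n = len(train[0][0])

    def fitness(mask):
        return centroid_accuracy(train, val, mask)

    # Start from the full feature set plus random subsets.
    population = [[True] * n]
    population += [[rng.random() < 0.5 for _ in range(n)] for _ in range(pop_size - 1)]
    best = max(population, key=fitness)
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        if fitness(ranked[0]) > fitness(best):
            best = ranked[0]
        parents = ranked[:pop_size // 2]   # truncation selection
        population = [ranked[0][:]]        # elitism: carry the best mask forward
        while len(population) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)      # one-point crossover
            child = a[:cut] + b[cut:]
            # Flip each bit with probability mutation_rate.
            population.append([(not g) if rng.random() < mutation_rate else g
                               for g in child])
    best = max(population + [best], key=fitness)
    return best, fitness(best)

# Synthetic stand-in data (NOT the paper's voice database):
# features 0 and 1 depend on the class, features 2 to 5 are pure noise.
data_rng = random.Random(1)
samples = []
for c, label in enumerate(["young", "adult", "senior"]):
    for _ in range(30):
        x = [2.0 * c + data_rng.gauss(0, 0.5), -1.5 * c + data_rng.gauss(0, 0.5)]
        x += [data_rng.gauss(0, 3.0) for _ in range(4)]
        samples.append((x, label))
train, val = samples[0::2], samples[1::2]

mask, acc = ga_wrapper(train, val)
baseline = centroid_accuracy(train, val, [True] * 6)
print("selected mask:", mask)
print("accuracy with selected features:", acc, "with all features:", baseline)
```

Because the fitness is measured on the same validation split that guides the search, the reported accuracy is optimistic; a separate held-out test set would be needed for an unbiased estimate.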

Author information

Authors and Affiliations

Electrical Engineering Department, Pontifical Catholic University of Rio de Janeiro, CEP 22.451-900 Rua Marquês de São Vicente, 225 Gávea, Rio de Janeiro, Brasil
Leonardo Alfredo Forero Mendoza, Marley Vellasco & Marco Silva
Applied Mathematics Department – Telecommunications Engineering, Universidade Federal Fluminense, CEP 24210-240 Escola de Engenharia - Bloco D - Sala 502-B R. Passo da Pátria, 156, São Domingos Niterói, RJ, Brasil
Edson Cataldo

Editor information

Editors and Affiliations

Department of Informatics, TEI of Thessaloniki, 57400, Sindos, Greece
Konstantinos Diamantaras
Department of Informatics, Nicolaus Copernicus University, School of Physics, Astronomy, and Informatics, ul. Grudziadzka 5, 87-100, Torun, Poland
Wlodek Duch
Department of Forestry and Management of the Environment and Natural Resources, Democritus University of Thrace, Pantazidou 193, 68200, Orestiada Thrace, Greece
Lazaros S. Iliadis

About this paper

Cite this paper

Forero Mendoza, L.A., Cataldo, E., Vellasco, M., Silva, M. (2010). Classification of Voice Aging Using Parameters Extracted from the Glottal Signal. In: Diamantaras, K., Duch, W., Iliadis, L.S. (eds) Artificial Neural Networks – ICANN 2010. ICANN 2010. Lecture Notes in Computer Science, vol 6354. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15825-4_20

DOI: https://doi.org/10.1007/978-3-642-15825-4_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15824-7
Online ISBN: 978-3-642-15825-4
eBook Packages: Computer Science, Computer Science (R0)
