Classification of Voice Aging Using Parameters Extracted from the Glottal Signal

  • Conference paper
Artificial Neural Networks – ICANN 2010 (ICANN 2010)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6354))

Abstract

Classification of voice aging has many applications in health and geriatrics. This work selects the most significant parameters extracted from the glottal signal for identifying the voice aging process in men and women. Selection follows the wrapper approach, combining a genetic algorithm (as the search algorithm) with a neural network (as the induction algorithm). The chosen parameters are then used as inputs to a neural network that classifies male and female Brazilian speakers into three age groups: young (15 to 30 years old), adult (31 to 60 years old) and senior (61 to 90 years old). The voice database comprises one hundred twenty Brazilian speakers (male and female) of different ages. It is larger than the databases used in similar age-classification works, and the classification rate exceeds that of other studies, reaching 91.6% for men and 83.33% for women.
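The wrapper approach described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' implementation: the data are synthetic (six made-up features, two of them informative), the genetic operators (truncation selection, one-point crossover, bit-flip mutation) and their settings are arbitrary, and a nearest-centroid classifier stands in for the neural network so that the example stays short and needs only the standard library. The function names `age_group`, `centroid_accuracy`, `ga_select` and `make_toy_data` are hypothetical. Only the three age ranges come from the abstract.

```python
import random

def age_group(age):
    """Map an age in years to the three classes named in the abstract."""
    if 15 <= age <= 30:
        return "young"
    if 31 <= age <= 60:
        return "adult"
    if 61 <= age <= 90:
        return "senior"
    raise ValueError("age outside the 15-90 range covered by the study")

def centroid_accuracy(train, valid, mask):
    """Stand-in induction algorithm (NOT a neural network):
    nearest-centroid accuracy using only the features kept by `mask`."""
    idx = [i for i, keep in enumerate(mask) if keep]
    if not idx:
        return 0.0  # an empty feature subset cannot classify anything
    sums, counts = {}, {}
    for x, y in train:
        s = sums.setdefault(y, [0.0] * len(idx))
        for k, i in enumerate(idx):
            s[k] += x[i]
        counts[y] = counts.get(y, 0) + 1
    cents = {y: [v / counts[y] for v in s] for y, s in sums.items()}

    def predict(x):
        return min(cents, key=lambda y: sum((x[i] - c) ** 2
                                            for i, c in zip(idx, cents[y])))

    return sum(predict(x) == y for x, y in valid) / len(valid)

def ga_select(train, valid, n_features, pop_size=20, generations=30,
              p_mut=0.1, seed=0):
    """Genetic search over boolean feature masks; fitness is the
    validation accuracy of the classifier trained on the masked features."""
    rng = random.Random(seed)

    def fitness(mask):
        return centroid_accuracy(train, valid, mask)

    # Start from the full feature set plus random subsets.
    pop = [[True] * n_features]
    pop += [[rng.random() < 0.5 for _ in range(n_features)]
            for _ in range(pop_size - 1)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[: pop_size // 2]           # keep the better half
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features)   # one-point crossover
            child = a[:cut] + b[cut:]
            children.append([(not g) if rng.random() < p_mut else g
                             for g in child])    # bit-flip mutation
        pop = parents + children
    best = max(pop, key=fitness)
    return best, fitness(best)

def make_toy_data(n, rng):
    """Synthetic speakers: features 0-1 depend on the age group, 2-5 are noise."""
    data = []
    for _ in range(n):
        label = age_group(rng.randint(15, 90))
        shift = {"young": 0.0, "adult": 3.0, "senior": 6.0}[label]
        x = [shift + rng.gauss(0, 0.5), -shift + rng.gauss(0, 0.5)]
        x += [rng.gauss(0, 3.0) for _ in range(4)]
        data.append((x, label))
    return data

rng = random.Random(1)
data = make_toy_data(120, rng)            # 120 samples, split chosen arbitrarily
train, valid = data[:80], data[80:]
mask, acc = ga_select(train, valid, n_features=6)
print("selected features:", [i for i, keep in enumerate(mask) if keep])
print("validation accuracy:", acc)
```

Because the better half of each generation is carried over unchanged and the full feature set is in the initial population, the selected subset never scores below the full set on the validation data in this sketch. In a real study the final classifier would be evaluated on speakers held out from the search, since the mask is tuned to the validation set.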


Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Forero Mendoza, L.A., Cataldo, E., Vellasco, M., Silva, M. (2010). Classification of Voice Aging Using Parameters Extracted from the Glottal Signal. In: Diamantaras, K., Duch, W., Iliadis, L.S. (eds) Artificial Neural Networks – ICANN 2010. ICANN 2010. Lecture Notes in Computer Science, vol 6354. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15825-4_20

  • DOI: https://doi.org/10.1007/978-3-642-15825-4_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15824-7

  • Online ISBN: 978-3-642-15825-4

  • eBook Packages: Computer Science (R0)
