Abstract
Classification of voice aging has many applications in health and geriatrics. This work focuses on finding the most significant parameters to identify voice aging. This work proposes to choose the most significant parameters extracted of the glottal signal to identify the voice aging process of men and women using the wrapper approach combining a genetic algorithm (as a search algorithm) with a neural network (as an induction algorithm). The chosen parameters will be used as entries in a neural network to classify male and female Brazilian speakers in three different age groups, which will be called young (from 15 to 30 years old), adult (from 31 to 60 years old) and senior (from 61 to 90 years old). The voice database used for this work was composed by one hundred twenty Brazilian people (male and female) of different ages. In this work we use the largest basis for classification of age compared with other similar works, and its rate of classification is superior to other studies reaching 91.6% in males and 83.33% in women.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Rosa, I.S.: Analise acústica da voz de indivíduos na terceira idade. Tese de mestrado Universidade de São Carlos (2005)
Verdonck, I., Mahieu, H.: Vocal aging and the impact on daily life: a longitudinal study (2003), doi:10.1016 jornal voice
Sadeghi, N.A., Homayounpour, M.M.: Speaker age interval and sex identification based on jitters, shimmers and mean MFCC using supervised and unsupervised discriminative classification methods. In: Proc. ICSP, Guilin, China (2006)
Sedaaghi, M.H.: A Comparative Study of Gender and Age Classification in Speech Signals. Iranian Journal of Electrical & Electronic Engineering 5(1) (March 2009)
Campbell Jr., J.P.: Speaker Recognition: A tutorial. Proceedings of the IEEE 85(9), 1437–1462 (1997)
Alku, P.: Glottal wave analysis with Pitch Synchronous Adaptive Inverse Filtering”. Speech Communication 11, 109–118 (1992)
Pulakka, H.: Analysis of Human Voice Production Using Inverse Filtering, High-Speed Imaging, and Electroglottography. University of Technology Helsinki (2005)
Juliano, S. M.: Um estudo comparativo entre o sinal electroglotográfico e o sinal de voz”, Dissertação de mestrado em Engenharia de Telecomunicações, UFF (2008)
Software Aparat Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing, http://aparat.sourceforge.net/index.php/Main_Page
Alku, P., Vilkman, E.: Amplitude Domain Quotient for Characterization of the Glottal Volume Velocity Waveform Estimated by Inverse Filtering. Speech Communication 18(2), 131–138 (1996)
Alku, P., Bäckström, T., Vilkman, E.: Normalized Amplitude Quotient for Parameterization of the Glottal Flow. Journal of the Acoustical Society of America 112(2), 701–710 (2002)
Gobl, C., Chasaide, A.: Amplitude-based source parameters for measuring voice quality. In: VOQUAL, pp. 151–156 (2003)
Laukkanen, A.-M., Vilkman, E., Alku, P.: Related to Stress and Emotional State: a Preliminary Study. Journal of Phonetics 24(3), 313–335 (1996)
Titze, I., Sundberg, J.: Vocal intensity in speakers and singers. Journal of the Acoustical Society of America, 2936–2946 (May 1992)
Airas, M.: Methods and studies of laringeal voice quality analysis in speech production. Dissertation for the degree of Doctor Helsinki University of Technology (2008)
Childers, D.G., Lee. C. K.: Vocal quality factors: Analysis, synthesis, and perception. Journal of the Acoustical Society of America, 2394–2410 (May 1990)
Cataldo, E., Rodrigues, F., Brandão, A., Lucero, J.: Usando Redes Neurais para Classificação de padrões de voz. In: XXVIII CNMAC - Congresso Nacional de Matemática Aplicada e Computacional, 2005, Anais do XXVIII CNMAC, São Paulo (2005)
Vieira, M.N.: Automated Measures of Dysphonias and the Phonatory Effects of Asymmetries in the Posterior Larynx, Ph.D Thesis, University of Edinburgh, UK (1997)
Pappa, G.L.: Seleção de atributos utilizando algoritmos genéticos múltiplos objetivos. Tese de mestrado PUC Paraná (2005)
Behlau, M., Pontes, P.P.: Avaliação e tratamento das disfonias. Lovise, São Paulo (1995)
Software praat University of Amsterdam, http://www.fon.hum.uva.nl/praat/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Forero Mendoza, L.A., Cataldo, E., Vellasco, M., Silva, M. (2010). Classification of Voice Aging Using Parameters Extracted from the Glottal Signal. In: Diamantaras, K., Duch, W., Iliadis, L.S. (eds) Artificial Neural Networks – ICANN 2010. ICANN 2010. Lecture Notes in Computer Science, vol 6354. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15825-4_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-15825-4_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15824-7
Online ISBN: 978-3-642-15825-4
eBook Packages: Computer ScienceComputer Science (R0)