Impact of Age in ASR for the Elderly: Preliminary Experiments in European Portuguese

Pellegrini, Thomas; Trancoso, Isabel; Hämäläinen, Annika; Calado, António; Dias, Miguel Sales; Braga, Daniela

doi:10.1007/978-3-642-35292-8_15

Impact of Age in ASR for the Elderly: Preliminary Experiments in European Portuguese

Thomas Pellegrini⁷,
Isabel Trancoso^7,8,
Annika Hämäläinen^9,10,
António Calado⁹,
Miguel Sales Dias^9,10 &
…
Daniela Braga^9,10

Conference paper

737 Accesses
13 Citations

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 328))

Abstract

Standard automatic speech recognition (ASR) systems use acoustic models typically trained with speech of young adult speakers. Ageing is known to alter speech production in ways that require ASR systems to be adapted, in particular at the level of acoustic modeling. This paper reports ASR experiments that illustrate the impact of speaker age on speech recognition performance. A large read speech corpus in European Portuguese allowed us to measure statistically significant performance differences among age groups ranging from 60- to 90-year-old speakers. An increase of 41% relative (11.9% absolute) in word error rate was observed between 60-65-year-old and 81-86-year-old speakers. This paper also reports experiments on retraining acoustic models (AMs), further illustrating the impact of ageing on ASR performance. Differentiated gains were observed depending on the age range of the adaptation data use to retrain the acoustic models.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Wilpon, J., Jacobsen, C.: A study of speech recognition for children and the elderly. In: Proc. ICASSP, Atlanta, pp. 349–352 (1996)
Google Scholar
Baba, A., Yoshizawa, S., Yamada, M., Lee, A., Shikano, K.: Acoustic models of the elderly for large-vocabulary continuous speech recognition. Electronics and Communications in Japan 87(7), 49–57 (2004)
Google Scholar
Vipperla, R., Renals, S., Frankel, J.: Longitudinal study of ASR performance on ageing voices. In: Proc. Interspeech, Brisbane, pp. 2550–2553 (2008)
Google Scholar
Baeckman, L., Small, B., Wahlin, A.: Aging and memory: cognitive and biological perspectives. In: Handbook of the Psychology of Aging, pp. 349–377 (2001)
Google Scholar
Fozard, J., Gordon-Salant, S.: Changes in vision and hearing with aging. In: Handbook of the Psychology of Aging, pp. 241–266 (2001)
Google Scholar
Anderson, S., Liberman, N., Bernstein, E., Foster, S., Cate, E., Levin, B., Hudson, R.: Recognition of elderly speech and voice-driven document retrieval. In: Proc. ICASSP, Phoenix, pp. 145–148 (1999)
Google Scholar
Neto, J., Meinedo, H., Viveiros, M., Cassaca, R., Martins, C., Caseiro, D.: Broadcast news subtitling system in portuguese. In: Proc. ICASSP 2008, Las Vegas, USA (2008)
Google Scholar
Meinedo, H.: Audio pre-processing and speech recognition for broadcast news. Ph.D. dissertation, IST, Lisbon, Portugal (2008)
Google Scholar
Meinedo, H., Caseiro, D.A., Neto, J.P., Trancoso, I.: AUDIMUS.MEDIA: A Broadcast News Speech Recognition System for the European Portuguese Language. In: Mamede, N.J., Baptista, J., Trancoso, I., Nunes, M.d.G.V. (eds.) PROPOR 2003. LNCS, vol. 2721, pp. 9–17. Springer, Heidelberg (2003)
Chapter Google Scholar
Meinedo, H., Abad, A., Pellegrini, T., Neto, J., Trancoso, I.: The L2F Broadcast News Speech Recognition System. In: Proc. Fala, Vigo, pp. 93–96 (2010)
Google Scholar
Abad, A., Neto, J.: Incorporating Acoustical Modelling of Phone Transitions in a Hybrid ANN/HMM Speech Recognizer. In: Proceedings of INTERSPEECH, Brisbane, pp. 2394–2397 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

INESC-ID Lisboa, R. Alves Redol, 9, 1000-029, Lisbon, Portugal
Thomas Pellegrini & Isabel Trancoso
Instituto Superior Técnico, Lisbon, Portugal
Isabel Trancoso
Microsoft Language Development Center, Lisbon, Portugal
Annika Hämäläinen, António Calado, Miguel Sales Dias & Daniela Braga
ADETTI ISCTE, IUL, Lisbon, Portugal
Annika Hämäläinen, Miguel Sales Dias & Daniela Braga

Authors

Thomas Pellegrini
View author publications
You can also search for this author in PubMed Google Scholar
Isabel Trancoso
View author publications
You can also search for this author in PubMed Google Scholar
Annika Hämäläinen
View author publications
You can also search for this author in PubMed Google Scholar
António Calado
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Sales Dias
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Braga
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Escuela Politecnica Superior, Universidad Autonoma de Madrid. C/ Francisco, Tomas y Valiente 11, 28049, Madrid, Spain
Doroteo Torre Toledano
Centro Politécnico Superior, Edificio Ada Byron, C/ María de Luna nº 1, 50018, Zaragoza, Spain
Alfonso Ortega Giménez
Universidade de Aveiro, Campus Universitário Aveiro, 3810-193, Aveiro, Portugal
António Teixeira
Escuela Politecnica Superior, Universidad Autonoma de Madrid, C/ Francisco, Tomas y Valiente 11, 28049, Madrid, Spain
Joaquín González Rodríguez
E.T.S.I.Telecomunicacion, Universidad Politécnica de Madrid, Ciudad Universitaria s/n, 28040, Madrid, Spain
Luis Hernández Gómez & Rubén San Segundo Hernández &
Escuela Politecnica Superior, Universidad Autonoma de Madrid, C/ Francisco, Tomas y Valiente 11, 28049, Madrid, Spain
Daniel Ramos Castro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pellegrini, T., Trancoso, I., Hämäläinen, A., Calado, A., Dias, M.S., Braga, D. (2012). Impact of Age in ASR for the Elderly: Preliminary Experiments in European Portuguese. In: Torre Toledano, D., et al. Advances in Speech and Language Technologies for Iberian Languages. Communications in Computer and Information Science, vol 328. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35292-8_15

Download citation

DOI: https://doi.org/10.1007/978-3-642-35292-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35291-1
Online ISBN: 978-3-642-35292-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics