Towards a Deep Learning Based ASR System for Users with Dysarthria

Mulfari, Davide; Meoni, Gabriele; Marini, Marco; Fanucci, Luca

doi:10.1007/978-3-319-94277-3_86


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 10896)

Included in the following conference series:

International Conference on Computers Helping People with Special Needs

Abstract

In this paper, we investigate the benefits of deep learning approaches for developing personalized assistive technology for users with dysarthria, a speech disorder that reduces the intelligibility of a person's speech. Low intelligibility prevents these users from relying on automatic speech recognition (ASR) solutions on computers and mobile devices. To address this issue, we leverage convolutional neural networks to build a speaker-dependent ASR software solution for users with dysarthria that can be trained according to a particular user's needs and preferences.

Author information

Authors and Affiliations

University of Pisa, Pisa, Italy
Davide Mulfari, Gabriele Meoni, Marco Marini & Luca Fanucci

Corresponding author

Correspondence to Davide Mulfari.

Editor information

Editors and Affiliations

Johannes Kepler University Linz, Linz, Austria
Klaus Miesenberger
National and Kapodistrian University of Athens, Athens, Greece
Georgios Kouroupetroglou

About this paper

Cite this paper

Mulfari, D., Meoni, G., Marini, M., Fanucci, L. (2018). Towards a Deep Learning Based ASR System for Users with Dysarthria. In: Miesenberger, K., Kouroupetroglou, G. (eds) Computers Helping People with Special Needs. ICCHP 2018. Lecture Notes in Computer Science, vol 10896. Springer, Cham. https://doi.org/10.1007/978-3-319-94277-3_86

DOI: https://doi.org/10.1007/978-3-319-94277-3_86
Published: 26 June 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-94276-6
Online ISBN: 978-3-319-94277-3
eBook Packages: Computer Science, Computer Science (R0)
