Dysarthric Speech Classification Using Hierarchical Multilayer Perceptrons and Posterior Rhythmic Features

Selouani, Sid-Ahmed; Dahmani, Habiba; Amami, Riadh; Hamam, Habib

doi:10.1007/978-3-642-19644-7_46

Sid-Ahmed Selouani⁸,
Habiba Dahmani⁸,
Riadh Amami¹⁰ &
…
Habib Hamam⁹

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 87))

1319 Accesses
1 Citations

Abstract

In this paper class posterior distributions are combined with a hierarchal structure of multilayer Perceptrons to perform an automatic assessment of dysarthric speech. In addition to the standard Mel-frequency coefficients, this hybrid classifier uses rhythm-based features as input parameters since the preliminary evidence from perceptual experiments show that rhythm troubles may be the common characteristic of various types of dysarthria. The Nemours database of American dysarthric speakers is used throughout experiments. Results show the relevance of rhythm metrics and the effectiveness of the proposed hybrid classifier to discriminate the levels of dysarthria severity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Arvaniti, A.: A rhythm timing and the timing of rhythm. Phonetica (66), 46–63 (2009)
Article Google Scholar
Enderby, P., Pamela, M.: Frenchay Dysarthria Assessment. College Hill Press (1983)
Google Scholar
Grabe, E., Low, E.L.: Durational variability in speech and the rhythm class hypothesis. Papers in Laboratory Phonology 7 (2002)
Google Scholar
Liss, J.M., White, L., Mattys, S.L., Lansford, K., Lotto, A.J., Spitzer, S., Caviness, J.N.: Quantifying speech rhythm abnormalities in the dysarthrias. Journal of Speech Language and Hearing Research (52), 1334–1352 (2009)
Article Google Scholar
Polikoff, J.B., Bunnell, H.T.: The Nemours database of dysarthric speech: A perceptual analysis. In: The XIVth International Congress of Phonetic Sciences (ICPhS), San Francisco (1999)
Google Scholar
Polur, D., Miller, G.: Investigation of an HMM/ANN hybrid structure in pattern recognition application using cepstral analysis of dysarthric (distorted) speech signals. Medical Engineering & Physics 28(8), 741–748 (2006)
Article Google Scholar
Ramus, F., Nespor, M., Mehler, J.: Correlates of linguistic rhythm in the speech signal. Cognition 73(3), 265–292 (1999)
Article Google Scholar
Rudzicz, F.: Phonological features in discriminative classification of dysarthric speech. In: Proceedings of ICASSP 2009, Taiwan, pp. 4605–4608 (2009)
Google Scholar
Schwarz, P., Matejka, P., Cernocky, J.: Hierarchical structures of neural networks for phoneme recognition. In: Proceedings of ICASSP 2006, Toulouse, pp. 325–328 (2006)
Google Scholar
Selouani, S.A., Yakoub, M., O’Shaughnessy, D.: Alternative speech communication system, for persons with severe speech disorders. EURASIP Journal on Advances in Signal Processing (2009), doi:10.1155
Google Scholar
Tolba, H., Eltorgoman, A.: Towards the improvement of automatic recognition of dysarthric speech. In: IEEE International Conference ICSIT, pp. 277–281 (2009)
Google Scholar
Tsuji, T., Fukuda, O., Ichinobe, H., Kaneko, M.: A log-linearized Gaussian mixture network and its application to EEG pattern classification. IEEE Transactions on Systems, Man, and Cybernetics 29(1), 60–72 (1999)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Université de Moncton, New Brunswick, Canada
Sid-Ahmed Selouani & Habiba Dahmani
INRS-Université du Québec, Montreal, Canada
Habib Hamam
École ESPRIT, Tunis, Tunisia
Riadh Amami

Authors

Sid-Ahmed Selouani
View author publications
You can also search for this author in PubMed Google Scholar
Habiba Dahmani
View author publications
You can also search for this author in PubMed Google Scholar
Riadh Amami
View author publications
You can also search for this author in PubMed Google Scholar
Habib Hamam
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Universidad de Salamanca, Plaza de la Merced S/N, 37008, Salamanca, Spain
Emilio Corchado
VŠB-TU Ostrava, 17. listopadu 15, 70833, Ostrava, Czech Republic
Václav Snášel
University of Burgos, Avenida Cantaria S/N, 09006, Burgos, Spain
Javier Sedano
Cairo University, 5 Ahmed Zewal St., Orman, Cairo, Egypt
Aboul Ella Hassanien
University of La Coruña, Avda. 19 de Febrero, S/N, A Coruña,, 15403, Ferrol, Spain
José Luis Calvo
Infobright, 47 Colborne Street, Suite 403, M5E1P8, Toronto, Ontario, Canada
Dominik Ślȩzak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Selouani, SA., Dahmani, H., Amami, R., Hamam, H. (2011). Dysarthric Speech Classification Using Hierarchical Multilayer Perceptrons and Posterior Rhythmic Features. In: Corchado, E., Snášel, V., Sedano, J., Hassanien, A.E., Calvo, J.L., Ślȩzak, D. (eds) Soft Computing Models in Industrial and Environmental Applications, 6th International Conference SOCO 2011. Advances in Intelligent and Soft Computing, vol 87. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19644-7_46

Download citation

DOI: https://doi.org/10.1007/978-3-642-19644-7_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19643-0
Online ISBN: 978-3-642-19644-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics