Abstract
In this paper class posterior distributions are combined with a hierarchal structure of multilayer Perceptrons to perform an automatic assessment of dysarthric speech. In addition to the standard Mel-frequency coefficients, this hybrid classifier uses rhythm-based features as input parameters since the preliminary evidence from perceptual experiments show that rhythm troubles may be the common characteristic of various types of dysarthria. The Nemours database of American dysarthric speakers is used throughout experiments. Results show the relevance of rhythm metrics and the effectiveness of the proposed hybrid classifier to discriminate the levels of dysarthria severity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Arvaniti, A.: A rhythm timing and the timing of rhythm. Phonetica (66), 46–63 (2009)
Enderby, P., Pamela, M.: Frenchay Dysarthria Assessment. College Hill Press (1983)
Grabe, E., Low, E.L.: Durational variability in speech and the rhythm class hypothesis. Papers in Laboratory Phonology 7 (2002)
Liss, J.M., White, L., Mattys, S.L., Lansford, K., Lotto, A.J., Spitzer, S., Caviness, J.N.: Quantifying speech rhythm abnormalities in the dysarthrias. Journal of Speech Language and Hearing Research (52), 1334–1352 (2009)
Polikoff, J.B., Bunnell, H.T.: The Nemours database of dysarthric speech: A perceptual analysis. In: The XIVth International Congress of Phonetic Sciences (ICPhS), San Francisco (1999)
Polur, D., Miller, G.: Investigation of an HMM/ANN hybrid structure in pattern recognition application using cepstral analysis of dysarthric (distorted) speech signals. Medical Engineering & Physics 28(8), 741–748 (2006)
Ramus, F., Nespor, M., Mehler, J.: Correlates of linguistic rhythm in the speech signal. Cognition 73(3), 265–292 (1999)
Rudzicz, F.: Phonological features in discriminative classification of dysarthric speech. In: Proceedings of ICASSP 2009, Taiwan, pp. 4605–4608 (2009)
Schwarz, P., Matejka, P., Cernocky, J.: Hierarchical structures of neural networks for phoneme recognition. In: Proceedings of ICASSP 2006, Toulouse, pp. 325–328 (2006)
Selouani, S.A., Yakoub, M., O’Shaughnessy, D.: Alternative speech communication system, for persons with severe speech disorders. EURASIP Journal on Advances in Signal Processing (2009), doi:10.1155
Tolba, H., Eltorgoman, A.: Towards the improvement of automatic recognition of dysarthric speech. In: IEEE International Conference ICSIT, pp. 277–281 (2009)
Tsuji, T., Fukuda, O., Ichinobe, H., Kaneko, M.: A log-linearized Gaussian mixture network and its application to EEG pattern classification. IEEE Transactions on Systems, Man, and Cybernetics 29(1), 60–72 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Selouani, SA., Dahmani, H., Amami, R., Hamam, H. (2011). Dysarthric Speech Classification Using Hierarchical Multilayer Perceptrons and Posterior Rhythmic Features. In: Corchado, E., Snášel, V., Sedano, J., Hassanien, A.E., Calvo, J.L., Ślȩzak, D. (eds) Soft Computing Models in Industrial and Environmental Applications, 6th International Conference SOCO 2011. Advances in Intelligent and Soft Computing, vol 87. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19644-7_46
Download citation
DOI: https://doi.org/10.1007/978-3-642-19644-7_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19643-0
Online ISBN: 978-3-642-19644-7
eBook Packages: EngineeringEngineering (R0)