ABSTRACT
We describe a novel evaluation system for the intelligibility assessment of children with CLP on standardized tests. The system is solely based on standard cepstral features in form of MFCCs. No other information like word alignments is used. So the system can be easily adapted to other languages. For each child one GMM is created by adaptation of a UBM to the speaker-specific MFCCs. The components of this GMM are concatenated in order to create a so-called GMM supervector. These GMM supervectors are then used as meta features for an SVR. We evaluated our language-independent system on two different datasets of children suffering from CLP. One dataset contains recordings of 35 German children, where the children named different pictograms. The other dataset contains recordings of 14 Italian speaking children, who repeated standardized sentences. On both datasets we achieved high correlations: up to 0.81 for the German dataset and 0.83 for the Italian dataset.
- T. Bocklet, T. Haderlein, F. Hönig, and E. Nöth. Evaluation and Assessment of Speech Intelligibilty on Pathologic Voices Based upon Accoustic Speaker Models. In Proceedings of the 3rd Advanced Voice Function Assessment International Workshop, pages 89--92, Madrid, 2009.Google Scholar
- T. Bocklet, A. Maier, and E. Nöth. Age Determination of Children in Preschool and Primary School Age with GMM-Based Supervectors and Support Vector Machines/Regression. In P. Sojka, A. Horak, I. Kopecek, and K. Pala, editors, Proceedings Text, Speech and Dialogue; 11th International Conference, Lecture Notes in Artificial Intelligence, pages 253--260, Heidelberg, 2008. Google ScholarDigital Library
- A. Dempster, N. Laird, and D. Rubin. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B (Methodological), 39(1):1--38, 1977.Google ScholarCross Ref
- A. V. Fox. PLAKSS - Psycholinguistische Analyse kindlicher Sprechstörungen. Swets&Zeitlinger, Frankfurt a.M., 2002.Google Scholar
- J. Gauvain and C. Lee. Maximum A-Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains. IEEE Transactions on Speech and Audio Processing, 2:291--298, 1994.Google ScholarCross Ref
- T. Haderlein, E. Nöth, H. Toy, A. Batliner, M. Schuster, U. Eysholdt, J. Hornegger, and F. Rosanowski. Automatic Evaluation of Prosodic Features of Tracheoesophageal Substitute Voice. Eur Arch Otorhinolaryngol, 264(11):1315--1321, 2007.Google ScholarCross Ref
- A. Maier, T. Haderlein, U. Eysholdt, F. Rosanowski, A. Batliner, M. Schuster, and E. Nöth. PEAKS - A system for the automatic evaluation of voice and speech disorders. Speech Communication, 51(5):425--437, 2009. Google ScholarDigital Library
- M. Schuster, A. Maier, T. Haderlein, E. Nkenke, U. Wohlleben, F. Rosanowski, U. Eysholdt, and E. Nöth. Evaluation of speech intelligibility for children with cleft lip and palate by means of automatic speech recognition. International Journal of Pediatric Otorhinolaryngology, 70:1741--1747, 2006.Google ScholarCross Ref
- A. Smola and B. Schölkopf. A tutorial on support vector regression. Statistics and Computing, 14:199--222, 2004. Google ScholarDigital Library
Index Terms
- Towards a language-independent intelligibility assessment of children with cleft lip and palate
Recommendations
Articulation Analysis in the Speech of Children with Cleft Lip and Palate
Progress in Pattern Recognition, Image Analysis, Computer Vision, and ApplicationsAbstractHypernasality is a speech deficit that affects children with cleft lip and palate (CLP). It is characterized by the lack of control of the velum, which causes problems when controlling the amount of air passing from the oral to the nasal cavity ...
Intelligibility of Children with Cleft Lip and Palate: Evaluation by Speech Recognition Techniques
ICPR '06: Proceedings of the 18th International Conference on Pattern Recognition - Volume 04Cleft lip and palate (CLP) may cause functional limitations even after adequate surgical and non-surgical treatment, speech disorder being one of them. Until now, an objective means to determine and quantify the intelligibility does not exist. An ...
Analysis of Hypernasal Speech in Children with Cleft Lip and Palate
TSD '08: Proceedings of the 11th international conference on Text, Speech and DialogueIn children with cleft lip and palate speech disorders appear often. One major disorder amongst them is hypernasality. This is the first study which shows that it is possible to automatically detect hypernasality in connected speech without any invasive ...
Comments