Abstract
The Sammon Transform projects data in a topology-preserving manner on the basis of an arbitrary distance measure. As input, we use the weights of the observation probabilities of semi-continuous HMMs that were adapted to the current speaker. Experiments with laryngectomized speakers using tracheoesophageal substitute voice, hoarse speakers, and normal speakers show encouraging results. The different speaker groups are separated in 2-D space, and projecting a new speaker into the Sammon map allows prediction of his or her kind of voice pathology. The method can thus serve as objective, automated support for the evaluation of voice disorders, and it visualizes them in a way that is convenient for speech therapists.
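To make the projection step concrete, here is a minimal sketch of a Sammon mapping in Python with NumPy. It minimizes the Sammon stress E = (1/c) Σ_{i<j} (D_ij − d_ij)² / D_ij, where D_ij are the input distances, d_ij the distances in the 2-D map, and c = Σ_{i<j} D_ij. Several things here are assumptions for illustration and not taken from the paper: the function names, the randomly generated stand-ins for per-speaker HMM weight vectors, the Euclidean distance between them, and the use of plain gradient descent with step halving in place of the pseudo-Newton update of Sammon's original algorithm.

```python
import numpy as np

def pairwise_distances(X):
    """Euclidean distance matrix between the rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def sammon_stress(D, Y, eps=1e-12):
    """Sammon stress between input distances D and map coordinates Y."""
    iu = np.triu_indices(D.shape[0], k=1)      # each pair i < j once
    D_ij = D[iu]
    d_ij = pairwise_distances(Y)[iu]
    return float(((D_ij - d_ij) ** 2 / (D_ij + eps)).sum() / D_ij.sum())

def sammon_gradient(D, Y, eps=1e-12):
    """Gradient of the Sammon stress with respect to Y."""
    n = D.shape[0]
    d = pairwise_distances(Y)
    off = ~np.eye(n, dtype=bool)               # skip the i == j entries
    W = np.zeros_like(D, dtype=float)
    W[off] = (D[off] - d[off]) / (D[off] * d[off] + eps)
    c = D[np.triu_indices(n, k=1)].sum()
    # sum_j W_ij * (y_i - y_j), written in matrix form
    return (-2.0 / c) * (W.sum(axis=1, keepdims=True) * Y - W @ Y)

def sammon_map(D, n_components=2, n_iter=300, step=1.0, seed=0):
    """Gradient descent with step halving; a step is accepted only if it lowers the stress."""
    D = np.asarray(D, dtype=float)
    rng = np.random.default_rng(seed)
    Y = rng.normal(scale=1e-2, size=(D.shape[0], n_components))
    stress = sammon_stress(D, Y)
    for _ in range(n_iter):
        candidate = Y - step * sammon_gradient(D, Y)
        new_stress = sammon_stress(D, candidate)
        if new_stress < stress:
            Y, stress = candidate, new_stress
            step *= 1.2                        # cautiously grow the step
        else:
            step *= 0.5                        # overshoot: shrink and retry
    return Y, stress

# Hypothetical data: two groups of 10 "speakers", each described by
# 8 mixture weights that sum to one (stand-ins for adapted HMM weights).
rng = np.random.default_rng(42)
group_a = rng.dirichlet(np.ones(8), size=10)
group_b = rng.dirichlet(np.linspace(1, 10, 8), size=10)
weights = np.vstack([group_a, group_b])

D = pairwise_distances(weights)                # assumed distance measure
Y, stress = sammon_map(D)
print(Y.shape, round(stress, 4))
```

Because the stress depends only on the distance matrix `D`, any other distance measure between speaker models can be substituted without changing the mapping code, which is what "arbitrary distance measure" in the abstract suggests.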
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Haderlein, T., Zorn, D., Steidl, S., Nöth, E., Shozakai, M., Schuster, M. (2006). Visualization of Voice Disorders Using the Sammon Transform. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science, vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_74
DOI: https://doi.org/10.1007/11846406_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39090-9
Online ISBN: 978-3-540-39091-6
eBook Packages: Computer Science (R0)