Abstract
We present ongoing work on prosody prediction for speech synthesis. This approach considers sentences as tree-like structures and decides on the prosody from a corpus of such structures using machine learning techniques. The prediction is achieved from the prosody of the closest sentence of the corpus through tree similarity measurements in a nearest neighbour context. We introduce a syntactic structure and a performance structure representation, the tree similarity metrics considered, and then we discuss the prediction method. Experiments are currently under process to qualify this approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ross, K.: Modelling of intonation for speech synthesis, Ph.D. Thesis, College of Engineering, Boston University, 1995.
Traber, C.: F0 generation with a database of natural F0 patterns and with a neural network, Talking machines: theories, models and designs, 1992, pp. 287–304.
Jensen, U., Moore, R.K., Dalsgaard, P., Lindberg, B.: Modelling intonation contours at the phrase level using continuous density hidden Markov models, Computer Speech and Language, vol. 8, 1994, pp. 247–260.
Ostendorf, M., Price, P.J., Shattuck-Hufnagel, S.: The Boston University Radio News Corpus, Technical Report ECS-95-001, Boston University, 1995.
Silverman, K., Beckman, M.E., Pitrelli, J., Ostendorf, M., Wightman, C.W., Price, P.J., Pierrehumbert, J.B., Hirschberg, J.: TOBI: A standard for labelling English Prosody, Int. Conf. on Spoken Language Processing, vol. 2, 1992, pp. 867–870.
Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: the Penn Treebank, Comp. Linguistics, vol. 19, 1993.
Gee, J.P., Grosjean, F.: Performance structures: a psycholinguistic and linguistic appraisal, Cognitive Psychology, vol. 15, 1983.
Bachenko, J., Fitzpatrick, E.: A computational grammar of discourse-neutral prosodic phrasing in English, Comp. Linguistics, vol. 16, N. 3, 1990, pp. 155–170.
Wagner, R.A., Fisher, M.J.: The string-to-string correction problem, Journal of the Association for Computing Machinery, vol. 21, N. 1, 1974, pp. 168–173.
Selkow, S.M., The tree-to-tree editing problem, Information Processing Letters, vol. 6, N. 6, 1977, pp. 184–186.
Taï, K.C., The tree-to-tree correction problem, Journal of the Association for Computing Machinery, vol. 26, N. 3, 1979, pp. 422–433.
Zhang, K., Algorithms for the constrained editing distance between ordered labelled trees and related problems, Pattern Recognition, vol. 28, N. 3, 1995, pp. 463–474.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Blin, L., Edgington, M. (2000). Prosody Prediction from Tree-Like Structure Similarities. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science(), vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_62
Download citation
DOI: https://doi.org/10.1007/3-540-45323-7_62
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41042-3
Online ISBN: 978-3-540-45323-9
eBook Packages: Springer Book Archive