Prosody Prediction from Tree-Like Structure Similarities

  • Conference paper
Text, Speech and Dialogue (TSD 2000)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1902)

Abstract

We present ongoing work on prosody prediction for speech synthesis. The approach treats sentences as tree-like structures and predicts prosody from a corpus of such structures using machine learning techniques. The prosody of a new sentence is taken from the closest sentence in the corpus, where closeness is measured by tree similarity in a nearest-neighbour framework. We introduce a syntactic structure and a performance structure representation, describe the tree similarity metrics considered, and then discuss the prediction method. Experiments to evaluate this approach are currently in progress.
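
The nearest-neighbour idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Node` class, the unit edit costs, the toy trees, and the placeholder prosody labels (`"contour-A"`, `"contour-B"`) are all assumptions made for illustration. The distance used is a simple top-down ordered-tree edit distance in which whole subtrees are inserted or deleted (in the style of Selkow's variant), and it may differ from the metrics the paper actually considers.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Node:
    """A labelled, ordered tree node (hypothetical sentence structure)."""
    label: str
    children: List["Node"] = field(default_factory=list)


def size(t: Node) -> int:
    """Number of nodes in the subtree rooted at t."""
    return 1 + sum(size(c) for c in t.children)


def tree_distance(a: Node, b: Node) -> int:
    """Top-down edit distance with unit costs (an assumed cost model).

    Relabelling a node costs 1; inserting or deleting a subtree costs
    its size. Child sequences are aligned by dynamic programming.
    """
    relabel = 0 if a.label == b.label else 1
    m, n = len(a.children), len(b.children)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        d[i][0] = d[i - 1][0] + size(a.children[i - 1])
    for j in range(1, n + 1):
        d[0][j] = d[0][j - 1] + size(b.children[j - 1])
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            d[i][j] = min(
                d[i - 1][j] + size(a.children[i - 1]),      # delete subtree
                d[i][j - 1] + size(b.children[j - 1]),      # insert subtree
                d[i - 1][j - 1]
                + tree_distance(a.children[i - 1], b.children[j - 1]),
            )
    return relabel + d[m][n]


def predict_prosody(query: Node, corpus: List[Tuple[Node, str]]) -> str:
    """Return the prosody annotation of the corpus tree closest to query."""
    _, prosody = min(corpus, key=lambda item: tree_distance(query, item[0]))
    return prosody


# Toy corpus: two invented trees paired with placeholder prosody labels.
tree_a = Node("S", [Node("NP", [Node("Det"), Node("N")]),
                    Node("VP", [Node("V")])])
tree_b = Node("S", [Node("NP", [Node("N")]),
                    Node("VP", [Node("V"),
                                Node("NP", [Node("Det"), Node("N")])])])
corpus = [(tree_a, "contour-A"), (tree_b, "contour-B")]

query = Node("S", [Node("NP", [Node("Det"), Node("N")]),
                   Node("VP", [Node("V"), Node("Adv")])])
predicted = predict_prosody(query, corpus)
```

In this toy setting the query differs from `tree_a` by a single inserted leaf, so `tree_a` is the nearest neighbour and its label is returned. A real system would attach prosodic annotations to the trees rather than a single string, and the choice of edit costs would be part of what the experiments need to evaluate.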

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Blin, L., Edgington, M. (2000). Prosody Prediction from Tree-Like Structure Similarities. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science, vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_62

  • DOI: https://doi.org/10.1007/3-540-45323-7_62

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41042-3

  • Online ISBN: 978-3-540-45323-9

  • eBook Packages: Springer Book Archive
