Abstract
With the emergence of the HMM-synthesis paradigm, producing natural, expressive prosody has become viable in speech synthesis. This paper describes the development of a rule-based prominence prediction model for a Finnish text-to-speech system, based on deep syntactic analysis and discourse structure.
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Suni, A., Vainio, M. (2008). Deep Syntactic Analysis and Rule Based Accentuation in Text-to-Speech Synthesis. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2008. Lecture Notes in Computer Science, vol 5246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87391-4_68
DOI: https://doi.org/10.1007/978-3-540-87391-4_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87390-7
Online ISBN: 978-3-540-87391-4
eBook Packages: Computer Science, Computer Science (R0)