Abstract
This contribution describes the influence of the Czech language parameters selection on the coarticulation of the phonemes for the modelling of prosody features by the artificial neural network (ANN) in a text-to-speech (TTS) synthesis. The GUHA method and neural network pruning can be used for this reason. In our work we analyzed the errors between the target and calculated values of F0 and D from the point of view of the different context of speech units. The context of three phonemes combinations CCC, VVC, VCV, CVV, VCC, CCV, and CVC (C = consonant, V = vowel) were analyzed for the determination of a next improvement of prosody. The qualitative criteria have been found in this contribution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Moulines, E. (1990) Algorithmes de codage et de modification des parametres prosodiques pour la synthèse de parole a partir du texte. In: Thèse de Docteur, l’Ecole National Superieure des Telecommunications, TELECOM Paris 90 E 004
Tuckova,J., Sebesta,V. (2001) Data Mining Approach for Prosody Modelling by ANN in Text-to-Speech Synthesis. In: Proc. of the Int. Conf. IAESTED AIA2001, Marbella, Spain, September 2001, pp. 161–166, ISBN:0-88986-301-6
Sebesta,V., Tuckova,J. (2001) Optimisation of Artificial Neural Network Topology Applied in the Prosody Control in Text-to-Speech Synthesis. In: ICANNGA’2001, Prague, Avril 2001, pp. 420–430, ISBN:3-540-41348-0
Tuckova,J., Sebesta,V. (2000) Prosody Modeling for a Text-to-Speech System by Artificial Neural Networks. In: Proc. of the Int. Conf. IAESTED SIP’2000, Las Vegas, USA, November 2000, pp.307–312, ISBN: 0-88986-308-3
Sebesta,V., Tuckova,J. (1999) Selection of Important Input Parameters for a Text-to-Speech Synthesis by Neural Networks. In:IJCNN’99(CD-ROM), Washington, D.C., USA, July 1999, IEEE Catalog Number: 99CH36339C,ISBN:0-7803-5532-6
Palkova, Z. (1994) Phonetics and phonologies of the Czech language (in Czech: Fonetika a fonologie češtiny). Univerzita Karlova-Praha, 1994, ISBN: 80-7066-843-1.
Matoušek, J. (2000) Text-to-Speech Systém Using Statistical Approach to Speech Segment Database Construction. PhD dissertation, UWB in Plzeň, Czech Republic, (in Czech).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Wien
About this paper
Cite this paper
Tučková, J., Šebesta, V. (2003). Influence of Language Parameters Selection on the Coarticulation of the Phonemes for Prosody Training in TTS by Neural Networks. In: Pearson, D.W., Steele, N.C., Albrecht, R.F. (eds) Artificial Neural Nets and Genetic Algorithms. Springer, Vienna. https://doi.org/10.1007/978-3-7091-0646-4_17
Download citation
DOI: https://doi.org/10.1007/978-3-7091-0646-4_17
Publisher Name: Springer, Vienna
Print ISBN: 978-3-211-00743-3
Online ISBN: 978-3-7091-0646-4
eBook Packages: Springer Book Archive