Abstract
The document presents a speech synthesis system based upon hidden Markov models and decision trees. The development follows the approach generally implemented for speech recognition systems. A front end analysis of the database is done and the derived feature vectors are subject to phonetic hidden Markov models, which are then clustered based upon a decision-tree approach that models the context of the phones being considered. The individual phone segments form an acoustic leaf sequence which divides various contexts into equivalence classes. During synthesis, the phone sequence to be synthesised is translated into an acoustic leaf sequence by posing questions associated with different nodes of the decision tree to the immediate context of the phone. The sequence of terminal nodes thus obtained is amalgamated to obtain the desired pronunciation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bulyko, I., Ostendorf, M., “The Impact of speech recognition on speech synthesis,” in Proc. of the IEEE Workshop on Speech Synthesis, av11-13 Sept., 2002., pp. 99-106.
Pitrelli, J. F., Bakis, R., Eide, E. M., Fernandez, R., Hamza, W., Picheny, M. A., “The IBM expressive text-to-speech synthesis system for American English”. IEEE transactions on Audio, Speech and Language Processing, vol. 14, no. 4, pp. 1099-1108, July 2006.
Kumar, M., Rajput, N., Verma, A.,“A Large Vocabulary Continuous Speech Recognition System for Hindi”, IBM J. RES. & DEV. 48 (5/6) September/November 2004.
Donovan, R. E., “Trainable Speech Synthesis,” Ph.D. thesis, Cambridge University Engineering Department, 1996.
Rajput , N., Subramanium, L. V., Verma, A., “Adapting phonetic decision trees between languages for continuous speech recognition”, International Conference on Spoken Language Processing, vol.3, pp. 850-852, October 16-20, 2000.
Kalika Bali, Partha Pratim Talukdar, N. Sridhar Krishna, A.G. Ramakrishnan, “Tools for the Development of a Hindi Speech Synthesis System”, In 5th ISCA Speech Synthesis Workshop, Pittsburgh, pp.109-114, 2004.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer Science+Business Media B.V.
About this paper
Cite this paper
Gupta, K. (2008). A Concatenative Synthesis Based Speech Synthesiser for Hindi. In: Sobh, T. (eds) Advances in Computer and Information Sciences and Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-8741-7_47
Download citation
DOI: https://doi.org/10.1007/978-1-4020-8741-7_47
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-8740-0
Online ISBN: 978-1-4020-8741-7
eBook Packages: Computer ScienceComputer Science (R0)