A Concatenative Synthesis Based Speech Synthesiser for Hindi

Gupta, Kshitij

doi:10.1007/978-1-4020-8741-7_47

Kshitij Gupta²

1847 Accesses

Abstract

The document presents a speech synthesis system based upon hidden Markov models and decision trees. The development follows the approach generally implemented for speech recognition systems. A front end analysis of the database is done and the derived feature vectors are subject to phonetic hidden Markov models, which are then clustered based upon a decision-tree approach that models the context of the phones being considered. The individual phone segments form an acoustic leaf sequence which divides various contexts into equivalence classes. During synthesis, the phone sequence to be synthesised is translated into an acoustic leaf sequence by posing questions associated with different nodes of the decision tree to the immediate context of the phone. The sequence of terminal nodes thus obtained is amalgamated to obtain the desired pronunciation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bulyko, I., Ostendorf, M., “The Impact of speech recognition on speech synthesis,” in Proc. of the IEEE Workshop on Speech Synthesis, av11-13 Sept., 2002., pp. 99-106.
Google Scholar
Pitrelli, J. F., Bakis, R., Eide, E. M., Fernandez, R., Hamza, W., Picheny, M. A., “The IBM expressive text-to-speech synthesis system for American English”. IEEE transactions on Audio, Speech and Language Processing, vol. 14, no. 4, pp. 1099-1108, July 2006.
Google Scholar
Kumar, M., Rajput, N., Verma, A.,“A Large Vocabulary Continuous Speech Recognition System for Hindi”, IBM J. RES. & DEV. 48 (5/6) September/November 2004.
Google Scholar
Donovan, R. E., “Trainable Speech Synthesis,” Ph.D. thesis, Cambridge University Engineering Department, 1996.
Google Scholar
Rajput , N., Subramanium, L. V., Verma, A., “Adapting phonetic decision trees between languages for continuous speech recognition”, International Conference on Spoken Language Processing, vol.3, pp. 850-852, October 16-20, 2000.
Google Scholar
Kalika Bali, Partha Pratim Talukdar, N. Sridhar Krishna, A.G. Ramakrishnan, “Tools for the Development of a Hindi Speech Synthesis System”, In 5th ISCA Speech Synthesis Workshop, Pittsburgh, pp.109-114, 2004.
Google Scholar

Download references

Author information

Authors and Affiliations

IIM, Ahmedabad
Kshitij Gupta

Authors

Kshitij Gupta
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Engineering, University of Bridgeport, 221 University Avenue, Bridgeport, CT 06604, USA
Tarek Sobh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gupta, K. (2008). A Concatenative Synthesis Based Speech Synthesiser for Hindi. In: Sobh, T. (eds) Advances in Computer and Information Sciences and Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-8741-7_47

Download citation

DOI: https://doi.org/10.1007/978-1-4020-8741-7_47
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-8740-0
Online ISBN: 978-1-4020-8741-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics