Abstract
This paper brings together the development of the first Text-to- Speech (TTS) system for Sinhala using the Festival framework and practical applications of it. Construction of a diphone database and implementation of the natural language processing modules are described. The paper also presents the development methodology of direct Sinhala Unicode text input by rewriting letter-to-sound rules in Festival’s context sensitive rule format and the implementation of Sinhala syllabification algorithm. A Modified Rhyme Test (MRT) was conducted to evaluate the intelligibility of the synthesized speech and yielded a score of 71.5% for the TTS system described.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Black, A.W., Lenzo, K.A.: Building Synthetic Voices, Language Technologies Institute, Carnegie Mellon University and Cepstral LLC (retrieved from 2003), http://festvox.org/bsv/
Jones, R.J., Choy, A., Williams, B.: Integrating Festival and Windows. InterSpeech 2006. In: 9th International Conference on Spoken Language Processing, Pittsburgh, USA (2006)
Kamisetty, C., Adapa, S.M.: Telugu Festival Text-to-Speech System (retrieved from 2006), http://festival-te.sourceforge.net/wiki/Main_Page
Karunatillake, W.S.: An Introduction to Spoken Sinhala, 3rd edn. M.D. Gunasena & Co. ltd. 217, Olcott Mawatha, Colombo, 11 (2004)
Kishore, S.P., Sangal, R., Srinivas, M.: Building Hindi and Telugu Voices using Festvox. In: Proceedings of the International Conference On Natutal Language Processing 2002 (ICON-2002), Mumbai (2002)
Lemmetty, S.: Review of Speech Synthesis Technology, MSc. thesis, Helsinki University of Technology (1999)
Louw, A.: A Short Guide to Pitch-Marking in the Festival Speech Synthesis System and Recommendations for Improvement. Local Language Speech Technology Initiative (LLSTI) Reports. Retrieved from (n.d.), http://www.llsti.org/documents.htm
Ramakishnan, A.G., Bali, K., Talukdar, P.P., Krishna, N.S.: Tools for the Development of a Hindi Speech Synthesis System. In: 5th ISCA Speech Synthesis Workshop, Pittsburgh, pp. 109–114 (2004)
Salor, Ö., Pellom, B., Demirekler, M.: Implementation and Evaluation of a Textto- Speech Synthesis System for Turkish. In: Proceedings of Eurospeech-Interspeech 2003, Geneva, Switzerland, pp. 1573–1576 (2003)
Wasala, A., Weerasinghe, R., Gamage, K.: Sinhala Grapheme-to-Phoneme Conversion and Rules for Schwa Epenthesis. In: Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, Sydney, Australia, pp. 890–897 (2006)
Weerasinghe, R., Wasala, A., Gamage, K.: A Rule Based Syllabification Algorithm for Sinhala. In: Dale, R., Wong, K.-F., Su, J., Kwong, O.Y. (eds.) IJCNLP 2005. LNCS (LNAI), vol. 3651, pp. 438–449. Springer, Heidelberg (2005)
Williams, B., Jones, R.J., Uemlianin, I.: Tools and Resources for Speech Synthesis Arising from a Welsh TTS Project. In: Fifth Language Resources and Evaluation Conference (LREC), Genoa, Italy (2006)
Microsoft Corporation.: Microsoft Speech SDK Version 5.1. Retrieved from (n.d.), http://msdn2.microsoft.com/en-s/library/ms990097.aspx
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Weerasinghe, R., Wasala, A., Welgama, V., Gamage, K. (2007). Festival-si: A Sinhala Text-to-Speech System. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2007. Lecture Notes in Computer Science(), vol 4629. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74628-7_61
Download citation
DOI: https://doi.org/10.1007/978-3-540-74628-7_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74627-0
Online ISBN: 978-3-540-74628-7
eBook Packages: Computer ScienceComputer Science (R0)