Abstract
Text and speech corpora are a prerequisite for developing an effective commercial text-to-speech (TTS) system based on concatenative synthesis. Because such a system must synthesize both common and domain-specific discourse, the corpora it draws on are of central importance. This paper presents the authors’ experience in creating a corpus for the Romanian language, designed to support a concatenative TTS system that can reproduce common and domain-specific sentences naturally.
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ordean, M.A., Şaupe, A., Ordean, M., Silaghi, G.C., Giurgea, C. (2012). A Romanian Language Corpus for a Commercial Text-To-Speech Application. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2012. Lecture Notes in Computer Science, vol 7499. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32790-2_49
DOI: https://doi.org/10.1007/978-3-642-32790-2_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32789-6
Online ISBN: 978-3-642-32790-2
eBook Packages: Computer Science, Computer Science (R0)