Abstract
This paper describes a diphone-based Unit Selection system developed by the Speech Technology Group of the Enginyeria Arquitectura La Salle School, Universitat Ramon Llull. The system works with a PSOLA synthesiser for the Catalan language, which is used in an Oral Synthesised Message Editor (EMOVS) and in Windows applications developed using Microsoft SAPI. Some common questions about Unit Selection are formulated in order to find solutions and achieve better segmental speech quality.
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Térmens, R.G.i., Sanz, I.I. (2000). Diphone-Based Unit Selection for Catalan Text-to-Speech Synthesis. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science, vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_47
DOI: https://doi.org/10.1007/3-540-45323-7_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41042-3
Online ISBN: 978-3-540-45323-9
eBook Packages: Springer Book Archive