An Overview of the ILSP Unit Selection Text-to-Speech Synthesis System

Tsiakoulis, Pirros; Karabetsos, Sotiris; Chalamandaris, Aimilios; Raptis, Spyros

doi:10.1007/978-3-319-07064-3_30

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8445)

Included in the following conference series:

Hellenic Conference on Artificial Intelligence

Abstract

This paper presents an overview of the Text-to-Speech synthesis system developed at the Institute for Language and Speech Processing (ILSP), focusing on the key design issues of the system components. The system currently fully supports three languages (Greek, English, and Bulgarian) and is designed to be as language- and speaker-independent as possible. Experimental results show that the system produces high-quality synthetic speech in terms of naturalness and intelligibility. The system was recently ranked among the top three systems worldwide for English speech quality at the international Blizzard Challenge 2013 workshop.

Author information

Authors and Affiliations

Institute for Language and Speech Processing – Research Centre ATHENA, Artemidos 6 & Epidavrou, GR 15125, Athens, Greece
Pirros Tsiakoulis, Sotiris Karabetsos, Aimilios Chalamandaris & Spyros Raptis

Editor information

Editors and Affiliations

Department of Computer Science, University of Ioannina, GR 45110, Ioannina, Greece
Aristidis Likas
Department of Computer Science, University of Ioannina, P.O. Box 1186, 45110, Ioannina, Greece
Konstantinos Blekas
Hellenic Open University, GR 26335, Peribola, Patras, Greece
Dimitris Kalles

About this paper

Cite this paper

Tsiakoulis, P., Karabetsos, S., Chalamandaris, A., Raptis, S. (2014). An Overview of the ILSP Unit Selection Text-to-Speech Synthesis System. In: Likas, A., Blekas, K., Kalles, D. (eds) Artificial Intelligence: Methods and Applications. SETN 2014. Lecture Notes in Computer Science, vol 8445. Springer, Cham. https://doi.org/10.1007/978-3-319-07064-3_30

DOI: https://doi.org/10.1007/978-3-319-07064-3_30
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07063-6
Online ISBN: 978-3-319-07064-3
eBook Packages: Computer Science, Computer Science (R0)
