Minimum Text Corpus Selection for Limited Domain Speech Synthesis

Jůzová, Markéta; Tihelka, Daniel

doi:10.1007/978-3-319-10816-2_48

Markéta Jůzová²¹ &
Daniel Tihelka²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8655))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

1576 Accesses

Abstract

This paper concerns limited domain TTS system based on the concatenative method, and presents an algorithm capable to extract the minimal domain-oriented text corpus from the real data of the given domain, while still reaching the maximum coverage of the domain. The proposed approach ensures that the least amount of texts are extracted, containing the most common phrases and (possibly) all the words from the domain. At the same time, it ensures that appropriate phrase overlapping is kept, allowing to find smooth concatenation in the overlapped regions to reach high quality synthesized speech. In addition, several recommendations allowing a speaker to record the corpus more fluently and comfortably are presented and discussed. The corpus building is tested and evaluated on several domains differing in size and nature, and the authors present the results of the algorithm and demonstrate the advantages of using the domain oriented corpus for speech synthesis.

This work was supported by the European Regional Development Fund (ERDF), project “New Technologies for Information Society” (NTIS), European Centre of Excellence, CZ.1.05/1.1.00/02.0090, the Technology Agency of the Czech Republic, project No. TA01030476 and SGS-2013-032.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Design of a Yoruba Language Speech Corpus for the Purposes of Text-to-Speech (TTS) Synthesis

Emilia: a speech corpus for Argentine Spanish text to speech synthesis

Article 02 February 2019

Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies

References

Brenton, H., Gillies, M., Ballin, D., Chatting, D.: The uncanny valley: does it exist. In: 19th British HCI Group Annual Conference: Workshop on Human-animated Character Interaction (2005)
Google Scholar
Tihelka, D., Kala, J., Matoušek, J.: Enhancements of Viterbi search for fast unit selection synthesis. In: Proceedings of 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, pp. 174–177 (2010)
Google Scholar
Grůber, M., Hanzlíček, Z.: Czech expressive speech synthesis in limited domain: Comparison of unit selection and HMM-based approaches. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 656–664. Springer, Heidelberg (2012)
Chapter Google Scholar
Matoušek, J., Tihelka, D., Romportl, J.: Building of a speech corpus optimised for unit selection tts synthesis. In: LREC 2008, Proceedings of 6th International Conference on Language Resources and Evaluation. ELRA (2008)
Google Scholar
Matoušek, J., Romportl, J.: On building phonetically and prosodically rich speech corpus for text-to-speech synthesis. In: Proc. of the Second IASTED Int. Conf. on Computational intelligence, pp. 442–447. ACTA Press, San Francisco (2006)
Google Scholar
Tihelka, D.: Towards automatic measure of similarity for use in unit selection. In: 9th Int. Conf. on Signal Processing, ICSP 2008, Beijing, China, pp. 637–642 (2008)
Google Scholar
Black, A.W., Zen, H., Tokuda, K.: Statistical parametric speech synthesis. In: Proc. ICASSP 2007, pp. 1229–1232 (2007)
Google Scholar
Labov, W.: The Social Stratification of English in New York City. Center for Applied Linguistics, Washington, DC (1966)
Google Scholar
Jůzová, M., Tihelka, D.: Tuning limited domain speech synthesis using general tts system. Accepted at Text, Speech and Dialogue 2014 (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

University of West Bohemia, Univerzitní 8, Plzeň, Czech Republic
Markéta Jůzová & Daniel Tihelka

Authors

Markéta Jůzová
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Tihelka
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Botanicá 6a, 60200, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Department of Information Technologies, Masaryk University, 602 00, Brno, Czech Republic
Aleš Horák , Ivan Kopeček & Karel Pala , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jůzová, M., Tihelka, D. (2014). Minimum Text Corpus Selection for Limited Domain Speech Synthesis. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science(), vol 8655. Springer, Cham. https://doi.org/10.1007/978-3-319-10816-2_48

Download citation

DOI: https://doi.org/10.1007/978-3-319-10816-2_48
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10815-5
Online ISBN: 978-3-319-10816-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Minimum Text Corpus Selection for Limited Domain Speech Synthesis

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Design of a Yoruba Language Speech Corpus for the Purposes of Text-to-Speech (TTS) Synthesis

Emilia: a speech corpus for Argentine Spanish text to speech synthesis

Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Minimum Text Corpus Selection for Limited Domain Speech Synthesis

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Design of a Yoruba Language Speech Corpus for the Purposes of Text-to-Speech (TTS) Synthesis

Emilia: a speech corpus for Argentine Spanish text to speech synthesis

Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation