Speech Corpus Preparation for Voice Banking of Laryngectomised Patients

Jůzová, Markéta; Romportl, Jan; Tihelka, Daniel

doi:10.1007/978-3-319-24033-6_32

Markéta Jůzová^15,17,
Jan Romportl^16,17 &
Daniel Tihelka¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9302))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

1844 Accesses
6 Citations

Abstract

This paper focuses on voice banking and creating personalised speech synthesis of laryngectomised patients who lose their voice after this radical surgery. Specific aspects of voice banking are discussed in the paper, including a description of the adjustments of the generic methods. The main attention is paid to the speech corpus building since the quality of synthesised speech depends a lot on the speech units variability and the number of their occurrences. Also some statistics and characteristics of the first experimental voices are presented and the possibility of using different speech synthesis methods depending on the voice quality and speech corpus size is pointed out.

The research leading to these results has received funding from the Norwegian Financial Mechanism 2009-2014 and the Ministry of Education, Youth and Sports under Project Contract no. MSMT-28477/2014, Project no. 7F14236 (HCENAT). This work was also supported by the grant of the University of West Bohemia, project No. SGS-2013-032.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hanzlíček, Z., Romportl, J., Matoušek, J.: Voice conservation: towards creating a speech-aid system for total laryngectomees. In: Kelemen, J., Romportl, J., Zackova, E. (eds.) Beyond Artificial Intelligence. TIEI, vol. 4, pp. 203–212. Springer, Heidelberg (2013)
Chapter Google Scholar
Romportl, J., Řepová, B., Betka, J.: Vocal rehabilitation of laryngectomised patients by personalised computer speech synthesis. In: Zehnhoff-Dinnesen, A., Schindler, A., Wiskirska-Woznica, B., Zorowka, P., Nawka, T., Sopko, J. (eds.) Phoniatrics. European Manual of Medicine. Springer (2015 in press)
Google Scholar
Matoušek, J., Romportl, J.: On building phonetically and prosodically rich speech corpus for text-to-speech synthesis. In: Proceedings of the Second IASTED International Conference on Computational Intelligence. ACTA Press, San Francisco, pp. 442–447 (2006)
Google Scholar
Matoušek, J., Psutka, J., Krůta, J.: Design of speech corpus for text-to-speech synthesis. In: Eurospeech 2001 - Interspeech, Proceedings of the 7th European Conference on Speech Communication and Technology, Aalborg, Denmark, pp. 2047–2050 (2001)
Google Scholar
Matoušek, J., Tihelka, D., Psutka, J.: New slovak unit-selection speech synthesis in ARTIC TTS system. In: Proceedings of the World Congress on Engineering and Computer Science 2011, San Francisco, USA, pp. 485–490 (2011)
Google Scholar
Romportl, J.: Structural data-driven prosody model for TTS synthesis. In: Proceedings of the Speech Prosody 2006 Conference, pp. 549–552. TUDpress, Dresden (2006)
Google Scholar
Matoušek, J., Tihelka, D., Romportl, J.: Current state of czech text-to-speech system ARTIC. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2006. LNCS (LNAI), vol. 4188, pp. 439–446. Springer, Heidelberg (2006)
Chapter Google Scholar
Matoušek, J., Romportl, J.: Recording and annotation of speech corpus for czech unit selection speech synthesis. In: Matoušek, V., Mautner, P. (eds.) TSD 2007. LNCS (LNAI), vol. 4629, pp. 326–333. Springer, Heidelberg (2007)
Chapter Google Scholar
Matoušek, J., Tihelka, D., Romportl, J.: Building of a speech corpus optimised for unit selection TTS synthesis. In: Proceedings of 6th International Conference on Language Resources and Evaluation, LREC 2008. ELRA (2008)
Google Scholar
Hanzlíček, Z.: Czech HMM-based speech synthesis. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2010. LNCS, vol. 6231, pp. 291–298. Springer, Heidelberg (2010)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Cybernetics, University of West Bohemia, Pilsen, Czech Republic
Markéta Jůzová
Department of Interdisciplinary Activities, New Technologies – Research Centre, University of West Bohemia, Pilsen, Czech Republic
Jan Romportl
New Technologies for the Information Society, University of West Bohemia, Pilsen, Czech Republic
Markéta Jůzová, Jan Romportl & Daniel Tihelka

Authors

Markéta Jůzová
View author publications
You can also search for this author in PubMed Google Scholar
Jan Romportl
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Tihelka
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Markéta Jůzová .

Editor information

Editors and Affiliations

University of West Bohemia, Pilsen, Czech Republic
Pavel Král
University of West Bohemia, Pilsen, Czech Republic
Václav Matoušek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jůzová, M., Romportl, J., Tihelka, D. (2015). Speech Corpus Preparation for Voice Banking of Laryngectomised Patients. In: Král, P., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2015. Lecture Notes in Computer Science(), vol 9302. Springer, Cham. https://doi.org/10.1007/978-3-319-24033-6_32

Download citation

DOI: https://doi.org/10.1007/978-3-319-24033-6_32
Published: 11 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24032-9
Online ISBN: 978-3-319-24033-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics