Speech Corpus Preparation for Voice Banking of Laryngectomised Patients

  • Conference paper
Text, Speech, and Dialogue (TSD 2015)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9302)

Abstract

This paper focuses on voice banking and the creation of personalised speech synthesis for laryngectomised patients, who lose their voice after this radical surgery. It discusses specific aspects of voice banking, including the adjustments made to the generic methods. The main attention is paid to building the speech corpus, since the quality of synthesised speech depends heavily on the variability of the speech units and the number of their occurrences. Some statistics and characteristics of the first experimental voices are also presented, and the possibility of using different speech synthesis methods depending on the voice quality and the speech corpus size is pointed out.
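
To make the point about unit variability and occurrence counts concrete, the following minimal sketch counts how often each unit occurs in a set of candidate sentences and greedily picks the sentences that add the most new units. It is an illustration only, not the authors' method: using adjacent phone pairs (diphones) as the unit, the greedy criterion, and the names `diphones`, `unit_counts` and `greedy_select` are all assumptions made for this example.

```python
from collections import Counter


def diphones(phones):
    """Return adjacent phone pairs; a simple stand-in for speech units (assumption)."""
    return list(zip(phones, phones[1:]))


def unit_counts(sentences):
    """Count how many times each unit occurs across all sentences."""
    counts = Counter()
    for phones in sentences:
        counts.update(diphones(phones))
    return counts


def greedy_select(sentences, max_sentences):
    """Greedily pick sentences that add the most not-yet-covered units.

    Returns the indices of the chosen sentences and the set of covered units.
    This is a hypothetical selection criterion, not the one used in the paper.
    """
    covered = set()
    chosen = []
    remaining = list(range(len(sentences)))
    while remaining and len(chosen) < max_sentences:
        # Pick the sentence with the largest number of new units.
        best = max(remaining, key=lambda i: len(set(diphones(sentences[i])) - covered))
        gain = set(diphones(sentences[best])) - covered
        if not gain:
            break  # No remaining sentence adds anything new.
        chosen.append(best)
        covered |= gain
        remaining.remove(best)
    return chosen, covered


# Toy phone sequences (made up for illustration).
sentences = [["a", "b", "c"], ["a", "b"], ["c", "d", "e", "f"]]
counts = unit_counts(sentences)
chosen, covered = greedy_select(sentences, max_sentences=2)
```

With a limited recording budget, which is likely the case for patients recording before surgery, a selection of this kind favours sentences that broaden unit coverage, while the occurrence counts show which units remain rare.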

The research leading to these results has received funding from the Norwegian Financial Mechanism 2009-2014 and the Ministry of Education, Youth and Sports under Project Contract no. MSMT-28477/2014, Project no. 7F14236 (HCENAT). This work was also supported by the grant of the University of West Bohemia, project No. SGS-2013-032.

Author information

Corresponding author

Correspondence to Markéta Jůzová.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Jůzová, M., Romportl, J., Tihelka, D. (2015). Speech Corpus Preparation for Voice Banking of Laryngectomised Patients. In: Král, P., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2015. Lecture Notes in Computer Science, vol 9302. Springer, Cham. https://doi.org/10.1007/978-3-319-24033-6_32

  • DOI: https://doi.org/10.1007/978-3-319-24033-6_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24032-9

  • Online ISBN: 978-3-319-24033-6

  • eBook Packages: Computer Science (R0)
