Difficulties with Wh-Questions in Czech TTS System

Jůzová, Markéta; Tihelka, Daniel

doi:10.1007/978-3-319-45510-5_41

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9924)

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

Abstract

Sentence intonation is important for distinguishing sentence types (declarative sentences, questions, etc.), especially in languages without fixed word order, so text-to-speech systems must handle it as well. This paper addresses wh-questions, whose intonation differs from that of the other basic question type, the yes/no question. We discuss the possibility of using wh-questions recorded during speech corpus preparation in speech synthesis. The inclusion and appropriate use of these recordings is tested in a real text-to-speech system and evaluated in listening tests. Furthermore, we examine how listeners perceive wh-questions, with the aim of revealing whether they really prefer the phonologically correct (falling) intonation in this type of question.

M. Jůzová: This research was supported by the Ministry of Education, Youth and Sports of the Czech Republic, project No. LO1506, and by a grant of the University of West Bohemia, project No. SGS-2016-039.

Notes

1. Note that only neutral speech is considered in this paper, since our TTS system does not currently handle emotions.
2. In (complex/compound) declarative sentences, the communicative function is manifested by falling intonation only in the last phrase. Therefore, we sometimes use the term declarative phrase in place of declarative sentence.

Author information

Authors and Affiliations

Department of Cybernetics and New Technologies for the Information Society, University of West Bohemia, Univerzitní 8, Plzeň, Czech Republic
Markéta Jůzová & Daniel Tihelka

Corresponding author

Correspondence to Markéta Jůzová.

Editor information

Editors and Affiliations

Masaryk University, Brno, Czech Republic
Petr Sojka, Aleš Horák, Ivan Kopeček & Karel Pala

About this paper

Cite this paper

Jůzová, M., Tihelka, D. (2016). Difficulties with Wh-Questions in Czech TTS System. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2016. Lecture Notes in Computer Science, vol 9924. Springer, Cham. https://doi.org/10.1007/978-3-319-45510-5_41

DOI: https://doi.org/10.1007/978-3-319-45510-5_41
Published: 03 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45509-9
Online ISBN: 978-3-319-45510-5
eBook Packages: Computer Science, Computer Science (R0)
