German and Czech Speech Synthesis Using HMM-Based Speech Segment Database

Matoušek, Jindřich; Tihelka, Daniel; Psutka, Josef; Hesová, Jana

doi:10.1007/3-540-46154-X_24

Jindřich Matoušek³,
Daniel Tihelka³,
Josef Psutka³ &
…
Jana Hesová⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2448))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

580 Accesses

Abstract

This paper presents an experimental German speech synthesis system. As in case of a Czech text-to-speech system ARTIC, statistical approach (using hidden Markov models) was employed to build a speech segment database. This approach was confirmed to be language independent and it was shown to be capable of designing a quality database that led to an intelligible synthetic speech of a high quality. Some experiments with clustering the similar speech contexts were performed to enhance the quality of the synthetic speech. Our results show the superiority of phoneme-level clustering to subphoneme-level one.

This research was supported by the project no. MSM235200004 of the Ministry of Education of Czech Republic and the firm SpeechTech.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Donovan R.E., Woodland P.C.: A Hidden Markov-Model-Based Trainable Speech Synthesizer. Computer Speech and Language, 13. (1999) 223–241.
Article Google Scholar
Matoušek J., and Psutka J.: ARTIC: a New Czech Text-to-Speech System Using Statistical Approach to Speech Segment Database Construction. Proceedings of ICSLP2000, vol. IV. Beijing (2000) 612–615.
Google Scholar
Matoušek J.: Text-to-Speech Synthesis Using Statistical Approach to Automatic Speech Segment Database Construction (in Czech). Ph.D. thesis, Pilsen (2001).
Google Scholar
Matoušek J., Psutka J., and Krůta J.: On Building Speech Corpus for Concatenation-Based Speech Synthesis. Proceedings of Eurospeech2001, vol 3. AAlborg (2001) 2047–2050.
Google Scholar
Gibbon D., Moore R., and Winski T.: Handbook of Standards and Resources for Spoken Language Systems. Mouton de Gruyter. Berlin (1997).
Google Scholar
Young S.: Tree-Based State Tying for High Accuracy Acoustic Modelling. Proceedings of the ARPA Workshop on Human Language Technology. Plainsboro, New Jersey (1994) 307–312.
Google Scholar
Hon H., Acero A., Huang X., Liu J., and Plumpe M.: Automatic Generation of Synthesis Units for Trainable Text-to-Speech Systems. Proceedings of ICASSP’98, vol. 1, Seattle (1998) 293–296.
Google Scholar
Young S. et al.: The HTK Book. Entropic Inc. (1999).
Google Scholar
Psutka J.: Communication with Computer by Speech (in Czech). Academia, Prague (1995).
Google Scholar
Duden. Aussprachenwörterbuch (in German). Max Mangold, Duden-Verlag, vol. 6, Mannheim (1990).
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Cybernetics, University of West Bohemia, Univerzitní 8, 306 14, Plzeň, Czech Republic
Jindřich Matoušek, Daniel Tihelka & Josef Psutka
Department of Applied Linguistics, University of West Bohemia, Riegrova 11, 306 14, Plzeň, Czech Republic
Jana Hesová

Authors

Jindřich Matoušek
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Tihelka
View author publications
You can also search for this author in PubMed Google Scholar
Josef Psutka
View author publications
You can also search for this author in PubMed Google Scholar
Jana Hesová
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics Department of Programming Systems and Communication, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Petr Sojka
Faculty of Informatics Department of Information Technologies, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Ivan Kopeček & Karel Pala &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Matoušek, J., Tihelka, D., Psutka, J., Hesová, J. (2002). German and Czech Speech Synthesis Using HMM-Based Speech Segment Database. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_24

Download citation

DOI: https://doi.org/10.1007/3-540-46154-X_24
Published: 23 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics