Abstract
Reading numbers correctly is an essential feature of today's text-to-speech (TTS) systems. This article discusses the problems of converting numbers into letters for Hungarian speech synthesis. The first part of the article analyzes numbers as an organic part of the text. Different representations of ordinals, cardinals, telephone numbers, and date and time formats are examined. Special characters, such as the comma, the period, and the plus or minus sign, as well as the various suffixes attached to numbers, may modify both the written and the spoken form of a number. Numerous inflected forms are introduced, with examples for each case. Next, the prosody of spoken numbers is analyzed; intonation, word stress, and pause assignment rules are discussed. Finally, implementation questions of number-to-letter (NTL) conversion are addressed with respect to application in TTS and number-to-speech systems.
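To make the number-to-letter step concrete, the following is a minimal sketch of how plain cardinals might be spelled out in Hungarian. It is not the algorithm described in the article: the function and table names are hypothetical, the range is limited to 0 through 999,999, and the hyphenation convention (three-digit groups separated by a hyphen above 2000) is assumed from standard Hungarian orthography rather than taken from the text. Ordinals, suffixes, dates, telephone numbers, and prosody are left out entirely.

```python
# Illustrative sketch; all names here are hypothetical, not from the article.
ONES = ["", "egy", "kettő", "három", "négy", "öt", "hat", "hét", "nyolc", "kilenc"]
TENS = ["", "tíz", "húsz", "harminc", "negyven", "ötven",
        "hatvan", "hetven", "nyolcvan", "kilencven"]
# 10 and 20 change form when a unit follows: tizenegy (11), huszonegy (21).
TENS_BEFORE_UNIT = {1: "tizen", 2: "huszon"}


def as_multiplier(word):
    """Use the short form 'két' instead of 'kettő' before 'száz' or 'ezer'."""
    if word.endswith("kettő"):
        return word[:-len("kettő")] + "két"
    return word


def below_thousand(n):
    """Spell 1..999 as a single unbroken word."""
    hundreds, rest = divmod(n, 100)
    tens, ones = divmod(rest, 10)
    parts = []
    if hundreds:
        # 100 is plain 'száz'; 200..900 take a multiplier prefix.
        prefix = "" if hundreds == 1 else as_multiplier(ONES[hundreds])
        parts.append(prefix + "száz")
    if tens:
        parts.append(TENS_BEFORE_UNIT.get(tens, TENS[tens]) if ones else TENS[tens])
    if ones:
        parts.append(ONES[ones])
    return "".join(parts)


def hungarian_cardinal(n):
    """Spell a cardinal in the range 0..999999 (range limit is an assumption)."""
    if not 0 <= n <= 999_999:
        raise ValueError("this sketch only covers 0..999999")
    if n == 0:
        return "nulla"
    thousands, rest = divmod(n, 1000)
    groups = []
    if thousands:
        # 1000 is plain 'ezer'; larger counts prefix the spelled-out multiplier.
        prefix = "" if thousands == 1 else as_multiplier(below_thousand(thousands))
        groups.append(prefix + "ezer")
    if rest:
        groups.append(below_thousand(rest))
    # Assumed orthographic rule: above 2000, groups are joined with a hyphen.
    separator = "-" if n > 2000 else ""
    return separator.join(groups)
```

Under these assumptions, `hungarian_cardinal(1999)` yields `ezerkilencszázkilencvenkilenc`, while `hungarian_cardinal(2001)` yields `kétezer-egy`. A production system would also need the context handling the abstract lists, such as suffixes and special characters.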
Cite this article
Olaszi, P. Analysis of Written and Spoken Form of Hungarian Numbers for TTS Applications. International Journal of Speech Technology 3, 177–186 (2000). https://doi.org/10.1023/A:1026506930945