Skip to main content

Text Normalization for the Pronunciation of Non-standard Words in an Inflected Language

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3025))

Abstract

In this paper we present a novel approach, called ”Text to Pronunciation (TtP)”, for the proper normalization of Non-Standard Words (NSWs) in unrestricted texts. The methodology deals with inflection issues for the consistency of the NSWs with the syntactic structure of the utterances they belong to. Moreover, for the achievement of an augmented auditory representation of NSWs in Text-to-Speech (TtS) systems, we introduce the coupling of the standard normalizer with: i) a language generator that compiles pronunciation formats and ii) VoiceXML attributes for the guidance of the underlying TtS to imitate the human speaking style in the case of numbers. For the evaluation of the above model in the Greek language we have used a 158K word corpus with 4499 numerical expressions. We achieved an internal error rate of 7,67% however, only 1,02% were perceivable errors due to the nature of the language.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mobius, B., Sproat, R., van Santen, J., Olive, J.: The Bell Labs German Text-To-Speech system: An overview. In: Proceedings of EUROSPEECH 1997, vol. IV, pp. 2443–2446 (1997)

    Google Scholar 

  2. Fries, G., Wirth, A.: FELIX – A TTS System with Improved pre-processing and source signal generation. In: Proceedings of EUROSPEECH 1997, vol. II, pp. 589–592 (1997)

    Google Scholar 

  3. Zingle, H.: Traitement de la prosodie allemande dans un systeme de synthese de la parole. These pour le ‘Doctorat d’Etat, Universite de Strasbourg II (1982)

    Google Scholar 

  4. Ooyama, Y., Miyazaki, M., Ikehara, S.: Natural Language Processing in a Japanese Text- To-Speech System. In: Proceedings of the Annual Computer Science Conference, pp. 40–47. ACM, New York (1987)

    Chapter  Google Scholar 

  5. Coughlin, D.: Leveraging Syntactic Information for Text Normalization. In: Matoušek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds.) TSD 1999. LNCS (LNAI), vol. 1692, pp. 95–100. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  6. Sproat, R., Black, A., Chen, S., Kumar, S., Ostendorf, M., Richards, C.: Normalization of non-standard words. Computer Speech and Language 15(3), 287–333 (2001)

    Article  Google Scholar 

  7. Olinsky, G., Black, A.: Non-Standard Word and Homograph Resolution for Asian Language Text Analysis. In: Proceedings of ICSLP 2000, Beijing, China (2000)

    Google Scholar 

  8. Xydas, G., Kouroupetroglou, G.: The DEMOSTHeNES Speech Composer. In: Proceedings of the 4th ISCA Tutorial and Research Workshop on Speech Synthesis, Perthshire, Scotland, August 29-September 1, pp. 167–172 (2001)

    Google Scholar 

  9. Babiniotis, G., Christou, K.: The Grammar of Modern Greek, II. The verb. Ellinika Grammata (1998)

    Google Scholar 

  10. Burnett, D., Walker, M., Hunt, A.: Speech Synthesis Markup Language Version 1.0. W3C Working Draft, http://www.w3.org/TR/speech-synthesis

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xydas, G., Karberis, G., Kouroupertroglou, G. (2004). Text Normalization for the Pronunciation of Non-standard Words in an Inflected Language. In: Vouros, G.A., Panayiotopoulos, T. (eds) Methods and Applications of Artificial Intelligence. SETN 2004. Lecture Notes in Computer Science(), vol 3025. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24674-9_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24674-9_41

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-21937-8

  • Online ISBN: 978-3-540-24674-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics