Skip to main content

Normalization of Non-standard Words with Finite State Transducers for Russian Speech Synthesis

  • Conference paper
  • First Online:
Analysis of Images, Social Networks and Texts (AIST 2015)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 542))

Abstract

This paper describes finite state transducers employed for expansion of numbers, acronyms and graphic abbreviations into full-word numerals and phrases in the task of Russian speech synthesis. The developed finite state transducers cover cardinal and ordinal numbers, convert phone numbers, dates, codes, etc. The developed project is the first Russian open-source normalization system known to the author.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/avlukanin/normatex

  2. 2.

    http://susu.ac.ru

  3. 3.

    http://ruscorpora.ru

  4. 4.

    http://cards.voicefabric.ru/

  5. 5.

    https://translate.google.ru/

References

  1. Reichel, U.D., Pfitzinger, H.R.: Text preprocessing for speech synthesis (2006)

    Google Scholar 

  2. The Festival Speech Synthesis System. http://www.cstr.ed.ac.uk/projects/festival/

  3. Unitex 3.1beta. http://www-igm.univ-mlv.fr/~unitex/

  4. Paumier, S.: Unitex 3.1.beta User Manual. Université Paris-Est Marne-la-Vallée. http://igm.univ-mlv.fr/~unitex/UnitexManual3.1.pdf (2015). Accessed 15 Jan 2015

  5. Dutoit, T.: An Introduction to Text-to-Speech Synthesis, vol. 3. Springer Science & Business Media, Berlin (1997)

    Google Scholar 

  6. Sproat, R., Black, A., Chen, S., Kumar, S., Ostendorfk, M., Richards, C.: Normalization of non-standard words. Comput. Speech Lang. 15, 287–333 (2001)

    Article  Google Scholar 

  7. Sproat, R.: Lightly supervised learning of text normalization: Russian number names. In: Spoken Language Technology Workshop (SLT), 2010 IEEE, pp. 436–441. IEEE, December 2010

    Google Scholar 

  8. Khomitsevich, O.G., Rybin, S.V., Anichkin, I.M.: Linguistic analysis for text normalization and homonymy resolution in a Russian TTS system [Иcпoльзoвaниe лингвиcтичecкoгo aнaлизa для нopмaлизaции тeкcтa и cнятия oмoнимии в cиcтeмe cинтeзa pyccкoй peчи]. Instrument making. Thematic issue “Speech information systems” [Пpибopocтpoeниe. Teмaтичecкий выпycк «Peчeвыe инфopмaциoнныe cиcтeмы»], vol. 2, pp. 42–46. Izvestija vuzov (2013)

    Google Scholar 

  9. Nagel, S.: Formenbildung im Russischen. Formale Beschreibung und Automatisierung für das CISLEX-Wörterbuchsystem (2002)

    Google Scholar 

  10. Russian Grammar [Pyccкaя гpaммaтикa], vol. 1. Nauka, Moscow (1980)

    Google Scholar 

  11. Rosental, D.E., Golub, I.B., Telenkova, M.A.: The Modern Russian Language [Coвpeмeнный pyccкий язык]. Airis-Press, Moscow (1997)

    Google Scholar 

  12. Rosental, D.E., Djandjakova, E.V., Kabanova, N.P.: Reference Book on Orthography, Pronunciation, Literary Editing [Cпpaвoчник пo пpaвoпиcaнию, пpoизнoшeнию, литepaтypнoмy peдaктиpoвaнию]. CheRo, Moscow (1998)

    Google Scholar 

  13. Linguistics. Big encyclopedic dictionary [Языкoзнaниe. Бoльшoй энциклoпeдичecкий cлoвapь]. Big Russian Encyclopedy, Moscow (1998)

    Google Scholar 

  14. Akhmanova, O.S.: The Dictionary of Linguistic Terms [Cлoвapь лингвиcтичecкиx тepминoв]. Editorial URSS, Moscow (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Artem Lukanin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Lukanin, A. (2015). Normalization of Non-standard Words with Finite State Transducers for Russian Speech Synthesis. In: Khachay, M., Konstantinova, N., Panchenko, A., Ignatov, D., Labunets, V. (eds) Analysis of Images, Social Networks and Texts. AIST 2015. Communications in Computer and Information Science, vol 542. Springer, Cham. https://doi.org/10.1007/978-3-319-26123-2_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26123-2_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26122-5

  • Online ISBN: 978-3-319-26123-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics