Skip to main content

Development and Integration of Natural Brazilian Portuguese Synthetic Voices to Framework FIVE

  • Conference paper
  • First Online:
Text, Speech, and Dialogue (TSD 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10415))

Included in the following conference series:

Abstract

The Framework FIVE is a multiplatform tool that assists the development of voice user interfaces applied in different technological environments. Several works have been carried out in order to provide increasingly in natural synthetic voices to FIVE, however, experiments realized with users has been reported the need for more friendly voices integrated to the framework. This paper describes the development and integration of natural synthetic voices in Brazilian Portuguese to the Framework FIVE. For this, a private audio and phonetics database were used on development of two voices (male and female) using the Unit Selection technique available on MaryTTS platform. For the integration process it was developed a specific web service. For comparison purposes, it was realized experiments to evaluate the naturalness and intelligibility of the voices, and the results obtained show that the constructed voices are more friendly, however, there is not a great difference when compared with HMM-based technique.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Nuance Language: http://www.nuance.com.

  2. 2.

    Microsoft Language: https://www.microsoft.com/en-us/download.

  3. 3.

    ProTools: http://www.avid.com/pro-tools.

  4. 4.

    Vero: https://pt-br.libreoffice.org/projetos/vero/.

  5. 5.

    NLST: https://github.com/marytts/marytts/wiki/New-Language-Support.

  6. 6.

    VCT: https://github.com/marytts/marytts/wiki/VoiceImportToolsTutorial.

References

  1. Farinazzo, V., Kawamoto, A.L.S., de Oliveira Neto, J.S., Salvador, M.: An Empirical Approach for the Evaluation of Voice User Interfaces. INTECH Open Access Publisher, Rijeka (2010)

    Google Scholar 

  2. Branco, A., Mendes, A., Pereira, S., Henriques, P., Pellegrini, T., Meinedo, H., Trancoso, I., Quaresma, P., de Lima, V.L.S.: The Portuguese language in the digital age (2012)

    Google Scholar 

  3. Couto, I., Neto, N., Tadaiesky, V., Klautau, A., Maia, R.: An open source HMM-based text-to-speech system for Brazilian Portuguese. In: 7th International Telecommunications Symposium (2010)

    Google Scholar 

  4. Schröder, M., Trouvain, J.: The German text-to-speech synthesis system MARY: a tool for research, development and teaching. Int. J. Speech Technol. 6(4), 365–377 (2003)

    Article  Google Scholar 

  5. Maciel, A., Carvalho, E.: FIVE-framework for an integrated voice environment. In: Proceedings of International Conference on Systems, Signal and Image Processing, Rio de Janeiro (2010)

    Google Scholar 

  6. Maciel, A., Carvalho Filho, E.: Integration and evaluation of an HMM-based text-to-speech system to FIVE. In: 2012 19th International Conference on Systems, Signals and Image Processing (IWSSIP), pp. 633–636. IEEE (2012)

    Google Scholar 

  7. Souza, D., Saturnino, L., Maciel, A.M.: A portability evaluation of Brazilian Portuguese voices produced with MARY TTS. In: 2014 International Conference on Systems, Signals and Image Processing (IWSSIP), pp. 95–98. IEEE (2014)

    Google Scholar 

  8. Charfuelan, M., Pammi, S., Steiner, I.: MARY TTS unit selection and HMM-based voices for the blizzard challenge 2013. In: Blizzard Challenge Workshop (2013)

    Google Scholar 

  9. Maven: Software project. https://maven.apache.org

  10. Gradle: Build tool. https://gradle.org/

  11. Le Maguer, S., Steiner, I.: The MaryTTS entry for the blizzard challenge 2016 (2016)

    Google Scholar 

  12. Steiner, I., Le Maguer, S., Manzoni, J., Gilles, P., Trouvain, J.: Developing new language tools for MaryTTS: the case of Luxembourgish. In: 28th Conference on Electronic Speech Signal Processing (ESSV), Saarbrücken, Germany (2017)

    Google Scholar 

  13. Pammi, S., Charfuelan, M., Schröder, M.: Multilingual voice creation toolkit for the MARY TTS platform. In: LREC. Citeseer (2010)

    Google Scholar 

Download references

Acknowledgments

The authors would like to thank the support of this work through the research projects granted by: “CNPQ-Bolsa de Produtividade DT” (Process 310752/ 2015-9), “CNPQ Edital Universal” (Process 444745/2014-9) and “FACEPE - Edital PRONEX” (Process APQ 0880-1.03/14).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Danilo S. Barbosa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Barbosa, D.S., Bezerra, B.L.D., Maciel, A.M.A. (2017). Development and Integration of Natural Brazilian Portuguese Synthetic Voices to Framework FIVE. In: Ekštein, K., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2017. Lecture Notes in Computer Science(), vol 10415. Springer, Cham. https://doi.org/10.1007/978-3-319-64206-2_55

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-64206-2_55

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-64205-5

  • Online ISBN: 978-3-319-64206-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics