Abstract
The Framework FIVE is a multiplatform tool that assists the development of voice user interfaces applied in different technological environments. Several works have been carried out in order to provide increasingly in natural synthetic voices to FIVE, however, experiments realized with users has been reported the need for more friendly voices integrated to the framework. This paper describes the development and integration of natural synthetic voices in Brazilian Portuguese to the Framework FIVE. For this, a private audio and phonetics database were used on development of two voices (male and female) using the Unit Selection technique available on MaryTTS platform. For the integration process it was developed a specific web service. For comparison purposes, it was realized experiments to evaluate the naturalness and intelligibility of the voices, and the results obtained show that the constructed voices are more friendly, however, there is not a great difference when compared with HMM-based technique.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Nuance Language: http://www.nuance.com.
- 2.
Microsoft Language: https://www.microsoft.com/en-us/download.
- 3.
ProTools: http://www.avid.com/pro-tools.
- 4.
- 5.
- 6.
References
Farinazzo, V., Kawamoto, A.L.S., de Oliveira Neto, J.S., Salvador, M.: An Empirical Approach for the Evaluation of Voice User Interfaces. INTECH Open Access Publisher, Rijeka (2010)
Branco, A., Mendes, A., Pereira, S., Henriques, P., Pellegrini, T., Meinedo, H., Trancoso, I., Quaresma, P., de Lima, V.L.S.: The Portuguese language in the digital age (2012)
Couto, I., Neto, N., Tadaiesky, V., Klautau, A., Maia, R.: An open source HMM-based text-to-speech system for Brazilian Portuguese. In: 7th International Telecommunications Symposium (2010)
Schröder, M., Trouvain, J.: The German text-to-speech synthesis system MARY: a tool for research, development and teaching. Int. J. Speech Technol. 6(4), 365–377 (2003)
Maciel, A., Carvalho, E.: FIVE-framework for an integrated voice environment. In: Proceedings of International Conference on Systems, Signal and Image Processing, Rio de Janeiro (2010)
Maciel, A., Carvalho Filho, E.: Integration and evaluation of an HMM-based text-to-speech system to FIVE. In: 2012 19th International Conference on Systems, Signals and Image Processing (IWSSIP), pp. 633–636. IEEE (2012)
Souza, D., Saturnino, L., Maciel, A.M.: A portability evaluation of Brazilian Portuguese voices produced with MARY TTS. In: 2014 International Conference on Systems, Signals and Image Processing (IWSSIP), pp. 95–98. IEEE (2014)
Charfuelan, M., Pammi, S., Steiner, I.: MARY TTS unit selection and HMM-based voices for the blizzard challenge 2013. In: Blizzard Challenge Workshop (2013)
Maven: Software project. https://maven.apache.org
Gradle: Build tool. https://gradle.org/
Le Maguer, S., Steiner, I.: The MaryTTS entry for the blizzard challenge 2016 (2016)
Steiner, I., Le Maguer, S., Manzoni, J., Gilles, P., Trouvain, J.: Developing new language tools for MaryTTS: the case of Luxembourgish. In: 28th Conference on Electronic Speech Signal Processing (ESSV), Saarbrücken, Germany (2017)
Pammi, S., Charfuelan, M., Schröder, M.: Multilingual voice creation toolkit for the MARY TTS platform. In: LREC. Citeseer (2010)
Acknowledgments
The authors would like to thank the support of this work through the research projects granted by: “CNPQ-Bolsa de Produtividade DT” (Process 310752/ 2015-9), “CNPQ Edital Universal” (Process 444745/2014-9) and “FACEPE - Edital PRONEX” (Process APQ 0880-1.03/14).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Barbosa, D.S., Bezerra, B.L.D., Maciel, A.M.A. (2017). Development and Integration of Natural Brazilian Portuguese Synthetic Voices to Framework FIVE. In: Ekštein, K., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2017. Lecture Notes in Computer Science(), vol 10415. Springer, Cham. https://doi.org/10.1007/978-3-319-64206-2_55
Download citation
DOI: https://doi.org/10.1007/978-3-319-64206-2_55
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-64205-5
Online ISBN: 978-3-319-64206-2
eBook Packages: Computer ScienceComputer Science (R0)