A Large Speech Database for Brazilian Portuguese Spoken Language Research

Ynoguti, Carlos Alberto; Barbosa, Plínio Almeida; Violaro, Fábio

doi:10.1007/3-540-45011-4_30

Carlos Alberto Ynoguti⁴,
Plínio Almeida Barbosa⁵ &
Fábio Violaro⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2721))

Included in the following conference series:

International Workshop on Computational Processing of the Portuguese Language

464 Accesses

Abstract

Speech recognition systems use statistical methods based algorithms, and therefore need several training samples to perform properly. Consequently such systems require huge databases for training and testing. The development of large speech corpora in Europe and in the USA was possible only with the cooperation among research centers, universities, private companies and the government. In these countries, the availability of such databases provided the resources which were responsible for the great improvement in speech technologies in the last few years. In Brazil, such consortiums are not even mentioned, and the researchers have to work with small, locally developed databases. In this article we report an effort to develop a large speech corpus for Brazilian Portuguese to fill this crucial gap.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Chhattisgarhi speech corpus for research and development in automatic speech recognition

Article 16 February 2018

A review on speech recognition approaches and challenges for Portuguese: exploring the feasibility of fine-tuning large-scale end-to-end models

Article Open access 21 January 2025

The ParlaSpeech Collection of Automatically Generated Speech and Text Datasets from Parliamentary Proceedings

References

Albano, E.C. and Moreira, A.A., “Archisegment-Based Letter-to-Phone Conversion for Concatenative Speech Synthesis in Portuguese”, Proceedings ICSLP’96, 1996, v.3, pp. 1708–1711.
Google Scholar
Benoît, C. “An intelligibility test using semantically unpredictable sentences: Towards the quantification of linguistic complexity”. Speech Communication 9, 1990, pp. 293–303.
Article Google Scholar
“BD-PUBLICO (Base de Dados em Português eUropeu, vocaBulário Largo, Independente do orador e fala Contínua)” http://www.speech.inesc.pt/bib/Trancoso98a/bdpub.html (31/03/1999).
Cole, R., ed., Survey of the State of the Art in Human Language Technology, http://cslu.cse.ogi.edu/publications/index.htm, (26/10/98).
“EUROM_1: a multilingual European speech database”. http://www.icp.grenet.fr/Relator/multiling/eurom1.html∖#PortugCorpus (31/03/1999)

Download references

Author information

Authors and Affiliations

Departamento de Telecomunicações, Instituto Nacional de Telecomunicações, Santa Rita do Sapucaí, MG, Brazil
Carlos Alberto Ynoguti
Instituto de Estudos da Linguagem, Universidade Estadual de Campinas, Campinas, SP, Brazil
Plínio Almeida Barbosa
Faculdade de Engenharia Elétrica, Universidade Estadual de Campinas, Campinas, SP, Brazil
Fábio Violaro

Authors

Carlos Alberto Ynoguti
View author publications
You can also search for this author in PubMed Google Scholar
Plínio Almeida Barbosa
View author publications
You can also search for this author in PubMed Google Scholar
Fábio Violaro
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

L2F, INESC-ID Lisboa, Technical University of Lisbon, Rua Alves Redol, 9, 1000-029, Lisbon, Portugal
Nuno J. Mamede & Isabel Trancoso &
Faculty of Humanities and Social Sciences, University of Algarve, Campus de Gambelas, 8005-139, Faro, Portugal
Jorge Baptista
NILC, ICMC-USP São-Carlos, Av. do Trabalhador São-Carlense, 400, 13560-970, São Carlos, SP, Brazil
Maria das Graças Volpe Nunes

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ynoguti, C.A., Barbosa, P.A., Violaro, F. (2003). A Large Speech Database for Brazilian Portuguese Spoken Language Research. In: Mamede, N.J., Trancoso, I., Baptista, J., das Graças Volpe Nunes, M. (eds) Computational Processing of the Portuguese Language. PROPOR 2003. Lecture Notes in Computer Science(), vol 2721. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45011-4_30

Download citation

DOI: https://doi.org/10.1007/3-540-45011-4_30
Published: 18 June 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40436-1
Online ISBN: 978-3-540-45011-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

A Large Speech Database for Brazilian Portuguese Spoken Language Research

Abstract

Access this chapter

Preview

Similar content being viewed by others

Chhattisgarhi speech corpus for research and development in automatic speech recognition

A review on speech recognition approaches and challenges for Portuguese: exploring the feasibility of fine-tuning large-scale end-to-end models

The ParlaSpeech Collection of Automatically Generated Speech and Text Datasets from Parliamentary Proceedings

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

A Large Speech Database for Brazilian Portuguese Spoken Language Research

Abstract

Access this chapter

Preview

Similar content being viewed by others

Chhattisgarhi speech corpus for research and development in automatic speech recognition

A review on speech recognition approaches and challenges for Portuguese: exploring the feasibility of fine-tuning large-scale end-to-end models

The ParlaSpeech Collection of Automatically Generated Speech and Text Datasets from Parliamentary Proceedings

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation