The SWARA speech corpus: A large parallel Romanian read speech dataset

Abstract:

This paper introduces one of the largest Romanian speech datasets freely available for both academic and commercial use. The dataset comprises speech data recorded over the last year from 12 speakers, along with 5 other speakers previously recorded in a separate environment. The data was manually segmented at the utterance level and semi-automatically labelled at the phone level. The resulting corpus amounts to approximately 21 hours of high-quality read speech data, split into over 19,000 utterances. Each speaker read between 921 and 1493 utterances, of which 880 are common to all speakers and add up to over 16 hours of parallel data. We present the steps of performing the recordings and data segmentation, as well as a first use of this corpus in the context of synthetic voice development.

Published in: 2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

Date of Conference: 06-09 July 2017

Date Added to IEEE Xplore: 27 July 2017

DOI: 10.1109/SPED.2017.7990428

Conference Location: Bucharest, Romania
