Developing text and speech databases for speech recognition of Vietnamese

Abstract:

This paper describes our work on developing text and speech databases for automatic speech recognition of Vietnamese from a readily available source of linguistic data: the Internet. First, a two-stage procedure is applied to extract a general text corpus that can be used for research on the Vietnamese language, such as speech recognition, audio-visual speech recognition, and natural language processing… We also collect a second, domain-specific text corpus covering news and literature from several major Vietnamese websites. The combined text corpus, which contains 8,681,869 sentences and more than 124 million syllables, is then used to build and test the language model for the speech recognizer. In addition, the collection of speech corpora for experiments on continuous speech recognition and audio-visual speech recognition of Vietnamese is described.
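
As an illustration of how corpus figures like the sentence and syllable counts above, and the raw counts behind an n-gram language model, might be gathered, here is a minimal Python sketch. It is not the authors' procedure, which the abstract does not detail. It assumes one sentence per string and relies on the fact that written Vietnamese separates syllables with whitespace; the function name `corpus_stats`, the boundary markers `<s>` and `</s>`, and the sample sentences are all hypothetical.

```python
from collections import Counter


def corpus_stats(sentences):
    """Count sentences, syllables, and syllable bigrams (illustrative only)."""
    n_sentences = 0
    n_syllables = 0
    bigrams = Counter()
    for sentence in sentences:
        # Vietnamese orthography puts whitespace between syllables,
        # so a plain split yields the syllable sequence.
        syllables = sentence.lower().split()
        if not syllables:
            continue  # skip empty lines
        n_sentences += 1
        n_syllables += len(syllables)
        # Hypothetical sentence-boundary markers for the bigram counts.
        padded = ["<s>"] + syllables + ["</s>"]
        bigrams.update(zip(padded, padded[1:]))
    return n_sentences, n_syllables, bigrams


# Hypothetical two-sentence sample, not taken from the paper's corpus.
sample = ["tôi học tiếng Việt", "tôi học bài"]
n_sent, n_syl, bigrams = corpus_stats(sample)
print(n_sent, n_syl, bigrams[("tôi", "học")])
```

Bigram counts of this kind are only the raw input to a language model; smoothing and the train/test split would be separate steps.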

Published in: 2013 IEEE 7th International Conference on Intelligent Data Acquisition and Advanced Computing Systems (IDAACS)

Date of Conference: 12-14 September 2013

Date Added to IEEE Xplore: 14 November 2013

DOI: 10.1109/IDAACS.2013.6662662

Conference Location: Berlin, Germany
