PDTSL: An annotated resource for speech reconstruction

Abstract:

We describe a new resource, the Prague Dependency Treebank of Spoken Language (PDTSL), being created for English and Czech. It is intended for speech understanding, broad natural language analysis for dialog systems, and other speech-related tasks, including speech editing. The resources created so far contain audio and a standard transcription of spontaneous speech; as a novel layer, we add an edited ("reconstructed") version of the spoken utterances. These edits go beyond the scope of current speech reconstruction efforts: in addition to the usual deletion of speech artifacts, fillers, etc., we also allow word modifications, insertions, and word order changes. We have used both monologue and dialogue recordings in English and Czech to verify the feasibility of such transcription. We have also assessed the quality of the resulting annotation, since the relative freedom of the editing raises the question of what a "correct" annotation is.
Date of Conference: 15-19 December 2008
Date Added to IEEE Xplore: 06 February 2009
Conference Location: Goa, India
