Conferences >2010 IEEE Spoken Language Tec...

Out-of-vocabulary term detection by n-gram array with distance from continuous syllable recognition results

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

For spoken document retrieval, it is very important to consider Out-of-Vocabulary (OOV) and mis-recognition of spoken words. Therefore, sub-word unit based recognition an...Show More

Metadata

Abstract:

For spoken document retrieval, it is very important to consider Out-of-Vocabulary (OOV) and mis-recognition of spoken words. Therefore, sub-word unit based recognition and retrieval methods have been proposed. This paper describes a Japanese spoken document retrieval system that is robust for considering OOV words and mis-recognition of sub-units. To solve the problem of OOV keywords and mis-recognized words, we used individual syllables as sub-word unit in continuous speech recognition and an n-gram sequence of syllables as a retrieval unit. We propose an n-gram indexing/retrieval method with distance in a syllable lattice for attacking OOV, recognition errors, and high speed retrieval. We applied this method to academic lecture presentation database of 44 hours, and 60% of the OOV words were detected in less than 2.5 milliseconds.

Published in: 2010 IEEE Spoken Language Technology Workshop

Date of Conference: 12-15 December 2010

Date Added to IEEE Xplore: 24 January 2011

ISBN Information:

DOI: 10.1109/SLT.2010.5700853

Conference Location: Berkeley, CA, USA