Abstract:
Recently, there have been many efforts in developing confidence measures for speech recognition output. Usually, these measures are applied to the final result of the dec...Show MoreMetadata
Abstract:
Recently, there have been many efforts in developing confidence measures for speech recognition output. Usually, these measures are applied to the final result of the decoder. However, using these measures early in the search process can guide the search to more promising paths. In this paper we use a confidence metric, the Average Best_Base_Phoneme Rank, to dynamically tune the contribution of the language model score. The advantage of this metric is that it uses information already available during the decoder search. The performance of this guided search approach was tested in two experiments for the ATIS and WSJ data sets and results show significant reductions in recognition error rates.
Date of Conference: 13-17 May 2002
Date Added to IEEE Xplore: 07 April 2011
Print ISBN:0-7803-7402-9
Print ISSN: 1520-6149