Loading [MathJax]/extensions/MathZoom.js
Automatic and language independent triphone training using phonetic tables [speech recognition] | IEEE Conference Publication | IEEE Xplore

Automatic and language independent triphone training using phonetic tables [speech recognition]


Abstract:

Training triphone acoustic models for speech recognition is time-consuming and requires important manual intervention. We present an alternative solution, performing auto...Show More

Abstract:

Training triphone acoustic models for speech recognition is time-consuming and requires important manual intervention. We present an alternative solution, performing automatic training by use of a pronunciation phonetic table which summarizes the articulatory characteristics of the target language. The method is able to train triphones for any language, given an existing set of reference monophones in one or more languages, by automatically performing the tasks of monophone seeding, triphone clustering and other training steps. The automatic nature of the training algorithm lends itself to parameter optimization, which can further improve recognition accuracy with respect to manually trained models. In a continuous digit recognition experiment, it is shown that automatically generated triphone models gave a 1.26% error rate, compared to a 2.30% error rate for its manual counterpart.
Date of Conference: 17-21 May 2004
Date Added to IEEE Xplore: 30 August 2004
Print ISBN:0-7803-8484-9
Print ISSN: 1520-6149
Conference Location: Montreal, QC, Canada

Contact IEEE to Subscribe

References

References is not available for this document.