|
For Full-Text PDF, please login, if you are a member of IEICE,
or go to Pay Per View on menu list, if you are a nonmember of IEICE.
|
Genetic Algorithm Based Optimization of Partly-Hidden Markov Model Structure Using Discriminative Criterion
Tetsuji OGAWA Tetsunori KOBAYASHI
Publication
IEICE TRANSACTIONS on Information and Systems
Vol.E89-D
No.3
pp.939-945 Publication Date: 2006/03/01 Online ISSN: 1745-1361
DOI: 10.1093/ietisy/e89-d.3.939 Print ISSN: 0916-8532 Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing) Category: Speech Recognition Keyword: acoustic model, hidden Markov model, partly-hidden Markov model, weighted likelihood-ratio maximization criterion, genetic algorithm, lecture talk speech recognition,
Full Text: PDF(602.5KB)>>
Summary:
A discriminative modeling is applied to optimize the structure of a Partly-Hidden Markov Model (PHMM). PHMM was proposed in our previous work to deal with the complicated temporal changes of acoustic features. It can represent observation dependent behaviors in both observations and state transitions. In the formulation of the previous PHMM, we used a common structure for all models. However, it is expected that the optimal structure which gives the best performance differs from category to category. In this paper, we designed a new structure optimization method in which the dependence of the states and the observations of PHMM are optimally defined according to each model using the weighted likelihood-ratio maximization (WLRM) criterion. The WLRM criterion gives high discriminability between the correct category and the incorrect categories. Therefore it gives model structures with good discriminative performance. We define the model structure combination which satisfy the WLRM criterion for any possible structure combinations as the optimal structures. A genetic algorithm is also applied to the adequate approximation of a full search. With results of continuous lecture talk speech recognition, the effectiveness of the proposed structure optimization is shown: it reduced the word errors compared to HMM and PHMM with a common structure for all models.
|
open access publishing via
|
|
|
|
|
|
|
|