Regular Article
Scaled random trajectory segment models

https://doi.org/10.1006/csla.1997.0035

Abstract

Speech recognition systems based on hidden Markov models (HMMs) assume that the mean feature vector within a state is constant over time. In recent years, segment models have been proposed that attempt to describe the dynamics of the speech signal within a phonetic unit. Some of these models describe the mean trajectory over time as a random process. In this paper we present the concept of a scaled random trajectory segment model, which aims to overcome the modelling problem created by the fact that segment realizations of the same phonetic unit differ in length. The new model is supported by direct experimental evidence, and it offers two advantages over the standard (non-scaled) model. First, it performs better than the non-scaled model, as demonstrated in phone classification experiments. Second, it yields closed-form expressions for the estimated parameters, whereas the previously suggested non-scaled model requires more complicated iterative estimation procedures.
