Abstract:
This paper deals with the estimation of fundamental frequency for two speakers recorded on the same channel. We estimate a set of speech models based upon sinusoidal plus autoregressive noise representations. We then select the best model from this set using Rissanen's criterion. This criterion, which is equivalent to a penalized log-likelihood, is also used for voicing detection: the detector compares the likelihood of a sinusoid plus noise model with the likelihood of a simple autoregressive model. Several simulations are presented to illustrate this estimation method.
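
The voicing test described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation. It assumes a single speaker, a fixed number of harmonics, a grid search over candidate fundamental frequencies, and a sequential fit (harmonics first, then the autoregressive noise) in place of a joint maximum-likelihood estimate. The function names (`mdl`, `ar_residual`, `harmonic_residual`, `detect_voicing`) and the default values of `n_harm` and `ar_order` are hypothetical. The penalty used is the common two-part form, (k/2) log n for k free parameters.

```python
import numpy as np


def mdl(residual, n_params):
    """Penalized negative Gaussian log-likelihood of a residual.

    With the ML variance estimate, the likelihood term reduces to
    (n/2) * (log(2*pi*sigma2) + 1); the penalty is (k/2) * log(n).
    """
    n = len(residual)
    sigma2 = np.mean(residual ** 2)
    return 0.5 * n * (np.log(2 * np.pi * sigma2) + 1.0) + 0.5 * n_params * np.log(n)


def ar_residual(x, order):
    """Least-squares AR fit; returns the one-step prediction error."""
    n = len(x)
    # Column k holds x[t-k-1] for t = order .. n-1.
    past = np.column_stack([x[order - k - 1:n - k - 1] for k in range(order)])
    target = x[order:]
    coef, *_ = np.linalg.lstsq(past, target, rcond=None)
    return target - past @ coef


def harmonic_residual(x, f0, n_harm, fs):
    """Remove the least-squares fit of n_harm harmonics of f0 from x."""
    t = np.arange(len(x)) / fs
    phase = 2 * np.pi * f0 * np.outer(t, np.arange(1, n_harm + 1))
    basis = np.hstack([np.cos(phase), np.sin(phase)])
    amp, *_ = np.linalg.lstsq(basis, x, rcond=None)
    return x - basis @ amp


def detect_voicing(x, fs, f0_grid, n_harm=8, ar_order=4):
    """Return (voiced, f0): voiced is True when the harmonic model wins."""
    # Noise-only hypothesis: AR coefficients plus the innovation variance.
    mdl_noise = mdl(ar_residual(x, ar_order), ar_order + 1)
    best_f0, best_mdl = None, np.inf
    for f0 in f0_grid:
        e = ar_residual(harmonic_residual(x, f0, n_harm, fs), ar_order)
        # Extra parameters: two amplitudes per harmonic, plus f0 itself.
        score = mdl(e, ar_order + 1 + 2 * n_harm + 1)
        if score < best_mdl:
            best_f0, best_mdl = float(f0), score
    voiced = bool(best_mdl < mdl_noise)
    return voiced, (best_f0 if voiced else None)
```

Calling `detect_voicing(frame, fs, np.arange(60.0, 401.0, 2.0))` on a short frame returns a voicing flag and, for voiced frames, the grid value with the lowest criterion. Both hypotheses use the same autoregressive order, so their residuals have the same length and the two criterion values are directly comparable.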
Date of Conference: 13-17 May 2002
Date Added to IEEE Xplore: 07 April 2011
Print ISBN: 0-7803-7402-9
Print ISSN: 1520-6149