Abstract:
In this paper, we present a joint time/frequency-domain approach for pitch estimation of speech at very low SNR. The kernel of this approach is a new function for detecting the time-domain cue, obtained by modifying the circular average magnitude difference function (CAMDF). Using the new function in conjunction with the half-wave rectified autocorrelation function emphasizes the pitch peak and suppresses the non-pitch peaks. To make pitch detection robust in noisy speech, an a priori frequency-domain estimate of the dominant pitch harmonic is extracted as an additional cue and used to optimally match the pitch peak in the time domain. The proposed approach is simulated using the Keele reference database. The results show that the proposed method, using joint time- and frequency-domain cues, achieves superior accuracy relative to some existing methods, even at a very low SNR of -10 dB.
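The two time-domain building blocks named in the abstract can be sketched as follows. This is a minimal illustration only: the abstract does not define the paper's modified CAMDF or say how it is combined with the rectified autocorrelation, so the code uses the standard CAMDF, and the ratio-based score in `estimate_pitch_lag` is a hypothetical stand-in rather than the authors' function. The function names, the `eps` parameter, and the noise-free synthetic frame are all assumptions, and the frequency-domain harmonic cue is not modeled.

```python
import math


def camdf(x, max_lag):
    """Standard circular AMDF: D(tau) = sum_n |x[(n + tau) mod N] - x[n]|.

    This is the unmodified CAMDF, not the paper's new function.
    """
    n = len(x)
    return [
        sum(abs(x[(i + tau) % n] - x[i]) for i in range(n))
        for tau in range(max_lag + 1)
    ]


def half_wave_rectified_acf(x, max_lag):
    """Autocorrelation with negative values clipped to zero."""
    n = len(x)
    acf = [
        sum(x[i] * x[i + tau] for i in range(n - tau))
        for tau in range(max_lag + 1)
    ]
    return [max(v, 0.0) for v in acf]


def estimate_pitch_lag(x, min_lag, max_lag, eps=1e-6):
    """Pick the lag with a high rectified ACF and a low CAMDF.

    ASSUMPTION: the ratio r / (d + eps) is an illustrative way to combine
    the two cues; the paper's actual combination is not given in the abstract.
    """
    d = camdf(x, max_lag)
    r = half_wave_rectified_acf(x, max_lag)
    score = [r[tau] / (d[tau] + eps) for tau in range(max_lag + 1)]
    return max(range(min_lag, max_lag + 1), key=lambda tau: score[tau])


# Hypothetical test frame: 100 Hz fundamental plus its second harmonic,
# sampled at 8 kHz, so one pitch period spans 80 samples.
fs = 8000
f0 = 100
frame = [
    math.sin(2 * math.pi * f0 * i / fs)
    + 0.5 * math.sin(2 * math.pi * 2 * f0 * i / fs)
    for i in range(400)
]

lag = estimate_pitch_lag(frame, min_lag=20, max_lag=150)
pitch_hz = fs / lag
```

The CAMDF dips toward zero at the pitch period while the autocorrelation peaks there, so dividing one by the other sharpens the pitch peak relative to peaks at other lags. The frame here is clean; behavior at low SNR, which is the paper's focus, is not demonstrated by this sketch.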
Published in: Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
Date of Conference: 23-23 March 2005
Date Added to IEEE Xplore: 09 May 2005
Print ISBN: 0-7803-8874-7