Abstract
In this work we present the experimental evaluation of a new beam-search formant tracking algorithm under noisy conditions and compare its performance with three formant tracking methods. The proposed formant tracking algorithm makes use of the roots of the polynomial of a Linear Predictive Coding (LPC) as formant candidates. The best combination of formant candidates respect to a defined cost function are selected applying a beam-search algorithm. The cost function makes use of information about local and neighbor frames using trajectory functions in order to preserve the dynamics of the frequency of formants. Experiments were carried out with a subset of the TIMIT database, contaminated with various types and levels of noises. The results show that the beam-search formant tracker have a robust behavior in noisy environments and it is clearly more precise than the rest of compared methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Rose, P.: Forensic Speaker Identification. Taylor and Francis Forensic Science Series (Robertson, J. (ed.)). Taylor and Francis, London (2002)
McCandless, S.: An algorithm for automatic formant extraction using linear prediction spectra. IEEE TASSP ASSP-22, 135–141 (1974)
Gläser, C., Heckmann, M., Joublin, F., Goerick, C.: Combining auditory preprocessing and Bayesian Estimation for Robust Formant Tracking. IEEE Trans. Audio Speech Lang. Process. (2010)
Welling, L., Ney, H.: Formant Estimation for Speech Recognition. IEEE Transactions on Speech and Audio Processing 6(1) (1998)
Mehta, D.D., Rudoy, D., Wolfe, P.J.: KARMA: Kalman-based autoregressive moving average modeling and inference for formant and antiformant tracking. Stat. AP (2011)
Messaoud, Z.B., Gargouri, D., Zribi, S., Hamida, A.B.: Formant Tracking Linear Prediction Model using HMMs for Noisy Speech Processing. Int. Journal of Inf. and Comm. Eng. 5(4) (2009)
Talkin, D.: Speech formant trajectory estimation using dynamic programming with modulated transition costs. JASA 82(S1), 55 (1987)
Deng, L., Bazzi, I., Acero, A.: Tracking Vocal Tract Resonances Using an Analytical Nonlinear Predictor and a Target-Guided Temporal Constraint (2003)
Xia, K., Espy-Wilson, C.: A new strategy of formant tracking based on dynamic programming. In: Proc. ICSLP (2000)
García Laínez, J.E., Gonzalez, D.R., Artiaga, A.M., Solano, E.L., De Lara, J.R.C.: Beam-Search Formant Tracking Algorithm based on Trajectory Functions for Continuous Speech. To be Plublished in Proceedings of CIARP 2012 (2012)
Mustafa, K., Bruce, I.C.: Robust formant tracking for continuous speech with speaker variability. IEEE Transactions on Speech and Audio Processing (2006)
Snack toolkit, http://www.speech.kth.se/wavesurfer
Deng, L., Cui, X., Pruvenok, R., Huang, J., Momen, S., Chen, Y., Alwan, A.: A Database of Vocal Tract Resonance Trajectories for Research in Speech Processing. In: ICASSP (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ribas González, D., García Laínez, J.E., Miguel, A., Ortega Gimenez, A., Lleida, E., Calvo de Lara, J.R. (2012). Evaluation of a New Beam-Search Formant Tracking Algorithm in Noisy Environments. In: Torre Toledano, D., et al. Advances in Speech and Language Technologies for Iberian Languages. Communications in Computer and Information Science, vol 328. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35292-8_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-35292-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35291-1
Online ISBN: 978-3-642-35292-8
eBook Packages: Computer ScienceComputer Science (R0)