Abstract
Spectra is usually shown as a two-dimensional graph where colors are directly related to signal levels. A great deal of speech recognition work and research takes this type of parameter directly. In this paper we propose to combine typical signal level values with the vectorial components of a Slope matrix containing orientation information on spectra surfaces. This additional information will enable us to obtain an enhanced speech signal spectra as well as formant evolution detection and a matching method to compare speech spectra sections. The mathematical formalization is based on vector analysis and matrix operations, where the basic components are the normal vectors to a set of triangular surfaces covering the spectral values. This formalism enables the use of mathematical tools (Matlab or similar) in a very easy way; and from here it is possible to program algorithms and visualize the results efficiently.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cheung, S., Lim, J.S.: CombinedMulti-Resolution (Wideband/Narrowband) Spectrogram. In: ICASSP, pp. 457–460 (1991)
Shin, Y., Choi, H., Kim, C.: A New Method For Enhanced Spectrogram of Speech. In: ICSP, pp. 623–628 (1997)
Chari, V.R., Espy-Wilson, C.Y.: Adaptive Enhancement of Fourier Spectra. IEEE Trans. Speech and Audio Processing 3, 35–39 (1995)
Kodera, K., Gendrin, R., Villey, C.: Analysis of Time-Varying Signals with Small BT Values. IEEE Trans. Acoust., Speech, Signal Processing 26, 64–76 (1978)
Kunz, D., Aach, T.: Lapped directional transform: a new transform for spectral image analysis. In: Proc. ICASSP, pp. 3433–3439 (1999)
Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs (1993) ISBN: 0130151572
Bruce, I.C., Karkhanis, N.V., Young, E.D.: Robust Formant Tracking in Noise. Acoustics, Speech and Signal Processing (ICASSP) 17, I13–I17 (2002)
Yan, Q., Vaseghi, S.: Analysis, Modeling and Synthesis of Formants of British, American and Australian Accents. Acoustics, Speech and Signal Processing ICASSP 1, 712–715 (2003)
Lippman, R.P.: Review of Neural Networks for Speech Recognition. Neural Computation, 1–46 (1989)
Simon, S.: Neural Networks: A Comprehensive Foundation, 2nd edn. Prentice-Hall, Englewood Cliffs (1998) ISBN: 0132733501
Varoglu, E., Hacioglu, K.: Recurrent Neural Network Speech Predictor Based on Dynamical Systems Approach. Vision Image and Signal Processing IEEE Proceedings 147(2), 149–156 (2000)
Chu, S.M., Huang, T.S.: Audio-Visual Speech Modeling Using Coupled Hidden Markov Models. Acoustics, Speech and Signal Processing (ICASSP) 4, IV4096–IV4099 (2002)
Gold, B., Morgan, N.: Speech and Audio Signal Processing. Wiley, Chichester (2000)
Gold, B., Morgan, N.: Speech and Audio Signal Processing; Processing and Perception of Speech and Music. John Wiley & Sons, Chichester (1999) ISBN: 0471351547
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bobadilla, J. (2004). New Speech Enhancement Approach for Formant Evolution Detection. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2004. Lecture Notes in Computer Science(), vol 3206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30120-2_35
Download citation
DOI: https://doi.org/10.1007/978-3-540-30120-2_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23049-6
Online ISBN: 978-3-540-30120-2
eBook Packages: Springer Book Archive