Abstract
In this paper we propose a new approach based on energy-adaptive matching pursuits to improve sinusoidal modelling of speech and audio signals for coding and recognition purposes. To reduce the complexity of the algorithm, an over-complete dictionary composed of complex exponentials is used, and an efficient implementation is presented. An analysis-synthesis window scheme that avoids overlapping is also proposed. Experimental results provide evidence of the advantages of the proposed method over other sinusoidal modelling methods proposed in the literature for speech and audio signals.
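As a rough illustration of the idea behind the abstract, the following is a minimal sketch of plain matching pursuit over an over-complete dictionary of complex exponentials. It is not the energy-adaptive variant or the efficient implementation proposed in the paper, and it does not include the window scheme. The function name `matching_pursuit_cexp`, the `oversample` parameter, and the unit-norm atoms `exp(2j*pi*k*n/K)/sqrt(N)` are assumptions made for this example. The only property of the dictionary it relies on is that the inner products with all atoms can be obtained from one zero-padded FFT.

```python
import numpy as np

def matching_pursuit_cexp(x, n_atoms, oversample=4):
    """Plain matching pursuit with complex-exponential atoms (illustrative sketch).

    Atoms are g_k[n] = exp(2j*pi*k*n/K) / sqrt(N), with K = oversample * N,
    so the dictionary is over-complete by the (assumed) factor `oversample`.
    Returns a list of (normalised frequency, complex coefficient) pairs
    and the final residual.
    """
    residual = np.asarray(x, dtype=complex).copy()
    N = len(residual)
    K = oversample * N
    n = np.arange(N)
    picks = []
    for _ in range(n_atoms):
        # Inner products <residual, g_k> for all K atoms via one zero-padded FFT.
        corr = np.fft.fft(residual, K) / np.sqrt(N)
        # Select the atom with the largest correlation magnitude.
        k = int(np.argmax(np.abs(corr)))
        atom = np.exp(2j * np.pi * k * n / K) / np.sqrt(N)
        # Subtract the projection of the residual onto the selected atom.
        residual = residual - corr[k] * atom
        picks.append((k / K, corr[k]))
    return picks, residual
```

For a real sinusoid, this sketch selects a positive-frequency atom and its conjugate on separate iterations, so a real signal with P sinusoids needs roughly 2P atoms.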
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vera-Candeas, P., Ruiz-Reyes, N., Martinez-Muñoz, D., Curpian-Alonso, J., Rosa-Zurera, M., Lucena-Lopez, M.J. (2003). Sinusoidal Modelling with Complex Exponentials for Speech and Audio Signals. In: Perales, F.J., Campilho, A.J.C., de la Blanca, N.P., Sanfeliu, A. (eds) Pattern Recognition and Image Analysis. IbPRIA 2003. Lecture Notes in Computer Science, vol 2652. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44871-6_121
DOI: https://doi.org/10.1007/978-3-540-44871-6_121
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40217-6
Online ISBN: 978-3-540-44871-6
eBook Packages: Springer Book Archive