Sinusoidal Modelling with Complex Exponentials for Speech and Audio Signals

Vera-Candeas, P.; Ruiz-Reyes, N.; Martinez-Muñoz, D.; Curpian-Alonso, J.; Rosa-Zurera, M.; Lucena-Lopez, M. J.

doi:10.1007/978-3-540-44871-6_121

P. Vera-Candeas⁵,
N. Ruiz-Reyes⁵,
D. Martinez-Muñoz⁵,
J. Curpian-Alonso⁵,
M. Rosa-Zurera⁶ &
…
M. J. Lucena-Lopez⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2652))

Included in the following conference series:

Iberian Conference on Pattern Recognition and Image Analysis

922 Accesses

Abstract

In this paper we propose a new approach based on energy-adaptive matching pursuits to improve sinusoidal modelling of speech and audio signals for coding and recognition purposes. To reduce the complexity of the algorithm, an over-complete dictionary composed of complex exponentials is used and an efficient implementation is presented. An analysis-synthesis windows scheme that avoids overlapping is proposed, too. Experimental results show evidence of the advantages of the proposed method for sinusoidal modelling of speech and audio signals compared to some others proposed in the literature.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

McAulay, R., Quatieri, T.: Speech analysis synthesis based on a sinusoidal representation. IEEE Transaction on Acoustic Speech and Signal Processing 34, 744–754 (1986)
Article Google Scholar
Thomson, D.: Spectral estimation and harmonic analysis. Proceedings of the IEEE 70 (1982)
Google Scholar
George, E., Smith, M.: Analysis-by-synthesis/overlap-add sinusoidal modeling applied to the analysis and synthesis of musical tones. Journal of the Audio Engineering Society 40, 497–515 (1992)
Google Scholar
Goodwin, M.: Adaptive signal models: theory, algorithms and audio applications. Kluwer Academic Publishers, Dordrecht (1998)
Book Google Scholar
Chang, W., Wang, D.: Perceptual quantisation of lpc excitation parameters. IEE Proc. Vision, Image and Signal Processing 145, 155–159 (1998)
Article Google Scholar
Painter, T., Spanias, A.: Perceptual segmentation and component selection in compact sinusoidal representation of audio. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, vol. 5, pp. 3289–3292 (2001)
Google Scholar
Mallat, S., Zhang, Z.: Matching pursuits with time-frequency dictionaries. IEEE Transaction on Signal Processing 41, 3397–3415 (1993)
Article Google Scholar
Verma, T., Meng, T.: Sinusoidal modeling using frame-based perceptually weighted matching pursuits. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, vol. 2, pp. 981–984 (1999)
Google Scholar
George, E., Smith, M.: Speech analysis/synthesis and modifications using an analysis-by-synthesis/overlap-add sinusoidal model. IEEE Trans. on Speech and Audio Processing 5, 389–406 (1997)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Electronic Department, University of Jaén, Polytechnical School, C/ Alfonso X el Sabio 28, 23700, Linares, Jaén, Spain
P. Vera-Candeas, N. Ruiz-Reyes, D. Martinez-Muñoz & J. Curpian-Alonso
Signal Theory and Communications Department, University of Alcalá, Polytechnical School, Ctra. Madrid-Barcelona km 33.6, 28871, Alcalá de Henares, Madrid, Spain
M. Rosa-Zurera
Informatics Department, University of Jaén, Polytechnical School, Avda. Madrid 35, 23008, Jaén, Spain
M. J. Lucena-Lopez

Authors

P. Vera-Candeas
View author publications
You can also search for this author in PubMed Google Scholar
N. Ruiz-Reyes
View author publications
You can also search for this author in PubMed Google Scholar
D. Martinez-Muñoz
View author publications
You can also search for this author in PubMed Google Scholar
J. Curpian-Alonso
View author publications
You can also search for this author in PubMed Google Scholar
M. Rosa-Zurera
View author publications
You can also search for this author in PubMed Google Scholar
M. J. Lucena-Lopez
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Unitat de Gràfics i Visió per Ordinador Departament de Ciències Matemàtiques i Informàtica, Universitat de les Illes Balears Edifici Anselm Turmeda, Ctra. de Valldemossa km 7,5, 07122, Palma de Mallorca, Spain
Francisco José Perales
FEUP - Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, 4200-465, Porto, Portugal
Aurélio J. C. Campilho
Departamento de Ciencias da la Computacíon e I.A., Universidad de Granada, E.T. S. Ing. Informática, 18071, Granada, Spain
Nicolás Pérez de la Blanca
Dept. System Engineering and Automation, Universitat Politècnica de Catalunya (UPC) Barcelona, Spain
Alberto Sanfeliu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vera-Candeas, P., Ruiz-Reyes, N., Martinez-Muñoz, D., Curpian-Alonso, J., Rosa-Zurera, M., Lucena-Lopez, M.J. (2003). Sinusoidal Modelling with Complex Exponentials for Speech and Audio Signals. In: Perales, F.J., Campilho, A.J.C., de la Blanca, N.P., Sanfeliu, A. (eds) Pattern Recognition and Image Analysis. IbPRIA 2003. Lecture Notes in Computer Science, vol 2652. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44871-6_121

Download citation

DOI: https://doi.org/10.1007/978-3-540-44871-6_121
Published: 18 September 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40217-6
Online ISBN: 978-3-540-44871-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics