Adaptive Signal Models for Wide-Band Speech and Audio Compression

Vera-Candeas, Pedro; Ruiz-Reyes, Nicolás; Rosa-Zurera, Manuel; Cuevas-Martinez, Juan C.; López-Ferreras, Francisco

doi:10.1007/11492542_70

Pedro Vera-Candeas¹⁹,
Nicolás Ruiz-Reyes¹⁹,
Manuel Rosa-Zurera²⁰,
Juan C. Cuevas-Martinez¹⁹ &
…
Francisco López-Ferreras²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3523))

Included in the following conference series:

Iberian Conference on Pattern Recognition and Image Analysis

1606 Accesses
1 Citations

Abstract

This paper deals with the application of adaptive signal models for parametric speech and audio compression. The matching pursuit algorithm is used for extracting sinusoidal components and transients in audio signals. The resulting residue is perceptually modelled as a noise like signal. When a transient is detected, psychoacoustic-adapted matching pursuits are accomplished using a wavelet-based dictionary followed of an harmonic one. Otherwise, matching pursuit is applied only to the harmonic dictionary. This multi-part model (Sines + Transients + Noise) is successfully applied for speech and audio coding purposes, assuring high perceptual quality at low bit rates (close to 16 kbps for most of the signals considered for testing).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Levine, S., Smith, J.: A Sines+Transients+Noise Audio Representation for Data Compression and Time/Pitch Scale Modifications. In: 105th AES Convention, preprint 4781 (1998)
Google Scholar
Verma, T.S.: A perceptually based audio signal model with application to scalable audio compression, PhD Thesis, Standford University (1999)
Google Scholar
Den Brinker, A.C., Schuiijers, A.G.P., Oomen, A.W.J.: Parametric coding for high quality audio, 112th AES Convention, Preprint 5554 (2002)
Google Scholar
McAulay, R., Quatieri, T.: Speech Analysis/Synthesis Based on a Sinusoidal Representation. IEEE Trans. Acoustic, Speech and Signal Processing 34(4), 744–754 (1986)
Article Google Scholar
Nieuwenhuijse, J., Heusdens, R., Deprettere, E.F.: Robust exponential modeling of audio signals. In: Proc. ICASSP 1998, vol. 6, pp. 3581–3584 (1998)
Google Scholar
Vera-Candeas, P., Ruiz-Reyes, N., Rosa-Zurera, M., Martinez-Muñoz, D., Lopez- Ferreras, F.: Transient Modeling by Matching Pursuits with a Wavelet Dictionary for Parametric Audio Coding. IEEE Signal Processing Letters 11(3), 349–352 (2004)
Article Google Scholar
Goodwin, M.: Residual modelling in music analysis-synthesis. In: Proc. ICASSP 1996, vol. 2, pp. 1005–1008 (1996)
Google Scholar
Mallat, S., Zhang, Z.: Matching pursuits with time-frequency dictionaries. IEEE Trans. on Signal Processing 41, 3397–3415 (1993)
Article MATH Google Scholar
Ruiz, N., Rosa, M., López, F., Vera, P.: New algorithm for achieving an adaptive tiling of the time axis for audio coding purposes. Electronic Letters 80, 434–435 (2002)
Article Google Scholar
Goodwin, M.M.: Adaptive Signal Models. Theory, Algorithms and Audio Applications. Kluwer Academic Publishers, Dordrecht (1998)
Google Scholar
Heusdens, R., Vafin, R., Kleijn, W.B.: Sinusoidal Modelling using Psychoacoustic- Adaptive Matching Pursuits. IEEE Signal Processing Letters 9, 8 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Electronics and Telecommunication Engineering Department, University of Jaén, Polytechnic School, C/ Alfonso X el Sabio 28, 23700, Linares, Jaén, Spain
Pedro Vera-Candeas, Nicolás Ruiz-Reyes & Juan C. Cuevas-Martinez
Signal Theory and Communications Department, University of Alcalá, Polytechnic School, 28871, Alcalá de Henares, Madrid, Spain
Manuel Rosa-Zurera & Francisco López-Ferreras

Authors

Pedro Vera-Candeas
View author publications
You can also search for this author in PubMed Google Scholar
Nicolás Ruiz-Reyes
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Rosa-Zurera
View author publications
You can also search for this author in PubMed Google Scholar
Juan C. Cuevas-Martinez
View author publications
You can also search for this author in PubMed Google Scholar
Francisco López-Ferreras
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Instituto Superior Técnico & Instituto de Sistemas e Robótica,, 1049-001, Lisboa, Portugal
Jorge S. Marques
ETSI Informática y e Telecomunicación, University of Granada, 18071, Granada, Spain
Nicolás Pérez de la Blanca
Instituto Superior Técnico, CERENA-Centro de Recursos Naturais e Ambiente, Av. Rovisco Pais, 1049-001, Lisboa, Portugal
Pedro Pina

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vera-Candeas, P., Ruiz-Reyes, N., Rosa-Zurera, M., Cuevas-Martinez, J.C., López-Ferreras, F. (2005). Adaptive Signal Models for Wide-Band Speech and Audio Compression. In: Marques, J.S., Pérez de la Blanca, N., Pina, P. (eds) Pattern Recognition and Image Analysis. IbPRIA 2005. Lecture Notes in Computer Science, vol 3523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11492542_70

Download citation

DOI: https://doi.org/10.1007/11492542_70
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26154-4
Online ISBN: 978-3-540-32238-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics