Optimisation de la quantification vectorielle codée par treillis: application au codage des paramètres LSF

Bouzid, Merouane; Djeradi, Amar

doi:10.1007/BF03219945

Optimized trellis coded vector quantization of speech coder LSF parameters

Published: June 2005

Volume 60, pages 744–769, (2005)

Annales des Télécommunications

Merouane Bouzid¹ &
Amar Djeradi¹

Abstract

Speech coders operating at low bit rates require efficient encoding of the linear predictive coding (LPC) coefficients. Line spectral frequencies (LSF) are currently among the most suitable parameters for representing the LPC coefficients. In this paper, an optimized trellis coded vector quantization (TCVQ) scheme for encoding the LSF parameters is presented. Since the selection of a proper distortion measure is the most important issue in the design and operation of the encoder, a more appropriate weighted distance measure was used in the TCVQ design to improve the encoder's performance. We further applied the optimized TCVQ system to encode the LSF parameters of the US Federal Standard FS1016 4.8 kbit/s speech coder. At lower bit rates, objective and subjective evaluation results show that the incorporated LSF TCVQ encoder performs better than the 34 bit/frame LSF scalar quantizer originally used in the FS1016 coder. The subjective tests also indicate that the 27 bit/frame TCVQ encoder produces perceptual quality equivalent to that obtained when the LSF parameters are unquantized.

Author information

Authors and Affiliations

Laboratoire Communication Parlée et Traitement du signal, Faculté d’Electronique et d’Informatique, Université des Sciences et de la Technologie Houari Boumediene (USTHB), BP. 32, 16111, El-Alia, Bab-Ezzouar, Alger, Algérie
Merouane Bouzid & Amar Djeradi

About this article

Cite this article

Bouzid, M., Djeradi, A. Optimisation de la quantification vectorielle codée par treillis: application au codage des paramètres LSF. Ann. Télécommun. 60, 744–769 (2005). https://doi.org/10.1007/BF03219945

Received: 08 July 2004
Accepted: 08 November 2004
Issue Date: June 2005
DOI: https://doi.org/10.1007/BF03219945
