Skip to main content
Log in

Optimisation de la quantification vectorielle codée par treillis: application au codage des paramètres LSF

Optimized trellis coded vector quantization of speech coder LSF parameters

  • Published:
Annales Des Télécommunications Aims and scope Submit manuscript

Résumé

Un des problèmes importants dans le codage de la parole à bas débit est la conception de quantificateurs efficaces pour le codage des coefficients de prédiction linéaire (Lpc). Les paramètresLsf (Line spectral Frequencies) sont actuellement classés parmi les choix les plus appropriés pour représenter les coefficientsLpc. Dans cet article, un système optimisé à quantification vectorielle codée par treillis (Tcvq) pour coder des paramètres lsf est mis au point. Afin d’améliorer les performances du codeurTcvq, une mesure de distance pondérée plus appropriée a été utilisée dans la conception du système. Nous avons plus loin appliqué le systèmeTcvq optimisé pour coder les paramètresLsf d’un codeur de parole de la norme FS1016 (Us Federal StandardFs1016) à 4800 bit/s. A bas débits, les résultats d’évaluation objective et subjective montrent que le codeur incorporé (Tcvq pourLsf) présente de meilleures performances que le quantificateur scalaire de 34 bit/trame, utilisé à l’origine dans la normeFs1016. Les tests subjectifs indiquent également que le codeurTcvq de 27 bit/trame produit une qualité perceptuelle équivalente à celle obtenue quand les paramètresLsf ne sont pas quantifiés.

Abstract

Speech coders operating at low bit rates necessitate efficient encoding of the linear predictive coding (Lpc) coefficients. Line spectral Frequencies (Lsf) parameters are currently one of the most efficient choices of transmission parameters for theLpc coefficients. In this paper, an optimized trellis coded vector quantization (Tcvq) scheme for encoding theLsf parameters is presented. When the selection of a proper distortion measure is the most important issue in the design and operation of the encoder, an appropriate weighted distance measure has been used during theTcvq construction process. We further applied the optimizedTcvq system for encoding theLsf parameters of the us Federal Standard (Fs1016) 4.8 kbps speech coder. At lower bit rates, objective and subjective evaluation results show that the incorporatedLsf tcvq encoder performs better than the 34 bits/frameLsf scalar quantizer used originally in the fs1016 coder. The subjective tests reveal also that the 27 bit/frame scheme produces equivalent perceptual quality to that when theLsf parameters are unquantized.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Bibliographie

  1. Rabiner (L. R.), Schafer (R. W.), Digital Processing of speech signals,Prentice-Hall, Englewood Cliffs,Nj, 1978.

    Google Scholar 

  2. Kleijn (W. B.),Paliwal (K. K.), Speech coding and synthesis, Elsevier ScienceS.v., 1995.

  3. Paliwal (K. K.), Atal (B. S.), Efficient vector quantization ofLpc parameters at 24 bit/frame,Ieee Transactions on Speech and Audio Processing,1, no 1, pp. 3–14, January 1993.

    Article  Google Scholar 

  4. Itakura (F.), Line spectrum representation of linear predictive coefficients of speech signals,Journal of Acoustical Society of America,57, p. 535, April 1975.

    Article  Google Scholar 

  5. Jayant (N. S.), Noll (P.), Digital Coding of Waveforms-Principales and Applications to Speech and Video, Prentice-Hall.Inc. englewood Cliffs,Nj, 1984.

    Google Scholar 

  6. Gray (R. M.), Neuhoff (D. L.), Quantization,Ieee Transactions on Information Theory,44, no 6, pp. 1–63, October 1998.

    Article  MathSciNet  Google Scholar 

  7. Soong (F. K.),Juang (B. H.), Optimal quantization ofLsp parameters, Proc.Ieee Int. Conf. Acous., Speech Signal Processing, New York, pp. 394–397, April 1988.

  8. Gersho (A.),Gray (R. M.), Vector quantization and Signal compression,Kluwer Academic Publishers, 199

  9. Leblanc (W. F.), Bhattacharya (B.), Mahmoud (S. A.), Cuperman (V), Efficient search and design procedures for robust multi-stageVq ofLpc parameters for 4 kb/s speech coding,Ieee Trans. Speech and Audio Processing,1, no 4, pp. 373–385, October 1993.

    Article  Google Scholar 

  10. Xie (M.), Adoul (J. P.), Algebraic vector quantization ofLsf parameters with low storage and computational complexity,Ieee Transactions on Speech and Audio Processing,4, no 3, pp. 234–239, May 1996.

    Article  Google Scholar 

  11. Campbell (J. P.),Tremain (T. E.),Welch (V. C), The Proposed Federal Standard 1016 4800 bps Voice Coder:Celp,Speech Technology Magazine, pp. 58–64, April/May 1990.

  12. Boite (R.), Kunt (M.), Traitement de la parole,Presses polytechniques Romandes, Lausanne, 1987.

    Google Scholar 

  13. Malone (K. T.), Fischer (T. R.), Enumeration and Trellis-Searched Coding Schemes for SpeechLsp Parameters,Ieee Trans. Speech and Audio Proc.,1, no 3, pp. 304–314, 1993.

    Article  Google Scholar 

  14. Viterbi (A. J.),Omura (J. K.), Principles of Digital Communication and Coding,McGraw-Hill Kogakusha, 1979.

  15. Marcellin (M. W.), Fischer (T. R.), Trellis coded quantization of memoryless and Gauss-markov sources,Ieee Trans. on Comm.,38, pp. 83–93, January 1990.

    Article  MathSciNet  Google Scholar 

  16. Fischer (T. R,), Marcellin (M. W.), Wang (M.), Trellis coded vector quantization,Ieee Transactions on Information Theory,37, pp.1551–1566, Nov. 1991.

    Article  MathSciNet  MATH  Google Scholar 

  17. Kasner (J. H.), Marcellin (M. W.), Hunt (B.R.), Universal Trellis Coded Quantization,Ieee Transactions on Image Processing,8, no 12, pp. 1677–1687, December 1999.

    Article  Google Scholar 

  18. Ungerboeck (G.), Trellis-coded modulation with redundant signal sets, Part I and II,Ieee Commun. Magazine,25, pp. 5–21, February 1987.

    Article  Google Scholar 

  19. Ungerboeck (G.), Channel coding with multilevel/phase signals,Ieee Trans. on Information Theory,IT-28, pp. 55–67, January 1982.

    Article  Google Scholar 

  20. Blahut (R. E.), Computation of Channel Capacity and Rate-Distortion Function,Ieee Transactions on Infor. Theory,18, no 4, pp. 460–473, July 1972.

    Article  MathSciNet  MATH  Google Scholar 

  21. Wang (H. S.), Moayeri (N.), Trellis coded vector quantization,Ieee Trans. on Comm.,40, pp. 1273–1276, August 1992.

    Article  MATH  Google Scholar 

  22. Linde (Y.), Buzo (A.), Gray (R. M.), An Algorithm for Vector Quantization Design,Ieee Transactions on Communications,COM-28, pp. 84–95, Jan 1980.

    Article  Google Scholar 

  23. Popescu (A.), Moreau (N.), Lamblin (C.), Celp Coding Using Trellis-Coded Vector Quantization of the Excitation,Ieee Transactions on Speech and Audio Processing,3, no 6, pp. 464–472, November 1995.

    Article  Google Scholar 

  24. Aksu (A.), Salehi (M.), Design, Performance, and Complexity Analysis of Residual Trellis-Coded Vector Quantizers,Ieee Trans. on Comm.,46, no 8, pp. 1020–1026, August 1998.

    Article  MATH  Google Scholar 

  25. Katsavounidis (I.), Kuo (C.), Zhang (Z.), A new initialization technique for generalized Lloyd iteration,Ieee Signal Proc. Letters,1, pp. 144–146, October 1994.

    Article  Google Scholar 

  26. Grassi (S.), Optimized Implementation of Speech Processing Algorithms,Doctorat Thesis, Neuchâtel University, Switzerland, February 1998.

  27. Garofolo (J. S.) et al.,Darpa timit Acoustic-phonetic Continuous Speech Database, Technology Building, National Institute of Standards and Technology (Nist), Gaithersburg, October 1988.

    Google Scholar 

  28. TheUs fs1016 based 4800Bps celp voice coder, Fortran and C simulation source codes, version 3.3c (Celp 3.3c), disponible sur Net ftp://svr-ftp.eng.cam.ac.uk et autres sites web

  29. Kabal (P.), Ramachandran (R. P.), The computation of line spectral frequencies using Chebyshev polynomials,IEEE Trans. Acoust., Speech, Signal Proc.,34, pp. 1419–1426, Dec. 1986.

    Article  Google Scholar 

  30. Laroia (R.),Phamdo (N.),Farvardin (N.), Robust and efficient quantization of speech lsp parameters using structured vector quantizers,Proc. ieee Int. Conference Acoustic Speech and Signal Processing, pp. 641–644, May 1991.

  31. Ramachandran (R. P.), Sondhi (M. M.), Seshadri (N.), A Two Codebook Format for Robust Quantization of Line Spectral Frequencies,Ieee Transactions on Speech and Audio Processing,3, no 3, pp. 157–167, May 1995.

    Article  Google Scholar 

  32. Agarwal (T.), Pre-Processing of Noisy Speech for Voice Coders,Master’s thesis, McGill University, Department of Electrical Engineering, Canada, January 2002.

    Google Scholar 

  33. Itu-t, Recommendation P.80 Methods of Subjective Determination of Transmission Quality,Itu, 1993.

  34. Schroeder (M. R.),Atal (B. S.), Code-Excited Linear Prediction (Celp): High-quality speech at very low bit rates, Proc.Icassp 85, pp. 937–940, March 1985.

    Google Scholar 

  35. Moreau (N.), Codage prédictif du signal de parole à débit réduit: une présentation unifiée,Annales des telecom,46, no 3–4, pp.223–239, 1991.

    Google Scholar 

  36. Moreau (N.), Techniques de Compression des Signaux,Edition Masson, 1994.

  37. Kleijn (W. B.),Krasinski (D. J.),Ketchum (R. H.), An Efficient Stochastically Excited Linear Predictive Coding Algorithm for High Quality Low Bit Rate Transmission of Speech,Speech Communication, pp. 305–316, 1988.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bouzid, M., Djeradi, A. Optimisation de la quantification vectorielle codée par treillis: application au codage des paramètres LSF. Ann. Télécommun. 60, 744–769 (2005). https://doi.org/10.1007/BF03219945

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF03219945

Mots clés

Key words

Navigation