Résumé
Un des problèmes importants dans le codage de la parole à bas débit est la conception de quantificateurs efficaces pour le codage des coefficients de prédiction linéaire (Lpc). Les paramètresLsf (Line spectral Frequencies) sont actuellement classés parmi les choix les plus appropriés pour représenter les coefficientsLpc. Dans cet article, un système optimisé à quantification vectorielle codée par treillis (Tcvq) pour coder des paramètres lsf est mis au point. Afin d’améliorer les performances du codeurTcvq, une mesure de distance pondérée plus appropriée a été utilisée dans la conception du système. Nous avons plus loin appliqué le systèmeTcvq optimisé pour coder les paramètresLsf d’un codeur de parole de la norme FS1016 (Us Federal StandardFs1016) à 4800 bit/s. A bas débits, les résultats d’évaluation objective et subjective montrent que le codeur incorporé (Tcvq pourLsf) présente de meilleures performances que le quantificateur scalaire de 34 bit/trame, utilisé à l’origine dans la normeFs1016. Les tests subjectifs indiquent également que le codeurTcvq de 27 bit/trame produit une qualité perceptuelle équivalente à celle obtenue quand les paramètresLsf ne sont pas quantifiés.
Abstract
Speech coders operating at low bit rates necessitate efficient encoding of the linear predictive coding (Lpc) coefficients. Line spectral Frequencies (Lsf) parameters are currently one of the most efficient choices of transmission parameters for theLpc coefficients. In this paper, an optimized trellis coded vector quantization (Tcvq) scheme for encoding theLsf parameters is presented. When the selection of a proper distortion measure is the most important issue in the design and operation of the encoder, an appropriate weighted distance measure has been used during theTcvq construction process. We further applied the optimizedTcvq system for encoding theLsf parameters of the us Federal Standard (Fs1016) 4.8 kbps speech coder. At lower bit rates, objective and subjective evaluation results show that the incorporatedLsf tcvq encoder performs better than the 34 bits/frameLsf scalar quantizer used originally in the fs1016 coder. The subjective tests reveal also that the 27 bit/frame scheme produces equivalent perceptual quality to that when theLsf parameters are unquantized.
Bibliographie
Rabiner (L. R.), Schafer (R. W.), Digital Processing of speech signals,Prentice-Hall, Englewood Cliffs,Nj, 1978.
Kleijn (W. B.),Paliwal (K. K.), Speech coding and synthesis, Elsevier ScienceS.v., 1995.
Paliwal (K. K.), Atal (B. S.), Efficient vector quantization ofLpc parameters at 24 bit/frame,Ieee Transactions on Speech and Audio Processing,1, no 1, pp. 3–14, January 1993.
Itakura (F.), Line spectrum representation of linear predictive coefficients of speech signals,Journal of Acoustical Society of America,57, p. 535, April 1975.
Jayant (N. S.), Noll (P.), Digital Coding of Waveforms-Principales and Applications to Speech and Video, Prentice-Hall.Inc. englewood Cliffs,Nj, 1984.
Gray (R. M.), Neuhoff (D. L.), Quantization,Ieee Transactions on Information Theory,44, no 6, pp. 1–63, October 1998.
Soong (F. K.),Juang (B. H.), Optimal quantization ofLsp parameters, Proc.Ieee Int. Conf. Acous., Speech Signal Processing, New York, pp. 394–397, April 1988.
Gersho (A.),Gray (R. M.), Vector quantization and Signal compression,Kluwer Academic Publishers, 199
Leblanc (W. F.), Bhattacharya (B.), Mahmoud (S. A.), Cuperman (V), Efficient search and design procedures for robust multi-stageVq ofLpc parameters for 4 kb/s speech coding,Ieee Trans. Speech and Audio Processing,1, no 4, pp. 373–385, October 1993.
Xie (M.), Adoul (J. P.), Algebraic vector quantization ofLsf parameters with low storage and computational complexity,Ieee Transactions on Speech and Audio Processing,4, no 3, pp. 234–239, May 1996.
Campbell (J. P.),Tremain (T. E.),Welch (V. C), The Proposed Federal Standard 1016 4800 bps Voice Coder:Celp,Speech Technology Magazine, pp. 58–64, April/May 1990.
Boite (R.), Kunt (M.), Traitement de la parole,Presses polytechniques Romandes, Lausanne, 1987.
Malone (K. T.), Fischer (T. R.), Enumeration and Trellis-Searched Coding Schemes for SpeechLsp Parameters,Ieee Trans. Speech and Audio Proc.,1, no 3, pp. 304–314, 1993.
Viterbi (A. J.),Omura (J. K.), Principles of Digital Communication and Coding,McGraw-Hill Kogakusha, 1979.
Marcellin (M. W.), Fischer (T. R.), Trellis coded quantization of memoryless and Gauss-markov sources,Ieee Trans. on Comm.,38, pp. 83–93, January 1990.
Fischer (T. R,), Marcellin (M. W.), Wang (M.), Trellis coded vector quantization,Ieee Transactions on Information Theory,37, pp.1551–1566, Nov. 1991.
Kasner (J. H.), Marcellin (M. W.), Hunt (B.R.), Universal Trellis Coded Quantization,Ieee Transactions on Image Processing,8, no 12, pp. 1677–1687, December 1999.
Ungerboeck (G.), Trellis-coded modulation with redundant signal sets, Part I and II,Ieee Commun. Magazine,25, pp. 5–21, February 1987.
Ungerboeck (G.), Channel coding with multilevel/phase signals,Ieee Trans. on Information Theory,IT-28, pp. 55–67, January 1982.
Blahut (R. E.), Computation of Channel Capacity and Rate-Distortion Function,Ieee Transactions on Infor. Theory,18, no 4, pp. 460–473, July 1972.
Wang (H. S.), Moayeri (N.), Trellis coded vector quantization,Ieee Trans. on Comm.,40, pp. 1273–1276, August 1992.
Linde (Y.), Buzo (A.), Gray (R. M.), An Algorithm for Vector Quantization Design,Ieee Transactions on Communications,COM-28, pp. 84–95, Jan 1980.
Popescu (A.), Moreau (N.), Lamblin (C.), Celp Coding Using Trellis-Coded Vector Quantization of the Excitation,Ieee Transactions on Speech and Audio Processing,3, no 6, pp. 464–472, November 1995.
Aksu (A.), Salehi (M.), Design, Performance, and Complexity Analysis of Residual Trellis-Coded Vector Quantizers,Ieee Trans. on Comm.,46, no 8, pp. 1020–1026, August 1998.
Katsavounidis (I.), Kuo (C.), Zhang (Z.), A new initialization technique for generalized Lloyd iteration,Ieee Signal Proc. Letters,1, pp. 144–146, October 1994.
Grassi (S.), Optimized Implementation of Speech Processing Algorithms,Doctorat Thesis, Neuchâtel University, Switzerland, February 1998.
Garofolo (J. S.) et al.,Darpa timit Acoustic-phonetic Continuous Speech Database, Technology Building, National Institute of Standards and Technology (Nist), Gaithersburg, October 1988.
TheUs fs1016 based 4800Bps celp voice coder, Fortran and C simulation source codes, version 3.3c (Celp 3.3c), disponible sur Net ftp://svr-ftp.eng.cam.ac.uk et autres sites web
Kabal (P.), Ramachandran (R. P.), The computation of line spectral frequencies using Chebyshev polynomials,IEEE Trans. Acoust., Speech, Signal Proc.,34, pp. 1419–1426, Dec. 1986.
Laroia (R.),Phamdo (N.),Farvardin (N.), Robust and efficient quantization of speech lsp parameters using structured vector quantizers,Proc. ieee Int. Conference Acoustic Speech and Signal Processing, pp. 641–644, May 1991.
Ramachandran (R. P.), Sondhi (M. M.), Seshadri (N.), A Two Codebook Format for Robust Quantization of Line Spectral Frequencies,Ieee Transactions on Speech and Audio Processing,3, no 3, pp. 157–167, May 1995.
Agarwal (T.), Pre-Processing of Noisy Speech for Voice Coders,Master’s thesis, McGill University, Department of Electrical Engineering, Canada, January 2002.
Itu-t, Recommendation P.80 Methods of Subjective Determination of Transmission Quality,Itu, 1993.
Schroeder (M. R.),Atal (B. S.), Code-Excited Linear Prediction (Celp): High-quality speech at very low bit rates, Proc.Icassp 85, pp. 937–940, March 1985.
Moreau (N.), Codage prédictif du signal de parole à débit réduit: une présentation unifiée,Annales des telecom,46, no 3–4, pp.223–239, 1991.
Moreau (N.), Techniques de Compression des Signaux,Edition Masson, 1994.
Kleijn (W. B.),Krasinski (D. J.),Ketchum (R. H.), An Efficient Stochastically Excited Linear Predictive Coding Algorithm for High Quality Low Bit Rate Transmission of Speech,Speech Communication, pp. 305–316, 1988.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Bouzid, M., Djeradi, A. Optimisation de la quantification vectorielle codée par treillis: application au codage des paramètres LSF. Ann. Télécommun. 60, 744–769 (2005). https://doi.org/10.1007/BF03219945
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF03219945
Mots clés
- Codage parole
- Quantification signal
- Quantification bloc
- Codage treillis
- Optimisation
- Prédiction linéaire
- Raie spectrale