Skip to main content

Split Vector Quantization of Psychoacoustical Modified LSF Coefficients in Speech Coder Based on Pitch-Tracking Periodic-Aperiodic Decomposition

  • Conference paper
Information Processing and Security Systems

Abstract

This paper presents methods of detection and quantization coefficients from speech coder based on Pitch-Tracking Periodic-Aperiodic Decomposition. Spectral envelopes of harmonic and noise components of speech are represented by linear spectral frequencies (LSF) coefficients. In this article we show new methods of quality improvement in coding signal, using perceptual properties of human ear. Spectrum envelopes of the harmonic frequencies and noise components envelopes are represented in psychoacousticaly based phon-Bark and sone-Bark scale. Line Spectral Frequencies coefficients are quantized, using methods proposed by R.M. Gray, Y. Linde and A. Buzo. For better performance, reduction of computational complexity and memory space, Split Vector Quantization methods are used. Structure of Vector Codebook is condtructed based on dependency between LSF coefficents in vectors of training sequentions. LSF vectors are patritioned into two or three sub-vectors, and each of them is coded separately. Bit realocation between sub-codebooks allow obtain improvement of codebook quality. Combination of perceptual properties and modified split vector quantization of speech signal parameters, permit achieve good quality signal in speech coder for transmission rate 2.8 kbit/s and below.

This work is supported by Bialystok Technical University under the grant WAVI/2/04

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

5. References

  1. Petrowsky A., Zubrycki P., Sawicki A. Tonal and Noise Components Separation Based on a Pitch Synchronous DFT Analyzer as a Speech Coding Method, Proceedings of ECCTD, 2003, Vol. III, pp. 169–172

    Google Scholar 

  2. Linde, Y., Buzo, A., and Gray, R.M. “An algorithm for vector quantizer design”, IEEE Transactions on Communications, vol. COM-28, pp. 84–95, Jan. 1980.

    Article  Google Scholar 

  3. Gersho, A. and Gray, R. Vector quantization and signal compression. Boston, Kluwer Academic Publishers, 1992.

    Google Scholar 

  4. DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus, Department of Commerce, NIST, Springfield, Virginia, Oct. 1990.

    Google Scholar 

  5. Palival, K.K. and Atal, B.S. “Efficient vector quantization of LPC parameters at 24 bits/frame”, IEEE Transactions on Acoustics, Speech and Signal Processing., vol.1, № 1, pp. 3–14, Jan. 1993.

    Google Scholar 

  6. Katsavounidis, I., Kuo, C.-C.J., Zhang, Z. “A new initialization technique for Generalized Lloyd Iteration”, IEEE Signal Processing Letters, vol. 1, № 10, pp. 144–146, Oct. 1994.

    Article  Google Scholar 

  7. Laroia, R., Phamdo, N., Farvardin, N. “Robust and efficient quantization of speech LSP parameters using structured vector quantizers”, in Proceedings IEEE International Conference on Acoustics, Speech, Signal Processing, (Toronto, Canada), pp. 641–644, May 1991.

    Google Scholar 

  8. A. H. Gray, Jr. and J. D. Markel, “Quantization and bit allocation in speech processing,” IEEE Trans. Acoust, Speech, Signal Processing, vol. ASSP-24, pp. 459–473, 1976.

    Article  Google Scholar 

  9. W.R. Gardner and B. D. Rao, “Theoretical Analysis of the High Rate Vector Quantization of LPC Parameters” IEEE Trans. Speech Audio Processing, vol. 3, no. 5 pp. 367–381, 1995.

    Article  Google Scholar 

  10. E. Zwicker, H. Fasti, “Psychoacoustics: Facts and Models”. Berlin: Springer-Verlag, 1990

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer Science+Business Media, Inc.

About this paper

Cite this paper

Petrovsky, A., Sawicki, A., Pavlovec, A. (2005). Split Vector Quantization of Psychoacoustical Modified LSF Coefficients in Speech Coder Based on Pitch-Tracking Periodic-Aperiodic Decomposition. In: Saeed, K., Pejaś, J. (eds) Information Processing and Security Systems. Springer, Boston, MA. https://doi.org/10.1007/0-387-26325-X_7

Download citation

  • DOI: https://doi.org/10.1007/0-387-26325-X_7

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-25091-5

  • Online ISBN: 978-0-387-26325-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics