Split Vector Quantization of Psychoacoustical Modified LSF Coefficients in Speech Coder Based on Pitch-Tracking Periodic-Aperiodic Decomposition

Petrovsky, Alexander; Sawicki, Andrzej; Pavlovec, Alexander

doi:10.1007/0-387-26325-X_7


Abstract

This paper presents methods for the detection and quantization of coefficients in a speech coder based on Pitch-Tracking Periodic-Aperiodic Decomposition. The spectral envelopes of the harmonic and noise components of speech are represented by line spectral frequency (LSF) coefficients. We show new methods for improving the quality of the coded signal by exploiting the perceptual properties of the human ear. The spectral envelopes of the harmonic and noise components are represented on psychoacoustically based phon-Bark and sone-Bark scales. The LSF coefficients are quantized using the methods proposed by R.M. Gray, Y. Linde and A. Buzo. To improve performance and reduce computational complexity and memory requirements, Split Vector Quantization methods are used. The structure of the vector codebook is constructed from the dependencies between the LSF coefficients in the vectors of the training sequences. LSF vectors are partitioned into two or three sub-vectors, each of which is coded separately. Reallocating bits between the sub-codebooks improves codebook quality. The combination of perceptual properties and modified split vector quantization of the speech signal parameters yields good signal quality in a speech coder at transmission rates of 2.8 kbit/s and below.
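The split vector quantization idea described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the chapter: the 10-dimensional vectors, the 3-3-4 partition in `SPLITS`, the bit allocation in `BITS`, the synthetic training data, and the random initialization of the codebooks (the Linde-Buzo-Gray design usually grows the codebook by splitting). Each sub-codebook is trained with plain generalized Lloyd iterations, and each sub-vector is then coded independently by nearest-neighbour search.

```python
import math
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(codebook, v):
    """Index of the codeword closest to v."""
    return min(range(len(codebook)), key=lambda k: sq_dist(codebook[k], v))

def train_codebook(data, bits, iters=10, seed=0):
    """Generalized Lloyd training of a codebook with 2**bits entries."""
    rng = random.Random(seed)
    # Assumption: start from randomly chosen training vectors.
    codebook = [list(v) for v in rng.sample(data, 2 ** bits)]
    for _ in range(iters):
        # Nearest-neighbour condition: assign each vector to its closest codeword.
        cells = [[] for _ in codebook]
        for v in data:
            cells[nearest(codebook, v)].append(v)
        # Centroid condition: move each codeword to the mean of its cell.
        for k, cell in enumerate(cells):
            if cell:  # empty cells keep their previous codeword
                codebook[k] = [sum(col) / len(cell) for col in zip(*cell)]
    return codebook

def split_encode(vec, codebooks, splits):
    """Quantize each sub-vector separately; returns one index per sub-codebook."""
    return [nearest(cb, vec[lo:hi]) for cb, (lo, hi) in zip(codebooks, splits)]

def split_decode(indices, codebooks):
    """Concatenate the selected codewords back into a full vector."""
    out = []
    for cb, i in zip(codebooks, indices):
        out.extend(cb[i])
    return out

# Hypothetical setup: three sub-vectors with a 4/4/3 bit allocation.
SPLITS = [(0, 3), (3, 6), (6, 10)]
BITS = [4, 4, 3]

# Synthetic stand-in for LSF training vectors: ascending values in (0, pi).
rng = random.Random(1)
train = [sorted(rng.uniform(0.0, math.pi) for _ in range(10)) for _ in range(400)]

codebooks = [
    train_codebook([v[lo:hi] for v in train], b)
    for (lo, hi), b in zip(SPLITS, BITS)
]
indices = split_encode(train[0], codebooks, SPLITS)
reconstructed = split_decode(indices, codebooks)
```

With this allocation the three sub-codebooks store 16 + 16 + 8 = 40 codewords, whereas a single unsplit codebook at the same 11 bits would need 2048. That is the memory and search-complexity saving the abstract refers to. Moving a bit from one entry of `BITS` to another is a simple way to experiment with bit reallocation between sub-codebooks.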

This work is supported by Bialystok Technical University under the grant WAVI/2/04


References

Petrovsky, A., Zubrycki, P., and Sawicki, A. “Tonal and Noise Components Separation Based on a Pitch Synchronous DFT Analyzer as a Speech Coding Method”, Proceedings of ECCTD, vol. III, pp. 169–172, 2003.
Linde, Y., Buzo, A., and Gray, R.M. “An algorithm for vector quantizer design”, IEEE Transactions on Communications, vol. COM-28, pp. 84–95, Jan. 1980.
Gersho, A. and Gray, R.M. Vector Quantization and Signal Compression. Boston: Kluwer Academic Publishers, 1992.
DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus, Department of Commerce, NIST, Springfield, Virginia, Oct. 1990.
Paliwal, K.K. and Atal, B.S. “Efficient vector quantization of LPC parameters at 24 bits/frame”, IEEE Transactions on Speech and Audio Processing, vol. 1, no. 1, pp. 3–14, Jan. 1993.
Katsavounidis, I., Kuo, C.-C.J., and Zhang, Z. “A new initialization technique for Generalized Lloyd Iteration”, IEEE Signal Processing Letters, vol. 1, no. 10, pp. 144–146, Oct. 1994.
Laroia, R., Phamdo, N., and Farvardin, N. “Robust and efficient quantization of speech LSP parameters using structured vector quantizers”, in Proceedings IEEE International Conference on Acoustics, Speech, Signal Processing (Toronto, Canada), pp. 641–644, May 1991.
Gray, A.H., Jr. and Markel, J.D. “Quantization and bit allocation in speech processing”, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-24, pp. 459–473, 1976.
Gardner, W.R. and Rao, B.D. “Theoretical Analysis of the High Rate Vector Quantization of LPC Parameters”, IEEE Transactions on Speech and Audio Processing, vol. 3, no. 5, pp. 367–381, 1995.
Zwicker, E. and Fastl, H. Psychoacoustics: Facts and Models. Berlin: Springer-Verlag, 1990.


Author information

Authors and Affiliations

Real-Time Systems Department, Bialystok Technical University, Wiejska 45A, 15-351, Bialystok, Poland
Alexander Petrovsky, Andrzej Sawicki & Alexander Pavlovec


Editor information

Editors and Affiliations

Białystok Technical University, Poland
Khalid Saeed
Technical University of Szczecin, Poland
Jerzy Pejaś


About this paper

Cite this paper

Petrovsky, A., Sawicki, A., Pavlovec, A. (2005). Split Vector Quantization of Psychoacoustical Modified LSF Coefficients in Speech Coder Based on Pitch-Tracking Periodic-Aperiodic Decomposition. In: Saeed, K., Pejaś, J. (eds) Information Processing and Security Systems. Springer, Boston, MA. https://doi.org/10.1007/0-387-26325-X_7


DOI: https://doi.org/10.1007/0-387-26325-X_7
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-25091-5
Online ISBN: 978-0-387-26325-0
eBook Packages: Computer Science, Computer Science (R0)
