Abstract
This paper presents methods for the detection and quantization of coefficients in a speech coder based on Pitch-Tracking Periodic-Aperiodic Decomposition. The spectral envelopes of the harmonic and noise components of speech are represented by line spectral frequency (LSF) coefficients. We show new methods for improving the quality of the coded signal that exploit the perceptual properties of the human ear. The spectral envelopes of the harmonic and noise components are represented on psychoacoustically based phon-Bark and sone-Bark scales. The LSF coefficients are quantized using the methods proposed by R.M. Gray, Y. Linde and A. Buzo. To improve performance and to reduce computational complexity and memory requirements, split vector quantization is used. The structure of the vector codebook is constructed from the dependencies between LSF coefficients in the vectors of the training sequences. LSF vectors are partitioned into two or three sub-vectors, each of which is coded separately. Reallocating bits between sub-codebooks improves codebook quality. Combining perceptual properties with modified split vector quantization of the speech signal parameters makes it possible to achieve good signal quality in a speech coder at transmission rates of 2.8 kbit/s and below.
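The split vector quantization step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the 3-3-4 partition of a 10-dimensional LSF vector, the bits per sub-codebook, and the plain squared Euclidean distortion are all assumptions chosen for illustration, and the codebooks are trained with a basic LBG splitting procedure followed by Lloyd iterations.

```python
import numpy as np

def train_lbg(train, bits, n_iter=20, eps=1e-3):
    """Train a codebook with 2**bits entries by LBG splitting (illustrative)."""
    codebook = train.mean(axis=0, keepdims=True)
    while codebook.shape[0] < 2 ** bits:
        # Split every centroid into a slightly perturbed pair.
        codebook = np.vstack([codebook + eps, codebook - eps])
        for _ in range(n_iter):
            # Assign each training vector to its nearest centroid.
            dist = ((train[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
            nearest = dist.argmin(axis=1)
            # Move each centroid to the mean of its cell; empty cells stay put.
            for k in range(codebook.shape[0]):
                members = train[nearest == k]
                if len(members):
                    codebook[k] = members.mean(axis=0)
    return codebook

def train_split_vq(train, splits, bits):
    """Train one sub-codebook per sub-vector; splits are column boundaries."""
    parts = np.split(train, splits, axis=1)
    return [train_lbg(p, b) for p, b in zip(parts, bits)]

def split_encode(lsf, codebooks, splits):
    """Return one codebook index per sub-vector of a single LSF vector."""
    parts = np.split(lsf, splits)
    return [int(((cb - p) ** 2).sum(axis=1).argmin())
            for p, cb in zip(parts, codebooks)]

def split_decode(indices, codebooks):
    """Rebuild the LSF vector by concatenating the selected sub-vectors."""
    return np.concatenate([cb[i] for cb, i in zip(codebooks, indices)])

# Hypothetical setup: synthetic "LSF" vectors (sorted values in (0, pi)),
# split 3-3-4, with 3 bits assumed for each sub-codebook.
rng = np.random.default_rng(0)
training = np.sort(rng.uniform(0.0, np.pi, size=(400, 10)), axis=1)
SPLITS = [3, 6]
BITS = [3, 3, 3]
codebooks = train_split_vq(training, SPLITS, BITS)
indices = split_encode(training[0], codebooks, SPLITS)
reconstructed = split_decode(indices, codebooks)
```

Changing the entries of `BITS` while keeping their sum fixed is one simple way to experiment with reallocating bits between sub-codebooks. Note that this sketch does not enforce the ascending order of LSF values across sub-vector boundaries, which a practical coder would need to check.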
This work is supported by Bialystok Technical University under grant WAVI/2/04.
5. References
Petrowsky A., Zubrycki P., Sawicki A. Tonal and Noise Components Separation Based on a Pitch Synchronous DFT Analyzer as a Speech Coding Method, Proceedings of ECCTD, 2003, Vol. III, pp. 169–172
Linde, Y., Buzo, A., and Gray, R.M. “An algorithm for vector quantizer design”, IEEE Transactions on Communications, vol. COM-28, pp. 84–95, Jan. 1980.
Gersho, A. and Gray, R. Vector quantization and signal compression. Boston, Kluwer Academic Publishers, 1992.
DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus, Department of Commerce, NIST, Springfield, Virginia, Oct. 1990.
Paliwal, K.K. and Atal, B.S. “Efficient vector quantization of LPC parameters at 24 bits/frame”, IEEE Transactions on Speech and Audio Processing, vol. 1, no. 1, pp. 3–14, Jan. 1993.
Katsavounidis, I., Kuo, C.-C.J., Zhang, Z. “A new initialization technique for Generalized Lloyd Iteration”, IEEE Signal Processing Letters, vol. 1, no. 10, pp. 144–146, Oct. 1994.
Laroia, R., Phamdo, N., Farvardin, N. “Robust and efficient quantization of speech LSP parameters using structured vector quantizers”, in Proceedings IEEE International Conference on Acoustics, Speech, Signal Processing, (Toronto, Canada), pp. 641–644, May 1991.
A. H. Gray, Jr. and J. D. Markel, “Quantization and bit allocation in speech processing,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-24, pp. 459–473, 1976.
W.R. Gardner and B. D. Rao, “Theoretical Analysis of the High Rate Vector Quantization of LPC Parameters,” IEEE Trans. Speech Audio Processing, vol. 3, no. 5, pp. 367–381, 1995.
E. Zwicker, H. Fastl, “Psychoacoustics: Facts and Models”. Berlin: Springer-Verlag, 1990.
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this paper
Cite this paper
Petrovsky, A., Sawicki, A., Pavlovec, A. (2005). Split Vector Quantization of Psychoacoustical Modified LSF Coefficients in Speech Coder Based on Pitch-Tracking Periodic-Aperiodic Decomposition. In: Saeed, K., Pejaś, J. (eds) Information Processing and Security Systems. Springer, Boston, MA. https://doi.org/10.1007/0-387-26325-X_7
DOI: https://doi.org/10.1007/0-387-26325-X_7
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-25091-5
Online ISBN: 978-0-387-26325-0
eBook Packages: Computer Science, Computer Science (R0)