Abstract
This paper explores the application of information theoretic based Vector Quantization algorithm called VQIT for speaker verification. Unlike the K-means and LBG Vector Quantization algorithms, VQIT has a physical interpretation and relies on minimization of quantization error in an efficient way. Vector Quantization based Speaker Verification has proven to be successful; usually a codebook is trained to minimize the quantization error for the data from an individual speaker. In this paper we use a set of 36 speakers from TIMIT database and evaluate MFCC and LPC coefficients of speech samples and later apply it to the K-means Vector Quantization, LBG Vector Quantization and VQIT Vector Quantization and suggest that VQIT performs better than other VQ implementations. We also obtain the results from the GMM classifier for the similar coefficient data and compare it to the VQIT.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Jialong, H., Liu, L., Gunther, P.: A new codebook training algorithm For VQ-based speaker recognition. In: IEEE international conference on acoustics, speech and signal Processing, vol. 2, pp. 1091–1094 (1997)
Singh, G., Panda, A., Bhattacharyya, S., Srikanthan, T.: Vector quantization techniques for GMM based speaker verification. In: IEEE international conference on acoustics, speech and signal Processing, vol. 2, pp. 1165–1168 (2003) DEFANGED.19618
Pelecanos, J., Myers, S., Sridharan, S., Chandran, V.: Vector Quantization Based Gaussian Modelling for Speaker Verification. In: International conference on pattern recognition, vol. 3, pp. 294–297 (2000)
Tue, L., Anant, H., Deniz, E., Jose, C.: Vector Quantization using information theoretic concepts. Natural Computing: an international journal 4(1), 39–51 (2005)
Furui, S.: Digital Speech Processing, Synthesis and Recognition. Marcel Dekker Inc., New York (1989)
Erwin, E., Obermayer, K., Schulten, K.: Self organizing maps, ordering, convergence properties and energy functions. Biological Cybernetics 67(1), 47–55 (1991)
Heskes, T., Kapen, B.: Error potentials for Self organization. In: IEEE international conference on Neural Networks, vol. 3, pp. 1219–1223 (1993)
Heskes, T.: Energy functions for self organizing maps. In: Kohonen Maps, E., Oja, Kaski, S. (eds.) Kohonen Maps, pp. 303–315. Elsevier, Amsterdam (1999)
Hulle, M.V.: Kernel based topographic map formation achieved with an information-theoretic approach. Neural Networks 15(8-9), 1029–1039 (2002)
Bishop, C.M., Svensen, M., Williams, C.K.I.: GTM: a principled alternative to the self-organizing map. In: International Conference proceedings on Artificial neural networks - ICANN 1996, pp. 165–701 (1996)
Lynch Jr., J.J., Crochiere, R.: Speech/Silence segmentation for real-time coding via rule based adaptive endpoint detection. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 12, pp. 1348–1351 (1987)
Douglas, A.R.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions on Speech and Audio Processing 3(1), 72–83 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Memon, S., Lech, M. (2008). Speaker Verification Based on Information Theoretic Vector Quantization. In: Hussain, D.M.A., Rajput, A.Q.K., Chowdhry, B.S., Gee, Q. (eds) Wireless Networks, Information Processing and Systems. IMTIC 2008. Communications in Computer and Information Science, vol 20. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89853-5_42
Download citation
DOI: https://doi.org/10.1007/978-3-540-89853-5_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89852-8
Online ISBN: 978-3-540-89853-5
eBook Packages: Computer ScienceComputer Science (R0)