Speaker Verification Based on Information Theoretic Vector Quantization

Memon, Sheeraz; Lech, Margaret

doi:10.1007/978-3-540-89853-5_42

Sheeraz Memon⁵ &
Margaret Lech⁵

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 20))

Included in the following conference series:

International Multi Topic Conference

1449 Accesses
3 Citations

Abstract

This paper explores the application of information theoretic based Vector Quantization algorithm called VQIT for speaker verification. Unlike the K-means and LBG Vector Quantization algorithms, VQIT has a physical interpretation and relies on minimization of quantization error in an efficient way. Vector Quantization based Speaker Verification has proven to be successful; usually a codebook is trained to minimize the quantization error for the data from an individual speaker. In this paper we use a set of 36 speakers from TIMIT database and evaluate MFCC and LPC coefficients of speech samples and later apply it to the K-means Vector Quantization, LBG Vector Quantization and VQIT Vector Quantization and suggest that VQIT performs better than other VQ implementations. We also obtain the results from the GMM classifier for the similar coefficient data and compare it to the VQIT.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Jialong, H., Liu, L., Gunther, P.: A new codebook training algorithm For VQ-based speaker recognition. In: IEEE international conference on acoustics, speech and signal Processing, vol. 2, pp. 1091–1094 (1997)
Google Scholar
Singh, G., Panda, A., Bhattacharyya, S., Srikanthan, T.: Vector quantization techniques for GMM based speaker verification. In: IEEE international conference on acoustics, speech and signal Processing, vol. 2, pp. 1165–1168 (2003) DEFANGED.19618
Google Scholar
Pelecanos, J., Myers, S., Sridharan, S., Chandran, V.: Vector Quantization Based Gaussian Modelling for Speaker Verification. In: International conference on pattern recognition, vol. 3, pp. 294–297 (2000)
Google Scholar
Tue, L., Anant, H., Deniz, E., Jose, C.: Vector Quantization using information theoretic concepts. Natural Computing: an international journal 4(1), 39–51 (2005)
Article Google Scholar
Furui, S.: Digital Speech Processing, Synthesis and Recognition. Marcel Dekker Inc., New York (1989)
Google Scholar
Erwin, E., Obermayer, K., Schulten, K.: Self organizing maps, ordering, convergence properties and energy functions. Biological Cybernetics 67(1), 47–55 (1991)
Article Google Scholar
Heskes, T., Kapen, B.: Error potentials for Self organization. In: IEEE international conference on Neural Networks, vol. 3, pp. 1219–1223 (1993)
Google Scholar
Heskes, T.: Energy functions for self organizing maps. In: Kohonen Maps, E., Oja, Kaski, S. (eds.) Kohonen Maps, pp. 303–315. Elsevier, Amsterdam (1999)
Chapter Google Scholar
Hulle, M.V.: Kernel based topographic map formation achieved with an information-theoretic approach. Neural Networks 15(8-9), 1029–1039 (2002)
Article PubMed Google Scholar
Bishop, C.M., Svensen, M., Williams, C.K.I.: GTM: a principled alternative to the self-organizing map. In: International Conference proceedings on Artificial neural networks - ICANN 1996, pp. 165–701 (1996)
Google Scholar
Lynch Jr., J.J., Crochiere, R.: Speech/Silence segmentation for real-time coding via rule based adaptive endpoint detection. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 12, pp. 1348–1351 (1987)
Google Scholar
Douglas, A.R.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions on Speech and Audio Processing 3(1), 72–83 (1995)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Electrical and Computer Engineering, Royal Melbourne Institute of Technology, Melbourne, VIC, 3001, Australia
Sheeraz Memon & Margaret Lech

Authors

Sheeraz Memon
View author publications
You can also search for this author in PubMed Google Scholar
Margaret Lech
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Software Engineering & Media Technology, Aalborg University, Niels Bohrs Vej 8, 6700, Esbjerg, Denmark
D. M. Akbar Hussain
Mehran University of Engineering & Technology, Jamshoro, Pakistan
Abdul Qadeer Khan Rajput
Department of Electronics and Telecommunication Engineering, Faculty of Electrical, Electronics & Computer Engineering, Mehran UET, Jamshoro, Pakistan
Bhawani Shankar Chowdhry
Learning Societies Lab, Electronics and Computer Science, University of Southampton, United Kingdom
Quintin Gee

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Memon, S., Lech, M. (2008). Speaker Verification Based on Information Theoretic Vector Quantization. In: Hussain, D.M.A., Rajput, A.Q.K., Chowdhry, B.S., Gee, Q. (eds) Wireless Networks, Information Processing and Systems. IMTIC 2008. Communications in Computer and Information Science, vol 20. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89853-5_42

Download citation

DOI: https://doi.org/10.1007/978-3-540-89853-5_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89852-8
Online ISBN: 978-3-540-89853-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics