Speaker Verification Based on Information Theoretic Vector Quantization

  • Conference paper
Wireless Networks, Information Processing and Systems (IMTIC 2008)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 20)

Abstract

This paper explores the application of an information-theoretic Vector Quantization algorithm, called VQIT, to speaker verification. Unlike the K-means and LBG Vector Quantization algorithms, VQIT has a physical interpretation and minimizes the quantization error in an efficient way. Vector Quantization based speaker verification has proven to be successful; usually, a codebook is trained to minimize the quantization error on the data from an individual speaker. In this paper we use a set of 36 speakers from the TIMIT database, extract MFCC and LPC coefficients from the speech samples, and apply them to K-means, LBG, and VQIT Vector Quantization; the results suggest that VQIT performs better than the other VQ implementations. We also obtain results from a GMM classifier on similar coefficient data and compare them with those of VQIT.
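The codebook-based verification scheme described above can be illustrated with a minimal sketch. It is not the VQIT algorithm and not the authors' implementation: it uses plain K-means (Lloyd iterations) as the codebook trainer, and random Gaussian vectors stand in for MFCC or LPC frames. The function names, the codebook size of 8, the 12-dimensional features, and the midpoint threshold are all assumptions made for illustration.

```python
import numpy as np

def train_codebook(features, k, iters=20, seed=0):
    """Train a k-entry codebook with plain K-means (not VQIT)."""
    rng = np.random.default_rng(seed)
    # Initialise code vectors from randomly chosen training frames.
    codebook = features[rng.choice(len(features), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared Euclidean distance from every frame to every code vector.
        d = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        nearest = d.argmin(axis=1)
        for j in range(k):
            members = features[nearest == j]
            if len(members):  # leave a code vector unchanged if its cell is empty
                codebook[j] = members.mean(axis=0)
    return codebook

def quantization_error(features, codebook):
    """Mean squared distance from each frame to its nearest code vector."""
    d = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d.min(axis=1).mean()

def verify(features, codebook, threshold):
    """Accept the identity claim if the quantization error is low enough."""
    return quantization_error(features, codebook) <= threshold

# Synthetic stand-ins for per-frame feature vectors (assumption: 12 dimensions).
rng = np.random.default_rng(1)
train = rng.normal(0.0, 1.0, size=(500, 12))     # enrolment data of the claimed speaker
genuine = rng.normal(0.0, 1.0, size=(200, 12))   # test utterance, same speaker
impostor = rng.normal(3.0, 1.0, size=(200, 12))  # test utterance, different speaker

codebook = train_codebook(train, k=8)
genuine_err = quantization_error(genuine, codebook)
impostor_err = quantization_error(impostor, codebook)
# Hypothetical threshold: midpoint of the two errors, for demonstration only.
threshold = 0.5 * (genuine_err + impostor_err)
```

With these definitions, `verify(genuine, codebook, threshold)` accepts the claim and `verify(impostor, codebook, threshold)` rejects it. In practice the threshold would be tuned on held-out data rather than derived from the test utterances themselves.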

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Memon, S., Lech, M. (2008). Speaker Verification Based on Information Theoretic Vector Quantization. In: Hussain, D.M.A., Rajput, A.Q.K., Chowdhry, B.S., Gee, Q. (eds) Wireless Networks, Information Processing and Systems. IMTIC 2008. Communications in Computer and Information Science, vol 20. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89853-5_42

  • DOI: https://doi.org/10.1007/978-3-540-89853-5_42

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-89852-8

  • Online ISBN: 978-3-540-89853-5

  • eBook Packages: Computer Science, Computer Science (R0)
