Skip to main content

Speaker Recognition Using MFCC and Hybrid Model of VQ and GMM

  • Conference paper
Recent Advances in Intelligent Informatics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 235))

Abstract

Speaker recognition is widely used for automatic authentication of speaker’s identity based on human biological features. Speaker recognition extracts, characterizes and recognizes the information about speaker identity. For feature extraction and speaker modeling many algorithms are being used. In this paper, we have proposed speaker recognition system based on hybrid approach using Mel Frequency Cepstrum Coefficient (MFCC) as feature extraction and combination of vector quantization (VQ) and Gaussian Mixture Modeling (GMM) for speaker modeling. Our approach is able to recognize speaker for both text dependent and text independent speech and uses relative index as confidence measures in case of contradiction in recognition process by GMM and VQ. Simulation results highlight the efficacy of proposed method compared to earlier work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hai, J., Joo, E.M.: Improved linear predictive coding method for speech recognition. In: Information, Communications and Signal Processing and Fourth Pacific Rim Conference on Multimedia. Proceedings of the Joint Conference of the Fourth International Conference, vol. 3, pp. 1614–1618 (2003)

    Google Scholar 

  2. Muda, L., Begam, M., Elamvazuthi, I.: Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques. Journal of Computing 2(3), 138–141 (2010)

    Google Scholar 

  3. Hasan, R., Jamil, M., Rahman, G.R.S.: Speaker Identification Using Mel Frequency Cepstral Coefficients. In: 3rd International Conference on Electrical & Computer Engineering ICECE, pp. 565–568 (2004)

    Google Scholar 

  4. Tiwari, V.: MFCC and its applications in speaker recognition. International Journal on Emerging Technologies 1, 19–22 (2010)

    Google Scholar 

  5. Shende, A., Mishra, S., Kumar, S.: Comparison of Different Parameters Used In GMM Based Automatic Speaker Recognition. International Journal of Soft Computing and Engineering (IJSCE) 1(3), 14–18 (2011) ISSN: 2231-2307

    Google Scholar 

  6. Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification using Gaussian Mixture Speaker Models. IEEE Transactions on Speech and Audio Processing 3, 72–83 (1995)

    Article  Google Scholar 

  7. Bagul, S.G., Shastri, R.K.: Text Independent Speaker Recognition System using GMM. International Journal of Scientific and Research Publications 2(10), 1–5 (2012)

    Google Scholar 

  8. Jayana, H.S., Mahadeva Prasana, S.R.: Analysis, Feature Extraction, Modeling and Testing Techniques for Speaker Recognition. International Journal of Institution of Electronics and Telecommunication Engineers (IETE ) 26(3), 181–190 (2009)

    Google Scholar 

  9. Kumar, P., Jakhanwal, N., Chandra, M.: Text Dependent Speaker Identification in Noisy Environment. In: International Conference on Device and Communication (ICDeCom), pp. 1–4 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dhruv Desai .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Desai, D., Joshi, M. (2014). Speaker Recognition Using MFCC and Hybrid Model of VQ and GMM. In: Thampi, S., Abraham, A., Pal, S., Rodriguez, J. (eds) Recent Advances in Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 235. Springer, Cham. https://doi.org/10.1007/978-3-319-01778-5_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-01778-5_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-01777-8

  • Online ISBN: 978-3-319-01778-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics