Speaker Recognition Using MFCC and Hybrid Model of VQ and GMM

Desai, Dhruv; Joshi, Maulin

doi:10.1007/978-3-319-01778-5_6

Dhruv Desai⁶ &
Maulin Joshi⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 235))

1700 Accesses
4 Citations

Abstract

Speaker recognition is widely used for automatic authentication of speaker’s identity based on human biological features. Speaker recognition extracts, characterizes and recognizes the information about speaker identity. For feature extraction and speaker modeling many algorithms are being used. In this paper, we have proposed speaker recognition system based on hybrid approach using Mel Frequency Cepstrum Coefficient (MFCC) as feature extraction and combination of vector quantization (VQ) and Gaussian Mixture Modeling (GMM) for speaker modeling. Our approach is able to recognize speaker for both text dependent and text independent speech and uses relative index as confidence measures in case of contradiction in recognition process by GMM and VQ. Simulation results highlight the efficacy of proposed method compared to earlier work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hai, J., Joo, E.M.: Improved linear predictive coding method for speech recognition. In: Information, Communications and Signal Processing and Fourth Pacific Rim Conference on Multimedia. Proceedings of the Joint Conference of the Fourth International Conference, vol. 3, pp. 1614–1618 (2003)
Google Scholar
Muda, L., Begam, M., Elamvazuthi, I.: Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques. Journal of Computing 2(3), 138–141 (2010)
Google Scholar
Hasan, R., Jamil, M., Rahman, G.R.S.: Speaker Identification Using Mel Frequency Cepstral Coefficients. In: 3rd International Conference on Electrical & Computer Engineering ICECE, pp. 565–568 (2004)
Google Scholar
Tiwari, V.: MFCC and its applications in speaker recognition. International Journal on Emerging Technologies 1, 19–22 (2010)
Google Scholar
Shende, A., Mishra, S., Kumar, S.: Comparison of Different Parameters Used In GMM Based Automatic Speaker Recognition. International Journal of Soft Computing and Engineering (IJSCE) 1(3), 14–18 (2011) ISSN: 2231-2307
Google Scholar
Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification using Gaussian Mixture Speaker Models. IEEE Transactions on Speech and Audio Processing 3, 72–83 (1995)
Article Google Scholar
Bagul, S.G., Shastri, R.K.: Text Independent Speaker Recognition System using GMM. International Journal of Scientific and Research Publications 2(10), 1–5 (2012)
Google Scholar
Jayana, H.S., Mahadeva Prasana, S.R.: Analysis, Feature Extraction, Modeling and Testing Techniques for Speaker Recognition. International Journal of Institution of Electronics and Telecommunication Engineers (IETE ) 26(3), 181–190 (2009)
Google Scholar
Kumar, P., Jakhanwal, N., Chandra, M.: Text Dependent Speaker Identification in Noisy Environment. In: International Conference on Device and Communication (ICDeCom), pp. 1–4 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronics enginnering, Sarvajanik College of Engineering and Technology, Surat, 395001, India
Dhruv Desai
Department of Electronics & Communication, Sarvajanik College of Engineering and Technology, Surat, 395001, India
Maulin Joshi

Authors

Dhruv Desai
View author publications
You can also search for this author in PubMed Google Scholar
Maulin Joshi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dhruv Desai .

Editor information

Editors and Affiliations

Technopark Campus Trivandrum, Indian Inst. of Information Technology and Management – Kerala (IIITM-K), Kerala, India
Sabu M. Thampi
Machine Intelligence Research Labs (MIR Labs), Auburn, USA
Ajith Abraham
Indian Statistical Institute, Kolkata, India
Sankar Kumar Pal
Department of Computer Science School of Science, University of Salamanca, Salamanca, Spain
Juan Manuel Corchado Rodriguez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Desai, D., Joshi, M. (2014). Speaker Recognition Using MFCC and Hybrid Model of VQ and GMM. In: Thampi, S., Abraham, A., Pal, S., Rodriguez, J. (eds) Recent Advances in Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 235. Springer, Cham. https://doi.org/10.1007/978-3-319-01778-5_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-01778-5_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01777-8
Online ISBN: 978-3-319-01778-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics