Skip to main content

Efficient Parameterization for Automatic Speaker Recognition Using Support Vector Machines

  • Conference paper
  • First Online:
Book cover Intelligent Systems Design and Applications (ISDA 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 557))

  • 1613 Accesses

Abstract

Recent advances in the field of speaker recognition have proved to highly outperform algorithms. However this performance degrades when limited data are presented. This paper presents examples on how SVM can improve speaker recognition. The main contribution in this approach is the use of new low-dimensional vectors when training data are limited. We show how different kernels function of Support Vector Machines (SVM) can be used to deal a new approach for speaker recognition system. We achieved remarkable results using TIMIT database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Jain, A., Ross, A., Prabhakar, S.: An introduction to biometric recognition. IEEE Trans. Circuits Syst. Video Technol. 14(1), 4–20 (2004)

    Article  Google Scholar 

  2. Reynolds, D.: An overview of automatic speaker recognition technology. In: Proceedings of the IEEE International Conference on Acoustics Speech Signal Processing (ICASSP), vol. 4, pp. 4072–4075 (2002)

    Google Scholar 

  3. Togneri, R., Pullella, D.: An overview of speaker identification: accuracy and robustness issues. IEEE Circuits Syst. Mag. 11(2), 23–61 (2011). ISSN: 1531-636X

    Article  Google Scholar 

  4. Li, Q., Huang, Y.: An auditory-based feature extraction algorithm for robust speaker identification under mismatched conditions. IEEE Trans. Audio Speech and Lang. Process. 19(6), 1791–1801 (2011)

    Article  Google Scholar 

  5. Wa Maina, C., Walsh, J.M.L.: Joint speech enhancement and speaker identification using approximate Bayesian inference. IEEE Trans. Audio Speech Lang. Process. 19(6), 5491–5510 (2011)

    Article  Google Scholar 

  6. Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)

    Article  Google Scholar 

  7. Xiang, B., Berger, T.: Efficient text-independent speaker verification with structural Gaussian mixture models and neural network. IEEE Trans. Speech Audio Process. 11(5), 447–456 (2003)

    Article  Google Scholar 

  8. Cortes, C., Vapnick, V.: Support vector networks. Mach. Learn. 20, 1–25 (1995)

    Google Scholar 

  9. Kamppari, S.O., Hazen, T.J.: Word and phone level acoustic confidence scoring. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (2000)

    Google Scholar 

  10. Reynolds, D.A., Quatieri, T.F., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Process. 10(1–3), 19–41 (2000)

    Article  Google Scholar 

  11. Keshet, J., Bengio, S.: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods. Wiley, Hoboken (2009)

    Book  Google Scholar 

  12. Louradour, J., Daoudi, K., Bach, F.: Feature space mahalanobis sequence kernels: application to SVM speaker verification. IEEE Trans. Audio Speech Lang. Process. 15(8), 2465–2475 (2007)

    Article  Google Scholar 

  13. Campbell, W.M.: Generalized linear discriminant sequence kernels for speaker recognition. In: Proceedings of the International Conference on Acoustics Speech and Signal Processing, pp. 161–164 (2002)

    Google Scholar 

  14. Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machine using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)

    Article  Google Scholar 

  15. Dehak, R., Dehak, N., Kenny, P., Dumouchel, P.: Kernel combination for SVM speaker verification. In: Proceedings of the Speaker and Language Recognition Workshop, Odyssey (2008)

    Google Scholar 

  16. Reynolds, D.A.: Automatic Speaker Recognition Using Gaussian Mixture Speaker Models. Linc. Lab. J. 8(2), 173–192 (1995)

    Google Scholar 

  17. Young, S., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: Hidden Markov model toolkit (HTK) version 3.4 user’s guide (2002)

    Google Scholar 

  18. Jourani, R.: Reconnaissance automatique du locuteur par des GMM à grande marge, UT3 Paul Sabatier (2012)

    Google Scholar 

  19. Dehak, R., Dehak, N., Kenny, P., Dumouchel, P.: Linear and non linear kernel GMM supervector machines for speaker verification. In: Proceedings of the Interspeech, Antwerp, Belgium, pp. 302–305 (2007)

    Google Scholar 

  20. Reynolds, D.: Speaker identification and verification using Gaussian mixture speaker models. Speech Commun. 17(1–2), 91–108 (1995)

    Article  Google Scholar 

  21. Pitsikalis, V., Maragos, P.: Some advances on speech analysis using generalized dimensions. In: ISCA Tutorial and Research Workshop on Non-Linear Speech Processing (NOLISP) (2003)

    Google Scholar 

  22. Hsu, C.W., Chang, C.C., Lin, C.J.: A practical guide to support vector classification. In: Technical report, Department of Computer Science and Information Engineering, University of National Taiwan, Taipei, pp. 1–12 (2010)

    Google Scholar 

  23. Keerthi, S.S., Lin, C.-J.: Asymptotic behaviors of support vector machines with Gaussian kernel. Neural Comput. 15(7), 1667–1689 (2003)

    Article  MATH  Google Scholar 

  24. Dehak, N., Karam, Z., Reynolds, D., Dehak, R., Campbell, W., Glass, J.: A channel-blind system for speaker verification. In: Proceedings of the ICASSP, Prague, Czech Republic, pp. 4536–4539, May 2011

    Google Scholar 

  25. Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)

    Article  Google Scholar 

  26. Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digital Signal Process. 10(1), 19–41 (2000)

    Article  Google Scholar 

  27. Wang, J.C., Lian, L.X., Lin, Y.Y., Zhao, J.H.: VLSI design for SVM-based speaker verification system. IEEE Trans. Very Large Scale Integr. VLSI Syst. 23(7), 1355–1359 (2015)

    Article  Google Scholar 

  28. Mishra, P., Lotia, P.: Speaker recognition using dynamic time warping polynomial kernel SVM with confusion matrix. i-Manager’s J. Comput. Sci. 3(3), 23 (2015)

    Google Scholar 

  29. Wang, J.C., Wang, C.Y., Chin, Y.H., Liu, Y.T., Chen, E.T., Chang, P.C.: Spectral-temporal receptive fields and MFCC balanced feature extraction for robust speaker recognition. Multimedia Tools Appl., 1–14 (2016)

    Google Scholar 

  30. Mat, P., Cernock, J.H.: Analysis of DNN approaches to speaker identification. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pp. 5100–5104. IEEE, March 2016

    Google Scholar 

  31. http://www.nist.gov/speech/tests/spk/2006/

  32. Chakroun, R., Zouari, L.B., Frikha, M., Hamida, A.B.: A novel approach based on support vector machines for automatic speaker identification. In: 2015 IEEE/ACS 12th International Conference of Computer Systems and Applications (AICCSA), pp. 1–5 (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rania Chakroun .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Chakroun, R., Frikha, M., Zouari, L.B. (2017). Efficient Parameterization for Automatic Speaker Recognition Using Support Vector Machines. In: Madureira, A., Abraham, A., Gamboa, D., Novais, P. (eds) Intelligent Systems Design and Applications. ISDA 2016. Advances in Intelligent Systems and Computing, vol 557. Springer, Cham. https://doi.org/10.1007/978-3-319-53480-0_65

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-53480-0_65

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-53479-4

  • Online ISBN: 978-3-319-53480-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics