Abstract
Recent advances in the field of speaker recognition have proved to highly outperform algorithms. However this performance degrades when limited data are presented. This paper presents examples on how SVM can improve speaker recognition. The main contribution in this approach is the use of new low-dimensional vectors when training data are limited. We show how different kernels function of Support Vector Machines (SVM) can be used to deal a new approach for speaker recognition system. We achieved remarkable results using TIMIT database.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Jain, A., Ross, A., Prabhakar, S.: An introduction to biometric recognition. IEEE Trans. Circuits Syst. Video Technol. 14(1), 4–20 (2004)
Reynolds, D.: An overview of automatic speaker recognition technology. In: Proceedings of the IEEE International Conference on Acoustics Speech Signal Processing (ICASSP), vol. 4, pp. 4072–4075 (2002)
Togneri, R., Pullella, D.: An overview of speaker identification: accuracy and robustness issues. IEEE Circuits Syst. Mag. 11(2), 23–61 (2011). ISSN: 1531-636X
Li, Q., Huang, Y.: An auditory-based feature extraction algorithm for robust speaker identification under mismatched conditions. IEEE Trans. Audio Speech and Lang. Process. 19(6), 1791–1801 (2011)
Wa Maina, C., Walsh, J.M.L.: Joint speech enhancement and speaker identification using approximate Bayesian inference. IEEE Trans. Audio Speech Lang. Process. 19(6), 5491–5510 (2011)
Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)
Xiang, B., Berger, T.: Efficient text-independent speaker verification with structural Gaussian mixture models and neural network. IEEE Trans. Speech Audio Process. 11(5), 447–456 (2003)
Cortes, C., Vapnick, V.: Support vector networks. Mach. Learn. 20, 1–25 (1995)
Kamppari, S.O., Hazen, T.J.: Word and phone level acoustic confidence scoring. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (2000)
Reynolds, D.A., Quatieri, T.F., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Process. 10(1–3), 19–41 (2000)
Keshet, J., Bengio, S.: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods. Wiley, Hoboken (2009)
Louradour, J., Daoudi, K., Bach, F.: Feature space mahalanobis sequence kernels: application to SVM speaker verification. IEEE Trans. Audio Speech Lang. Process. 15(8), 2465–2475 (2007)
Campbell, W.M.: Generalized linear discriminant sequence kernels for speaker recognition. In: Proceedings of the International Conference on Acoustics Speech and Signal Processing, pp. 161–164 (2002)
Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machine using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)
Dehak, R., Dehak, N., Kenny, P., Dumouchel, P.: Kernel combination for SVM speaker verification. In: Proceedings of the Speaker and Language Recognition Workshop, Odyssey (2008)
Reynolds, D.A.: Automatic Speaker Recognition Using Gaussian Mixture Speaker Models. Linc. Lab. J. 8(2), 173–192 (1995)
Young, S., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: Hidden Markov model toolkit (HTK) version 3.4 user’s guide (2002)
Jourani, R.: Reconnaissance automatique du locuteur par des GMM à grande marge, UT3 Paul Sabatier (2012)
Dehak, R., Dehak, N., Kenny, P., Dumouchel, P.: Linear and non linear kernel GMM supervector machines for speaker verification. In: Proceedings of the Interspeech, Antwerp, Belgium, pp. 302–305 (2007)
Reynolds, D.: Speaker identification and verification using Gaussian mixture speaker models. Speech Commun. 17(1–2), 91–108 (1995)
Pitsikalis, V., Maragos, P.: Some advances on speech analysis using generalized dimensions. In: ISCA Tutorial and Research Workshop on Non-Linear Speech Processing (NOLISP) (2003)
Hsu, C.W., Chang, C.C., Lin, C.J.: A practical guide to support vector classification. In: Technical report, Department of Computer Science and Information Engineering, University of National Taiwan, Taipei, pp. 1–12 (2010)
Keerthi, S.S., Lin, C.-J.: Asymptotic behaviors of support vector machines with Gaussian kernel. Neural Comput. 15(7), 1667–1689 (2003)
Dehak, N., Karam, Z., Reynolds, D., Dehak, R., Campbell, W., Glass, J.: A channel-blind system for speaker verification. In: Proceedings of the ICASSP, Prague, Czech Republic, pp. 4536–4539, May 2011
Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)
Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digital Signal Process. 10(1), 19–41 (2000)
Wang, J.C., Lian, L.X., Lin, Y.Y., Zhao, J.H.: VLSI design for SVM-based speaker verification system. IEEE Trans. Very Large Scale Integr. VLSI Syst. 23(7), 1355–1359 (2015)
Mishra, P., Lotia, P.: Speaker recognition using dynamic time warping polynomial kernel SVM with confusion matrix. i-Manager’s J. Comput. Sci. 3(3), 23 (2015)
Wang, J.C., Wang, C.Y., Chin, Y.H., Liu, Y.T., Chen, E.T., Chang, P.C.: Spectral-temporal receptive fields and MFCC balanced feature extraction for robust speaker recognition. Multimedia Tools Appl., 1–14 (2016)
Mat, P., Cernock, J.H.: Analysis of DNN approaches to speaker identification. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pp. 5100–5104. IEEE, March 2016
Chakroun, R., Zouari, L.B., Frikha, M., Hamida, A.B.: A novel approach based on support vector machines for automatic speaker identification. In: 2015 IEEE/ACS 12th International Conference of Computer Systems and Applications (AICCSA), pp. 1–5 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Chakroun, R., Frikha, M., Zouari, L.B. (2017). Efficient Parameterization for Automatic Speaker Recognition Using Support Vector Machines. In: Madureira, A., Abraham, A., Gamboa, D., Novais, P. (eds) Intelligent Systems Design and Applications. ISDA 2016. Advances in Intelligent Systems and Computing, vol 557. Springer, Cham. https://doi.org/10.1007/978-3-319-53480-0_65
Download citation
DOI: https://doi.org/10.1007/978-3-319-53480-0_65
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-53479-4
Online ISBN: 978-3-319-53480-0
eBook Packages: EngineeringEngineering (R0)