Efficient Parameterization for Automatic Speaker Recognition Using Support Vector Machines

Chakroun, Rania; Frikha, Mondher; Zouari, Leila Beltaïfa

doi:10.1007/978-3-319-53480-0_65

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 557)

Included in the following conference series:

International Conference on Intelligent Systems Design and Applications

Abstract

Recent advances in the field of speaker recognition have produced algorithms that perform very well. However, this performance degrades when only limited data are available. This paper presents examples of how Support Vector Machines (SVM) can improve speaker recognition. The main contribution of this approach is the use of new low-dimensional vectors when training data are limited. We show how different SVM kernel functions can be used in a new approach to speaker recognition systems. We achieved remarkable results using the TIMIT database.
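As a rough illustration of what "different kernel functions" means for an SVM, the sketch below evaluates three commonly used kernels on short feature vectors. It is not the authors' implementation: the preview does not say which kernels, hyperparameters, or features the paper uses, so the function names, the default values of `gamma`, `degree`, and `coef0`, and the three-dimensional toy vectors are all assumptions made for illustration.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def linear_kernel(x: Vector, y: Vector) -> float:
    """Plain dot product <x, y>."""
    return sum(a * b for a, b in zip(x, y))


def polynomial_kernel(x: Vector, y: Vector, degree: int = 2,
                      gamma: float = 1.0, coef0: float = 1.0) -> float:
    """(gamma * <x, y> + coef0) ** degree; defaults are illustrative only."""
    return (gamma * linear_kernel(x, y) + coef0) ** degree


def rbf_kernel(x: Vector, y: Vector, gamma: float = 0.5) -> float:
    """Gaussian (RBF) kernel exp(-gamma * ||x - y||^2)."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


def gram_matrix(vectors: Sequence[Vector],
                kernel: Callable[[Vector, Vector], float]) -> List[List[float]]:
    """Pairwise kernel values; an SVM solver works on this matrix."""
    return [[kernel(u, v) for v in vectors] for u in vectors]


# Hypothetical low-dimensional feature vectors, one per utterance.
toy_vectors = [[1.0, 0.0, 2.0], [0.5, 1.0, 1.5], [-1.0, 0.5, 0.0]]
toy_gram = gram_matrix(toy_vectors, rbf_kernel)
```

Swapping `rbf_kernel` for `linear_kernel` or `polynomial_kernel` in the last line changes only the similarity measure, which is the sense in which a kernel choice can be compared while the rest of the classifier stays fixed.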

Author information

Authors and Affiliations

Advanced Technologies for Medicine and Signals (ATMS) Research Unit, National School of Engineering of Sousse, Sousse, Tunisia
Leila Beltaïfa Zouari
Advanced Technologies for Medicine and Signals (ATMS) Research Unit, National School of Electronics and Telecommunications of Sfax, Sfax, Tunisia
Mondher Frikha
Advanced Technologies for Medicine and Signals (ATMS) Research Unit, National School of Engineering of Sfax, Sfax, Tunisia
Rania Chakroun

Corresponding author

Correspondence to Rania Chakroun.

Editor information

Editors and Affiliations

Departamento de Engenharia Informática, Instituto Superior de Engenharia do Porto, Porto, Portugal
Ana Maria Madureira
Scientific Network for Innovation and Research Excellence, Machine Intelligence Research Labs, Auburn, Washington, USA
Ajith Abraham
Polytechnic Institute of Porto, Felgueiras, Portugal
Dorabela Gamboa
Campus of Gualtar, University of Minho, Braga, Portugal
Paulo Novais

Cite this paper

Chakroun, R., Frikha, M., Zouari, L.B. (2017). Efficient Parameterization for Automatic Speaker Recognition Using Support Vector Machines. In: Madureira, A., Abraham, A., Gamboa, D., Novais, P. (eds) Intelligent Systems Design and Applications. ISDA 2016. Advances in Intelligent Systems and Computing, vol 557. Springer, Cham. https://doi.org/10.1007/978-3-319-53480-0_65

DOI: https://doi.org/10.1007/978-3-319-53480-0_65
Published: 23 February 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-53479-4
Online ISBN: 978-3-319-53480-0
eBook Packages: Engineering, Engineering (R0)
