Abstract
To improve speaker verification performance under high channel variability, this paper proposes an improved i-vector speaker verification algorithm. First, i-vectors are extracted for the registered speakers using a GMM-UBM. Weighted linear discriminant analysis is then applied to the i-vectors for channel compensation and dimensionality reduction, which yields more discriminative vectors. Next, WCCN and ZT-norm are combined to normalize the scores of the cosine distance scoring classifier and remove channel disturbance. Finally, the resulting robust cosine distance scoring classifier is used to identify the target speaker. Experimental results show that the proposed i-vector system achieves better performance.
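The back-end steps named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: i-vector extraction and weighted LDA are omitted, the function names (`wccn_projection`, `cosine_scores`, `zt_norm`) and the ridge term are hypothetical, and it assumes the common formulation in which WCCN is applied to the i-vectors before cosine scoring and ZT-norm applies Z-norm followed by T-norm using two impostor cohorts.

```python
import numpy as np

def wccn_projection(ivectors, labels):
    """Return B with B @ B.T == inv(W), W being the mean within-speaker covariance."""
    dim = ivectors.shape[1]
    speakers = np.unique(labels)
    W = np.zeros((dim, dim))
    for spk in speakers:
        x = ivectors[labels == spk]
        centered = x - x.mean(axis=0)
        W += centered.T @ centered / len(x)
    W /= len(speakers)
    W += 1e-6 * np.eye(dim)  # small ridge for invertibility (an assumption)
    return np.linalg.cholesky(np.linalg.inv(W))

def cosine_scores(models, utterances):
    """Cosine similarity between every model row and every utterance row."""
    a = models / np.linalg.norm(models, axis=1, keepdims=True)
    b = utterances / np.linalg.norm(utterances, axis=1, keepdims=True)
    return a @ b.T

def zt_norm(enroll, test, z_cohort, t_cohort):
    """ZT-norm: Z-norm per model, then T-norm per test utterance.

    enroll, t_cohort: model i-vectors; test, z_cohort: utterance i-vectors.
    Returns an array of shape (len(enroll), len(test)).
    """
    def z_normalize(models):
        impostor = cosine_scores(models, z_cohort)
        mu = impostor.mean(axis=1, keepdims=True)
        sigma = impostor.std(axis=1, keepdims=True)
        return (cosine_scores(models, test) - mu) / sigma

    target_z = z_normalize(enroll)    # Z-normed trial scores
    cohort_z = z_normalize(t_cohort)  # Z-normed cohort-model scores per test
    return (target_z - cohort_z.mean(axis=0)) / cohort_z.std(axis=0)
```

With row-vector i-vectors, the WCCN step is `ivectors @ B`; all four inputs to `zt_norm` would be projected this way before scoring. The cohort sizes and the choice of impostor data are left open here because the abstract does not specify them.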
Acknowledgments
This work was supported by the Youth Science and Technology Foundation of Gansu (1506RJYA111), China.
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Xing, Y., Tan, P., Zhang, C. (2016). Improved i-vector Speaker Verification Based on WCCN and ZT-norm. In: You, Z., et al. Biometric Recognition. CCBR 2016. Lecture Notes in Computer Science, vol 9967. Springer, Cham. https://doi.org/10.1007/978-3-319-46654-5_47
DOI: https://doi.org/10.1007/978-3-319-46654-5_47
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46653-8
Online ISBN: 978-3-319-46654-5
eBook Packages: Computer Science, Computer Science (R0)