Improved i-vector Speaker Verification Based on WCCN and ZT-norm

Xing, Yujuan; Tan, Ping; Zhang, Chengwen

doi:10.1007/978-3-319-46654-5_47

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9967)

Included in the following conference series:

Chinese Conference on Biometric Recognition

Abstract

To improve system performance under high channel variability, this paper proposes an improved i-vector speaker verification algorithm. First, i-vectors are extracted from the GMM-UBM of the registered speakers. Weighted linear discriminant analysis is then applied to the i-vectors for channel compensation and dimensionality reduction, which yields more discriminative vectors. Next, WCCN and ZT-norm are combined to normalize the scores of the cosine distance scoring classifier and remove channel disturbance. Finally, the resulting robust cosine distance scoring classifier is used to identify the target speaker. Experimental results show that the proposed i-vector system achieves better performance.
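
The scoring stage named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify how WCCN and ZT-norm are combined, so the sketch assumes the common arrangement in which WCCN is applied to the i-vectors before cosine scoring, and ZT-norm is taken as Z-norm followed by T-norm over impostor cohorts. The weighted LDA step is omitted. All function names (`wccn_projection`, `cosine_score`, `znorm`, `zt_norm`), the cohort arguments, and the small ridge term are hypothetical choices made for illustration.

```python
import numpy as np


def wccn_projection(ivectors, speaker_ids, ridge=1e-6):
    """Return B with B @ B.T == inv(W), where W is the average
    within-speaker covariance of the training i-vectors."""
    ivectors = np.asarray(ivectors, dtype=float)
    speaker_ids = np.asarray(speaker_ids)
    dim = ivectors.shape[1]
    W = np.zeros((dim, dim))
    speakers = np.unique(speaker_ids)
    for s in speakers:
        x = ivectors[speaker_ids == s]
        centered = x - x.mean(axis=0)
        W += centered.T @ centered / len(x)
    W /= len(speakers)
    # Assumption: a small ridge keeps W invertible on limited data.
    W += ridge * np.eye(dim)
    # Cholesky factor L of inv(W) satisfies L @ L.T == inv(W).
    return np.linalg.cholesky(np.linalg.inv(W))


def cosine_score(a, b, B=None):
    """Cosine similarity of two i-vectors, optionally after WCCN (x -> B.T @ x)."""
    if B is not None:
        a, b = B.T @ a, B.T @ b
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def znorm(score, model, z_utts, score_fn):
    """Z-norm: normalize by the model's scores against impostor utterances."""
    cohort = np.array([score_fn(model, u) for u in z_utts])
    return (score - cohort.mean()) / cohort.std()


def zt_norm(model, test, z_utts, t_models, score_fn):
    """ZT-norm: Z-norm the trial score, then T-norm it with the
    Z-normalized scores of impostor models against the same test utterance."""
    s_z = znorm(score_fn(model, test), model, z_utts, score_fn)
    t_z = np.array(
        [znorm(score_fn(m, test), m, z_utts, score_fn) for m in t_models]
    )
    return (s_z - t_z.mean()) / t_z.std()
```

In use, `B` would be estimated once on labelled development i-vectors, and a trial would then be scored as `zt_norm(enrol, test, z_utts, t_models, lambda a, b: cosine_score(a, b, B))`, with the result compared against a decision threshold tuned on held-out data.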

Acknowledgments

This work was supported by the Youth Science and Technology Foundation of Gansu (1506RJYA111), China.

Author information

Authors and Affiliations

School of Digital Media, Lanzhou University of Arts and Science, Lanzhou, 730000, China
Yujuan Xing, Ping Tan & Chengwen Zhang

Corresponding author

Correspondence to Yujuan Xing.

Editor information

Editors and Affiliations

Sichuan University, Chengdu, China
Zhisheng You
Tsinghua University, Beijing, China
Jie Zhou
Beihang University, Beijing, China
Yunhong Wang
Chinese Academy of Sciences, Beijing, China
Zhenan Sun
Chinese Academy of Sciences, Beijing, China
Shiguang Shan
Sun Yat-sen University, Guangzhou, China
Weishi Zheng
Tsinghua University, Beijing, China
Jianjiang Feng
Sichuan University, Chengdu, China
Qijun Zhao

About this paper

Cite this paper

Xing, Y., Tan, P., Zhang, C. (2016). Improved i-vector Speaker Verification Based on WCCN and ZT-norm. In: You, Z., et al. Biometric Recognition. CCBR 2016. Lecture Notes in Computer Science, vol 9967. Springer, Cham. https://doi.org/10.1007/978-3-319-46654-5_47

DOI: https://doi.org/10.1007/978-3-319-46654-5_47
Published: 21 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46653-8
Online ISBN: 978-3-319-46654-5
eBook Packages: Computer Science, Computer Science (R0)