Skip to main content

A Simple Way to Extract I-vector from Normalized Statastics

  • Conference paper
Biometric Recognition (CCBR 2014)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8833))

Included in the following conference series:

  • 2260 Accesses

Abstract

In the i-vector model, the utterance statistics are extracted from features using universal background model. The utterance is mapped to a vector in the total variability space, which is called i-vector. The total variability space provides a basis to obtain a low dimensional fixed-length representation of a speech utterance. But, the processing is complicated for the interweaving of the statistics and machine learning method. So, we considered separating them and proposed a simple way to extract i-vector by classical principal component analysis, factor analysis and independent component analysis from normalized statistics. The results on NIST 2008 telephone data show that the performance is very close to the traditional method and they can be improved obviously after score fusion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kinnunena, T., Li, H.: An overview of text-independent speaker recognition: From features to supervectors. Speech Communication 52(1), 12–40 (2010)

    Article  Google Scholar 

  2. Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-End Factor Analysis for Speaker Verification. IEEE Transactions on Audio, Speech and Language Processing 19(4), 788–798 (2011)

    Article  Google Scholar 

  3. Reynolds, D.A., Quatieri, T., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10(3) (2000)

    Google Scholar 

  4. Campbell, W.M., Sturim, D.E., Reynolds, D.A., Solomonoff, A.: SVM based speaker verification using a GMM supervector kernel and NAP variability compensation. In: Proc. ICASSP, vol. 1, pp. 97–100 (2006)

    Google Scholar 

  5. Kenny, P., Boulianne, G., Ouellet, P., Dumouchel, P.: Joint factor analysis versus eigenchannels in speaker recognition. IEEE Transactions on Audio, Speech and Language Processing 15(4), 1435–1447 (2007)

    Article  Google Scholar 

  6. Kenny, P., Gilles, B., Pierre, D.: Eigenvoice Modeling With Sparse Training Data. IEEE Trans. Speech and Audio Proc. 13(3), 345–354 (2005)

    Article  Google Scholar 

  7. Tipping, M., Bishop, C.: Mixtures of probabilistic principal component analyzers. Neural Computation 11, 435–474 (1999)

    Article  Google Scholar 

  8. Glembekl, O., et al.: Simplification And Optimization of I-Vector Extraction. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, May 22-27, pp. 4516–4519 (2011)

    Google Scholar 

  9. Li, M., et al.: Speaker Verification Using Simplified and Supervised I-Vector Modeling. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (2013)

    Google Scholar 

  10. Prince, S.J.D., Elder, J.H.: Probabilistic Linear Discriminant Analysis for Inferences About Identity. In: IEEE 11th International Conference on Computer Vision (2007)

    Google Scholar 

  11. Jiang, Y., Lee, K.A., Tang, Z., Ma, B., Larcher, A., Li, H.: PLDA Modeling in I-vector and Supervector Space for Speaker Verification. In: Annual Conference of the International Speech Communication Association, Interspeech (2012)

    Google Scholar 

  12. Machlica, L., Zajıc, Z.: An efficient implementation of Probabilistic Linear Discriminant Analysis. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (2013)

    Google Scholar 

  13. Garcia-Romero, D., Espy-Wilson, C.Y.: Analysis of i-vector length normalization in speaker recognition systems. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 249–252 (2011)

    Google Scholar 

  14. Martinez, A.M., Kak, A.C.: PCA versus LDA. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(2), 228–233 (2004)

    Article  Google Scholar 

  15. Johnson, R.A., Wichern, D.W.: Applied Multivariate Statistical Analysis, 6th edn. Pearson Education (2007)

    Google Scholar 

  16. Hyvärinen, A., Oja, E.: Independent Component Analysis: Algorithms and Applications. Neural Networks 13(4-5), 411–430 (2000)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Lei, Z., Luo, J., Yang, Y. (2014). A Simple Way to Extract I-vector from Normalized Statastics. In: Sun, Z., Shan, S., Sang, H., Zhou, J., Wang, Y., Yuan, W. (eds) Biometric Recognition. CCBR 2014. Lecture Notes in Computer Science, vol 8833. Springer, Cham. https://doi.org/10.1007/978-3-319-12484-1_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12484-1_41

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12483-4

  • Online ISBN: 978-3-319-12484-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics