Skip to main content

Efficient Speaker Identification Based on Robust VQ-PCA

  • Conference paper
  • First Online:
Computational Science and Its Applications — ICCSA 2003 (ICCSA 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2668))

Included in the following conference series:

Abstract

In this paper, an efficient speaker identification based on robust vector quantization principal component analysis (VQ-PCA) is proposed to solve the problems from outliers and high dimensionality of training feature vectors in speaker identification. Firstly, the proposed method partitions the data space into several disjoint regions by roust VQ based on M-estimation. Secondly, the robust PCA is obtained from the covariance matrix in each region. Finally, our method obtains the Gaussian Mixture model (GMM) for speaker from the transformed feature vectors with reduced dimension by the robust PCA in each region. Compared to the conventional GMM with diagonal covariance matrix, under the same performance, the proposed method gives faster results with less storage and, moreover, shows robust performance to outliers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. D.A. Reynolds and R.C. Rose, “Robust text-independent speaker identification using Gaussian mixture speaker models,” IEEE Tr. SAP., 3,1, (1995) 72–82.

    Google Scholar 

  2. Sadaoki Furui, “Recent advances in speaker recognition, Pattern Recognition Letters, 18, (1997) 859–872.

    Article  Google Scholar 

  3. L. Liu and J. He, “On the use of orthogonal GMM in speaker recognition,” Proc. ICASSP, (1999) 845–849.

    Google Scholar 

  4. C. Seo, K.Y. Lee and J. Lee, “GMM based on local PCA for speaker identification,” IEEE Electronic Letters, vol.37, no. 24, (2001) 1486–1488.

    Article  Google Scholar 

  5. Ariki, Y. & Tagashira, S. & Nishijima, M. (1996). Speaker recognition and speaker normalization by projection to speaker subspace, International Conference on Acoustics, Speech, and Signal Processing 96, 319–322

    Google Scholar 

  6. Croux, C. & Haesbroeck, G. (2000). Principal component analysis based on robust estimators of the covariance or correlation matrix: influence functions and efficiencies, Biometrika 87,3, 603–618

    Article  MATH  MathSciNet  Google Scholar 

  7. Gersho, A. & Gray, R.M. Vector quantization and signal compression, Kluwer Academic

    Google Scholar 

  8. Huber, P. (1981). Robust Statistics, New York: Wiley.

    MATH  Google Scholar 

  9. Kambhatla, N. & Leen, T.K. (1997). Dimension reduction by local PCA, Neural Computation 9, 1493–1503.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lee, Y., Lee, J., Lee, K.Y. (2003). Efficient Speaker Identification Based on Robust VQ-PCA. In: Kumar, V., Gavrilova, M.L., Tan, C.J.K., L’Ecuyer, P. (eds) Computational Science and Its Applications — ICCSA 2003. ICCSA 2003. Lecture Notes in Computer Science, vol 2668. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44843-8_69

Download citation

  • DOI: https://doi.org/10.1007/3-540-44843-8_69

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40161-2

  • Online ISBN: 978-3-540-44843-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics