Skip to main content

Speaker Modeling with Various Speech Representations

  • Conference paper
Biometric Authentication (ICBA 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3072))

Included in the following conference series:

  • 1277 Accesses

Abstract

Although numerous speech representations have been reported to be useful in speaker recognition, there is much less agreement on which speech representation provides a perfect representation of speaker-specific information. In this paper, we charaterize a speaker’s identity through the simultaneous use of various speech representations of his/her voice. We present a parametric statistical model, generalized Gaussian mixture model, and develop an EM algorithm for parameter estimation. Our approach has been applied to speaker recognition and comparative results on KING corpus demonstrate its effectiveness.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Huang, X.D., Acero, A., Hon, H.W.: Spoken Language Processing. Wiley Inter-Science, Chichester (2000)

    Google Scholar 

  2. Reynolds, D.A., Rose, R.C.: Robust text-independent speaker identification using Gaussian mixture models. IEEE Transactions on Speech Audio Processing 3(1), 72–83 (1995)

    Article  Google Scholar 

  3. Chen, K.: A connectionist method for pattern classification with diverse features. Pattern Recognition Letters 19(7), 545–558 (1998)

    Article  MATH  Google Scholar 

  4. Chen, K., Wang, L., Chi, H.: Methods of combining multiple classifiers with different features and their applications to text-independent speaker identification. International Journal of Pattern Recognition and Artificial Intelligence 11(3), 417–445 (1997)

    Article  Google Scholar 

  5. Chen, K., Xie, D., Chi, H.: A modified HME architecture for text-dependent speaker identification. IEEE Transactions on Neural Networks 7(5), 1309–1313 (1996)

    Article  Google Scholar 

  6. Chen, K., Xu, L., Chi, H.: Improved learning algorithms for mixture of experts in multiclass classification. Neural Networks 12(9), 1225–1252 (1999)

    Article  Google Scholar 

  7. Chen, K.: On the use of different speech representations for speaker modeling. IEEE Transactions on Systems, Man and Cybernetics (Part C) 34 (2004) (Special issue on Biometric Systems, in press)

    Google Scholar 

  8. McLanchlan, G., Peel, D.: Finite Mixture Models. Wiley Inter-Science, Chichester (2000)

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, K. (2004). Speaker Modeling with Various Speech Representations. In: Zhang, D., Jain, A.K. (eds) Biometric Authentication. ICBA 2004. Lecture Notes in Computer Science, vol 3072. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25948-0_81

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-25948-0_81

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22146-3

  • Online ISBN: 978-3-540-25948-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics