Skip to main content

SVM Based Speaker Selection Using GMM Supervector for Rapid Speaker Adaptation

  • Conference paper
Simulated Evolution and Learning (SEAL 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4247))

Included in the following conference series:

  • 1431 Accesses

Abstract

In this paper, we propose a novel method for rapid speaker adaptation called speaker support vector selection (SSVS). By taking gaussian mixture model (GMM) as speaker model, the speakers acoustically close to the test speaker are selected .Different from other selection method, just computing the likelihood between models, we utilizing support vector machines (SVM) to obtain a ‘more optimal speaker subset’. Such selection is dynamically determined according to the distribution of reference speakers close the test. Furthermore, a single-pass re-estimation procedure conditioned on the selected speakers is shown. This adaptation strategy was evaluated in a large vocabulary speech recognition task. The presented method improves the relative accuracy rates by 13% compared to the baseline system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Legetter, C.J., Woodland, P.C.: Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density HMM’s in Compute. Speech Lang. 9, 171–186 (1996)

    Article  Google Scholar 

  2. Gauvain, J.L., Lee, C.H.: Maximum a posterior estimation for multivariate Gaussian observations of Markov chains. IEEE Trans. Speech Audio Processing 2, 291–298 (1994)

    Article  Google Scholar 

  3. Sankar, A., Beaufays, F., Digalakis, V.: Training data clustering for improved speech recognition. In: Proc. Eurospeech, pp. 502–505 (1995)

    Google Scholar 

  4. Huang, C., Chen, T., Chang, E.: Speaker Selection Training for Large Vocabulary Continuous Speech Recognition. In: Proc. ICASSP (2002)

    Google Scholar 

  5. Reynolds, D.A., Rose, R.C.: Robust text dependent speaker identification using Gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing 3, 72–83 (1995)

    Article  Google Scholar 

  6. Reynolds, D.A., Quatieri, T., Dunn, R.B.: Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing 10, 19–41 (2000)

    Article  Google Scholar 

  7. Vapnik, V.N.: Statistical Learning Theory. Wiley, New York (1998)

    MATH  Google Scholar 

  8. Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Knowledge Discovery Data Mining 2(2), 121–167 (1998)

    Article  Google Scholar 

  9. Gunn, S.R.: Support vector machines for classification and regression. Technical Report Image Speech and Intelligent Systems Research Group, University of Southampton (1997)

    Google Scholar 

  10. Schmidt, M., Gish, H.: Speaker identification via support vector classifiers. In: Proc. ICASSP, pp. 105–108 (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, J., Lei, J., Guo, J., Yang, Z. (2006). SVM Based Speaker Selection Using GMM Supervector for Rapid Speaker Adaptation. In: Wang, TD., et al. Simulated Evolution and Learning. SEAL 2006. Lecture Notes in Computer Science, vol 4247. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11903697_78

Download citation

  • DOI: https://doi.org/10.1007/11903697_78

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-47331-2

  • Online ISBN: 978-3-540-47332-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics