Skip to main content

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 56))

Abstract

This paper presents our Mandarin pronunciation quality assessment system for the examination of Putonghua Shuiping Kaoshi (PSK) and investigates a novel Support Vector Machine (SVM) based method to improve its assessment accuracy. Firstly, an selective speaker adaptation module is introduced, in which we select well pronounced speech from results of the first-pass automatic pronunciation scoring as the adaptation data, and adopt Maximum Likelihood Linear Regression to update the acoustic model (AM). Then, compared with the traditional triphone based AM, the monophone based AM is studied. Finally, we propose a new method of incorporating all kinds of posterior probabilities using SVM classifier. Experimental results show that the average correlation coefficient between machine and human scores is improved from 83.72% to 85.48%. It suggests that the two methods of selective speaker adaptation and multi-model combination using SVM are very effective to improve the accuracy of pronunciation quality assessment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Neumeyer, L., Franco, H., Weintraub, M., Price, P.: Automatic Text-Independent Pronunciation Scoring of Foreign Language Student Speech. In: Proc. of ICSLP 1996, Philadelphia, Pennsylvania, pp. 1457–1460 (1996)

    Google Scholar 

  2. Tatsuya, K., Masatake, D., Yasushi, T.: Practical Use of English Pronunciation System for Japanese Students in the CALL Classroom. In: INTERSPEECH 2004, pp. 1689–1692 (2004)

    Google Scholar 

  3. Franco, H., Neumeyer, L., Kim, Y., Ronen, O.: Automatic Pronunciation Scoring for Language Instruction. In: Proc. Int’l. Conf. on Acoust., Speech and Signal Processing, Munich, pp. 1471–1474 (1997)

    Google Scholar 

  4. Neumeyer, L., Franco, H., Digalakis, V., Weintraub, M.: Automatic Scoring of Pronunciation Quality. Speech Communication 30(2-3), 83–93 (2000)

    Article  Google Scholar 

  5. Franco, H., Neumeyer, L., Digalakis, V., Ronen, O.: Combination of Machine Scores for Automatic Grading of Pronunciation Quality. Speech Communication 30 (2000)

    Google Scholar 

  6. Witt, S.M., Young, S.J.: Phone-level Pronunciation Scoring and Assessment for Interactive Language Learning. Speech communication 30(2)-32(3), 95–108 (2000)

    Article  Google Scholar 

  7. Bernstein, J., Cohen, M., Murveit, H., Rtischev, D., Weintraub, M.: Automatic Evaluation and Training in English Pronunciation. In: ICSLP Kobe, Japan (1990)

    Google Scholar 

  8. Chen, J.C., Jang, J.S.R., Li, J.Y., Wu, M.J.: Automatic Pronunciation Assessment for Mandarin Chinese. In: IEEE International Conference on Multimedia and Expo., Taipei, Taiwan (June 2004)

    Google Scholar 

  9. Pan, F.P., Zhao, Q.W., Yan, Y.H.: Improvements in Tone Pronunciation Scoring for Strongly Accented Mandarin Speech. In: Proceedings of ISCSLP 2006, pp. 592–602 (2006)

    Google Scholar 

  10. Leggetter, C., Woodland, P.: Speaker adaptation of HMMs using linear regression. Technical Report CUED/F-INFENG/TR. 181. Cambridge University Engineering Department, Cambridge, UK (1994)

    Google Scholar 

  11. Vapnik, V.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)

    MATH  Google Scholar 

  12. http://www.csie.ntu.edu.tw/~cjlin/libsvm/

  13. http://ntu.csie.org/~piaip/svm/svm_tutorial.html

  14. Implementation Outline for Putonghua Shuiping Kaoshi. Commercial Press, Beijing (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Ge, F. et al. (2009). An SVM-Based Mandarin Pronunciation Quality Assessment System. In: Wang, H., Shen, Y., Huang, T., Zeng, Z. (eds) The Sixth International Symposium on Neural Networks (ISNN 2009). Advances in Intelligent and Soft Computing, vol 56. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01216-7_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-01216-7_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-01215-0

  • Online ISBN: 978-3-642-01216-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics