Skip to main content

Automatic Detection of Tone Mispronunciation in Mandarin

  • Conference paper
Chinese Spoken Language Processing (ISCSLP 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4274))

Included in the following conference series:

Abstract

In this paper we present our study on detecting tone mispronunciations in Mandarin. Both template and HMM approaches are investigated. Schematic templates of pitch contours are shown to be impractical due to their larger pitch range of inter-, even intra-speaker variation. The statistical Hidden Markov Models (HMM) is used to generate a Goodness of Pronunciation (GOP) score for detection with an optimized threshold. To deal with the discontinuity issue of the F0 in speech, the multi-space distribution (MSD) modeling is used for building corresponding HMMs. Under an MSD-HMM framework, detection performance of different choices of features, HMM types and GOP measures are evaluated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chen, J.-C., Jang, J.-S.R., Li, J.-Y., Wu, M.-C.: Automatic Pronunciation Assessment for Mandarin Chinese. In: Proc. ICME, pp. 1979–1982 (2004)

    Google Scholar 

  2. Wei, S., Liu, Q.S., Hu, Y., Wang, R.H.: Automatic Pronunciation Assessment for Mandarin Chinese with Accent, NCMMSC8, pp. 22–25 (2005) (in Chinese)

    Google Scholar 

  3. Dong, B., Zhao, Q.W., Yan, Y.H.: Analysis of Methods for Automatic Pronunciation Assessment, NCMMSC8, pp. 26–30 (2005) (in Chinese)

    Google Scholar 

  4. Franco, H., Neumeyer, L., Digalakis, V., Ronen, O.: Combination of machine scores for automatic grading of pronunciation quality. Speech Communication 30, 121–130 (2000)

    Article  Google Scholar 

  5. Witt, S.M., Young, S.J.: Computer-assisted pronunciation teaching based on automatic speech recognition. In: Language Teaching and Language Technology Groningen, The Netherlands (April 1997)

    Google Scholar 

  6. Neumeyer, L., Franco, H., Digalakis, V., Weintraub, M.: Automatic Scoring of Pronunciation Quality. Speech Communication 30, 83–93 (2000)

    Article  Google Scholar 

  7. Ronen, O., Neumeyer, L., Franco, H.: Automatic Detection of Mispronunciation for Language Instruction. In: Proc. European Conf. on Speech Commun. and Technology, Rodhes, pp. 645–648 (1997)

    Google Scholar 

  8. Menzel, W., Herron, D., Bonaventura, P., Morton, R.: Automatic detection and correction of non-native English pronunciations. In: Proc. of InSTIL, Scotland, pp. 49–56 (2000)

    Google Scholar 

  9. Witt, S.M., Young, S.J.: Performance measures for phone–level pronunciation teaching in CALL. In: Proc. Speech Technology in Language Learning 1998, Marholmen, Sweden (May 1998)

    Google Scholar 

  10. Huang, C., Chang, E., Zhou, J.-L., Lee, K.-F.: Accent Modeling Based on Pronunciation Dictionary Adaptation for Large Vocabulary Mandarin Speech Recognition. In: Proc. ICSLP 2000, October 2000, vol. III, pp. 818–821 (2000)

    Google Scholar 

  11. Chang, E., Zhou, J.-L., Di, S., Huang, C., Lee, K.-F.: Large Vocabulary Mandarin Speech Recognition with Different Approach in Modeling Tones. In: Proc. ICSLP 2000 (2000)

    Google Scholar 

  12. Hirst, D., Espesser, R.: Automatic Modelling of Fundamental Frequency Using a Quadratic Spline Function. Travaux de l’Institut de Phontique d’Aixen -Provence 15, 75–85 (1993)

    Google Scholar 

  13. Tokuda, K., Masuko, T., Miyazaki, N., Kobayashi, T.: Multi-space Probability Distribution HMM. IEICE Trans.Inf. &Syst. E85-D(3), 455–464 (2002)

    Google Scholar 

  14. Wang, H.L., Qian, Y., Soong, F.K.: A Multi-Space Distribution (MSD) Approach To Speech Recognition of Tonal Languages. Accepted by ICSLP 2006

    Google Scholar 

  15. Zhou, J.-L., Tian, Y., Shi, Y., Huang, C., Chang, E.: Tone Articulation Modeling for Mandarin spontaneous Speech recognition. In: Proc. ICASSP 2004, pp. 997–1000 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, L., Huang, C., Chu, M., Soong, F., Zhang, X., Chen, Y. (2006). Automatic Detection of Tone Mispronunciation in Mandarin. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_61

Download citation

  • DOI: https://doi.org/10.1007/11939993_61

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-49665-6

  • Online ISBN: 978-3-540-49666-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics