Automatic Detection of Tone Mispronunciation in Mandarin

Zhang, Li; Huang, Chao; Chu, Min; Soong, Frank; Zhang, Xianda; Chen, Yudong

doi:10.1007/11939993_61

Li Zhang^22,23,
Chao Huang²³,
Min Chu²³,
Frank Soong²³,
Xianda Zhang²² &
…
Yudong Chen^23,24

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4274))

Included in the following conference series:

International Symposium on Chinese Spoken Language Processing

1627 Accesses
4 Citations

Abstract

In this paper we present our study on detecting tone mispronunciations in Mandarin. Both template and HMM approaches are investigated. Schematic templates of pitch contours are shown to be impractical due to their larger pitch range of inter-, even intra-speaker variation. The statistical Hidden Markov Models (HMM) is used to generate a Goodness of Pronunciation (GOP) score for detection with an optimized threshold. To deal with the discontinuity issue of the F0 in speech, the multi-space distribution (MSD) modeling is used for building corresponding HMMs. Under an MSD-HMM framework, detection performance of different choices of features, HMM types and GOP measures are evaluated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chen, J.-C., Jang, J.-S.R., Li, J.-Y., Wu, M.-C.: Automatic Pronunciation Assessment for Mandarin Chinese. In: Proc. ICME, pp. 1979–1982 (2004)
Google Scholar
Wei, S., Liu, Q.S., Hu, Y., Wang, R.H.: Automatic Pronunciation Assessment for Mandarin Chinese with Accent, NCMMSC8, pp. 22–25 (2005) (in Chinese)
Google Scholar
Dong, B., Zhao, Q.W., Yan, Y.H.: Analysis of Methods for Automatic Pronunciation Assessment, NCMMSC8, pp. 26–30 (2005) (in Chinese)
Google Scholar
Franco, H., Neumeyer, L., Digalakis, V., Ronen, O.: Combination of machine scores for automatic grading of pronunciation quality. Speech Communication 30, 121–130 (2000)
Article Google Scholar
Witt, S.M., Young, S.J.: Computer-assisted pronunciation teaching based on automatic speech recognition. In: Language Teaching and Language Technology Groningen, The Netherlands (April 1997)
Google Scholar
Neumeyer, L., Franco, H., Digalakis, V., Weintraub, M.: Automatic Scoring of Pronunciation Quality. Speech Communication 30, 83–93 (2000)
Article Google Scholar
Ronen, O., Neumeyer, L., Franco, H.: Automatic Detection of Mispronunciation for Language Instruction. In: Proc. European Conf. on Speech Commun. and Technology, Rodhes, pp. 645–648 (1997)
Google Scholar
Menzel, W., Herron, D., Bonaventura, P., Morton, R.: Automatic detection and correction of non-native English pronunciations. In: Proc. of InSTIL, Scotland, pp. 49–56 (2000)
Google Scholar
Witt, S.M., Young, S.J.: Performance measures for phone–level pronunciation teaching in CALL. In: Proc. Speech Technology in Language Learning 1998, Marholmen, Sweden (May 1998)
Google Scholar
Huang, C., Chang, E., Zhou, J.-L., Lee, K.-F.: Accent Modeling Based on Pronunciation Dictionary Adaptation for Large Vocabulary Mandarin Speech Recognition. In: Proc. ICSLP 2000, October 2000, vol. III, pp. 818–821 (2000)
Google Scholar
Chang, E., Zhou, J.-L., Di, S., Huang, C., Lee, K.-F.: Large Vocabulary Mandarin Speech Recognition with Different Approach in Modeling Tones. In: Proc. ICSLP 2000 (2000)
Google Scholar
Hirst, D., Espesser, R.: Automatic Modelling of Fundamental Frequency Using a Quadratic Spline Function. Travaux de l’Institut de Phontique d’Aixen -Provence 15, 75–85 (1993)
Google Scholar
Tokuda, K., Masuko, T., Miyazaki, N., Kobayashi, T.: Multi-space Probability Distribution HMM. IEICE Trans.Inf. &Syst. E85-D(3), 455–464 (2002)
Google Scholar
Wang, H.L., Qian, Y., Soong, F.K.: A Multi-Space Distribution (MSD) Approach To Speech Recognition of Tonal Languages. Accepted by ICSLP 2006
Google Scholar
Zhou, J.-L., Tian, Y., Shi, Y., Huang, C., Chang, E.: Tone Articulation Modeling for Mandarin spontaneous Speech recognition. In: Proc. ICASSP 2004, pp. 997–1000 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Tsinghua University, Beijing, 100084
Li Zhang & Xianda Zhang
Microsoft Research Asia, Beijing, 100080
Li Zhang, Chao Huang, Min Chu, Frank Soong & Yudong Chen
Communication University of China, 100024
Yudong Chen

Authors

Li Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chao Huang
View author publications
You can also search for this author in PubMed Google Scholar
Min Chu
View author publications
You can also search for this author in PubMed Google Scholar
Frank Soong
View author publications
You can also search for this author in PubMed Google Scholar
Xianda Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yudong Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, The University of Hong Kong, Hong Kong
Qiang Huo
Human Language Technology Department, Institute for Infocomm Research (I2R), 119613, Singapore
Bin Ma
School of Computer Engineering, Nanyang Technological University (NTU), 639798, Singapore
Eng-Siong Chng
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Haizhou Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, L., Huang, C., Chu, M., Soong, F., Zhang, X., Chen, Y. (2006). Automatic Detection of Tone Mispronunciation in Mandarin. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_61

Download citation

DOI: https://doi.org/10.1007/11939993_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics