Nonnative Speech Recognition Based on Bilingual Model Modification at State Level

Zhang, Qingqing; Pan, Jielin; Chan, Shui-duen; Yan, Yonghong

doi:10.1007/978-3-642-01216-7_32


Part of the book series: Advances in Intelligent and Soft Computing (AINSC, volume 56)


Abstract

This paper presents a novel bilingual model modification approach that improves nonnative speech recognition accuracy in the presence of accented pronunciation variation. Each state of the baseline nonnative acoustic model is modified with several candidate states from an auxiliary acoustic model trained on the speakers' native language. The state mapping criterion and the selection of n-best candidates are investigated, and different numbers of Gaussian mixtures in the auxiliary acoustic model are compared on a grammar-constrained speech recognition system. Compared with a nonnative acoustic model that has already been well trained with maximum a posteriori (MAP) adaptation, the approach achieves a further 5.83% relative reduction in Phrase Error Rate, with only a small relative increase in Real Time Factor.
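The abstract does not state the mapping criterion or how candidate states are combined with a baseline state, so the following is only a minimal sketch of the general idea under loud assumptions: the distance between states is taken to be the Euclidean distance between weight-averaged state means, and "modification" is taken to mean pooling the candidates' Gaussians into the baseline state's mixture. The names `State`, `Gaussian`, `n_best_candidates`, `modify_state`, `aux_share`, and `relative_reduction` are hypothetical and do not come from the paper.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Gaussian:
    weight: float       # mixture weight within its state
    mean: List[float]   # mean vector
    var: List[float]    # diagonal variances


@dataclass
class State:
    name: str
    mixture: List[Gaussian]


def state_mean(state: State) -> List[float]:
    """Weight-averaged mean of a state's mixture (a crude one-vector summary)."""
    dim = len(state.mixture[0].mean)
    total = sum(g.weight for g in state.mixture)
    return [sum(g.weight * g.mean[d] for g in state.mixture) / total
            for d in range(dim)]


def distance(a: State, b: State) -> float:
    """ASSUMED mapping criterion: Euclidean distance between state means."""
    return math.dist(state_mean(a), state_mean(b))


def n_best_candidates(target: State, auxiliary: List[State], n: int) -> List[State]:
    """Return the n auxiliary-model states closest to the target state."""
    return sorted(auxiliary, key=lambda s: distance(target, s))[:n]


def modify_state(target: State, candidates: List[State],
                 aux_share: float = 0.3) -> State:
    """ASSUMED modification: pool the target's Gaussians with the candidates'.

    aux_share is the total mixture weight given to borrowed Gaussians;
    the value 0.3 is arbitrary and chosen only for illustration.
    """
    borrowed = [g for c in candidates for g in c.mixture]
    if not borrowed:
        return State(target.name, list(target.mixture))
    own_total = sum(g.weight for g in target.mixture)
    borrowed_total = sum(g.weight for g in borrowed)
    pooled = [Gaussian((1 - aux_share) * g.weight / own_total, g.mean, g.var)
              for g in target.mixture]
    pooled += [Gaussian(aux_share * g.weight / borrowed_total, g.mean, g.var)
               for g in borrowed]
    return State(target.name, pooled)


def relative_reduction(baseline_rate: float, new_rate: float) -> float:
    """Relative error reduction: (baseline - new) / baseline."""
    return (baseline_rate - new_rate) / baseline_rate
```

In this sketch, a larger `n` or more Gaussians per auxiliary state enlarges the pooled mixture, which is one plausible reason the Real Time Factor would rise slightly as accuracy improves. The reported 5.83% figure is a relative reduction in the sense of `relative_reduction`, not an absolute drop in Phrase Error Rate.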



Author information

Authors and Affiliations

ThinkIT Speech Laboratory, Institute of Acoustics, Chinese Academy of Sciences, Beijing, 100190, China
Qingqing Zhang, Jielin Pan & Yonghong Yan
Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University
Shui-duen Chan


Editor information

Editors and Affiliations

Department of Control Science and Engineering, Huazhong University of Science and Technology, No. 1037, Luoyu Road, 430074, Wuhan, Hubei, China
Hongwei Wang, Yi Shen & Zhigang Zeng
Texas A&M University at Qatar, PO Box 23874, Doha, Qatar
Tingwen Huang


About this chapter

Cite this chapter

Zhang, Q., Pan, J., Chan, Sd., Yan, Y. (2009). Nonnative Speech Recognition Based on Bilingual Model Modification at State Level. In: Wang, H., Shen, Y., Huang, T., Zeng, Z. (eds) The Sixth International Symposium on Neural Networks (ISNN 2009). Advances in Intelligent and Soft Computing, vol 56. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01216-7_32


DOI: https://doi.org/10.1007/978-3-642-01216-7_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01215-0
Online ISBN: 978-3-642-01216-7
eBook Packages: Engineering, Engineering (R0)
