Minimum Phone Error (MPE) Model and Feature Training on Mandarin Broadcast News Task

Chen, Jia-Yu; Wan, Chia-Yu; Chen, Yi; Chen, Berlin; Lee, Lin-shan

doi:10.1007/11939993_31

Minimum Phone Error (MPE) Model and Feature Training on Mandarin Broadcast News Task

Jia-Yu Chen²²,
Chia-Yu Wan²²,
Yi Chen²²,
Berlin Chen²³ &
…
Lin-shan Lee²²

Conference paper

1595 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4274))

Abstract

The Minimum Phone Error (MPE) criterion for discriminative training was shown to be able to offer acoustic models with significantly improved performance. This concept was then further extended to Feature-space Minimum Phone Error (fMPE) and offset fMPE for training feature parameters as well. This paper reviews the concept of MPE and reports the experiments and results in performing MPE, fMPE and offset fMPE on the task of Mandarin Broadcast News, and significant improvements were obtained similar to the results reported for other languages and other tasks by other sites. In addition, a new concept of dimension-weighted offset fMPE is proposed in this work and even better performance than offset fMPE was obtained.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Povey, D., Woodland, P.C.: Minimum Phone Error and I-smoothing for Improved Discriminative Training. In: Proc. ICASSP 2002 (2002)
Google Scholar
Povey, D.: Discriminative Training for Large Vocabulary Speech Recognition, Ph.D Dissertation, Peterhouse, University of Cambridge (2004)
Google Scholar
Povey, D., Kingsbury, B., Mangu, L., Saon, G., Soltau, H., Zweig, G.: fMPE: Discriminatively Trained Features for Speech Recognition. In: Proc. ICASSP 2005 (2005)
Google Scholar
Povey, D.: Improvements to fMPE for Discriminative Training of Features. In: Proc. Interspeech 2005 (2005)
Google Scholar
Kaiser, J., Horvat, B., Kacic, Z.: A Novel Loss Function for the Overall Risk Criterion Based Discriminative Training of HMM Models. In: Proc. ICSLP 2000 (2000)
Google Scholar
Kumar, N.: Investigation of Silicon Auditory Models and Generalization of Linear Discriminant Analysis for Improved Speech Recognition, PhD Dissertation, Johns Hopkins University (1997)
Google Scholar
Zhang, B., Matsoukas, S.: Minimum Phoneme Error Based Heteroscedastic Linear Discriminant Analysis for Speech Recognition. In: Proc. ICASSP 2005 (2005)
Google Scholar
Gopinath, R.A.: Maximum Likelihood Modeling with Gaussian Distributions for Classification. In: Proc. ICASSP 1998 (1998)
Google Scholar
Wang, H.-M., Chen, B., Kuo, J.-W., Cheng, S.-S.: MATBN: A Mandarin Chinese Broadcast News Corpus. International Journal of Computational Linguistics and Chinese Language Processing (2005)
Google Scholar
Droppo, J., Acero, A.: Maximum Mutual Information SPLICE Transform for Seen and Unseen Conditions. In: Proc. Interspeech 2005 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Graduate Institute of Communication Engineering, National Taiwan University, Taipei
Jia-Yu Chen, Chia-Yu Wan, Yi Chen & Lin-shan Lee
Graduate Institute of Computer Science and Information Engineering, National Taiwan Normal University, Taipei
Berlin Chen

Authors

Jia-Yu Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chia-Yu Wan
View author publications
You can also search for this author in PubMed Google Scholar
Yi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Berlin Chen
View author publications
You can also search for this author in PubMed Google Scholar
Lin-shan Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, The University of Hong Kong, Hong Kong
Qiang Huo
Human Language Technology Department, Institute for Infocomm Research (I2R), 119613, Singapore
Bin Ma
School of Computer Engineering, Nanyang Technological University (NTU), 639798, Singapore
Eng-Siong Chng
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Haizhou Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, JY., Wan, CY., Chen, Y., Chen, B., Lee, Ls. (2006). Minimum Phone Error (MPE) Model and Feature Training on Mandarin Broadcast News Task. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_31

Download citation

DOI: https://doi.org/10.1007/11939993_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics