Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-Based Consensus Networks

Fu, Yi-Sheng; Pan, Yi-Cheng; Lee, Lin-shan

doi:10.1007/11939993_45

Yi-Sheng Fu²²,
Yi-Cheng Pan²² &
Lin-shan Lee²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4274))

Included in the following conference series:

International Symposium on Chinese Spoken Language Processing

Abstract

Word-based consensus networks have been verified to be very useful in minimizing word error rates (WER) for large vocabulary continuous speech recognition for western languages. By considering the special structure of Chinese language, this paper points out that character-based rather then word-based consensus networks should work better for Chinese language. This was verified by extensive experimental results also reported in the paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

The NECTEC 2015 Thai Open-Domain Automatic Speech Recognition System

Automatic Speech Recognition Based on Neural Networks

Automatic Speech Recognition for Moroccan Dialects: A Review

References

Bahl, L.R., Jelinek, F., Mercer, R.L.: A Maximum Likelihood Approach to Continuous Speech Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 5(2), 170–190 (1983)
Article Google Scholar
Stolcke, A., Konig, Y., Weintraub, M.: Explicit Word Error Minimization in N-best List Rescoring. In: Proc. Eurospeech, pp. 163–166 (1997)
Google Scholar
Mangu, L., Brill, E., Stolckes, A.: Finding Consensus in Speech Recognition: Word Error Minimizaiton and Other Applications of Confusion Networks. Computer Speech and Language 14(4), 373–400 (2000)
Article Google Scholar
Soong, F.K., Lo, W.K., Nakamura, S.: Generalized Word Posterior Probability (GWPP) for Measuring Reliability of Reconized Words. In: Proc. SWIM 2004 (2004)
Google Scholar
Soong, F.K., Lo, W.K., Nakamura, S.: Optimal Acoustic and Language Model Weights for Minimizing Word Verification Errors. In: Proc. ICSLP 2004 (2004)
Google Scholar
Qian, Y., Soong, F.K., Lee, T.: Tone-enhanced Generalized Character Posterior Probability (GCPP) for Cantonese LVCSR. In: Proc. ICASSP 2006 (2006)
Google Scholar
Wessel, F., Schluter, R., Ney, H.: Using Posterior Probabilities for Improved Speech Recognition. In: Proc., ICASSP (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Speech Lab, College of Electrical Engineering and Computer Science, National Taiwan University,
Yi-Sheng Fu, Yi-Cheng Pan & Lin-shan Lee

Authors

Yi-Sheng Fu
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Cheng Pan
View author publications
You can also search for this author in PubMed Google Scholar
Lin-shan Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, The University of Hong Kong, Hong Kong
Qiang Huo
Human Language Technology Department, Institute for Infocomm Research (I2R), 119613, Singapore
Bin Ma
School of Computer Engineering, Nanyang Technological University (NTU), 639798, Singapore
Eng-Siong Chng
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Haizhou Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fu, YS., Pan, YC., Lee, Ls. (2006). Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-Based Consensus Networks. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_45

Download citation

DOI: https://doi.org/10.1007/11939993_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-Based Consensus Networks

Abstract

Access this chapter

Preview

Similar content being viewed by others

The NECTEC 2015 Thai Open-Domain Automatic Speech Recognition System

Automatic Speech Recognition Based on Neural Networks

Automatic Speech Recognition for Moroccan Dialects: A Review

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-Based Consensus Networks

Abstract

Access this chapter

Preview

Similar content being viewed by others

The NECTEC 2015 Thai Open-Domain Automatic Speech Recognition System

Automatic Speech Recognition Based on Neural Networks

Automatic Speech Recognition for Moroccan Dialects: A Review

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation