Abstract
Based on a statistical analysis of the common errors found in texts that have been typed in, OCR-recognized, or speech-recognized but not yet proofread, and on the characteristics of such texts, we propose an error-detecting principle and an error-detecting algorithm based on orderly-neighborship. We also discuss the factors that affect the performance indices of an error-detecting system, such as the recall ratio and the accuracy ratio.
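The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes that "orderly-neighborship" can be approximated by counts of ordered adjacent token pairs gathered from proofread text, that a token is suspicious when neither its left nor its right neighbor pair has been seen often enough, and that the "accuracy ratio" corresponds to what is usually called precision. All function names, the `min_count` threshold, and the toy data are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Sequence, Set, Tuple


def train_neighbor_counts(corpus: Iterable[Sequence[str]]) -> Counter:
    """Count ordered adjacent token pairs (left, right) in proofread text."""
    counts: Counter = Counter()
    for tokens in corpus:
        counts.update(zip(tokens, tokens[1:]))
    return counts


def flag_suspects(tokens: Sequence[str], counts: Counter,
                  min_count: int = 1) -> List[int]:
    """Return positions supported by neither neighbor pair (assumed rule)."""
    suspects = []
    for i in range(len(tokens)):
        # A missing neighbor (sentence boundary) gives no support.
        left_ok = i > 0 and counts[(tokens[i - 1], tokens[i])] >= min_count
        right_ok = (i + 1 < len(tokens)
                    and counts[(tokens[i], tokens[i + 1])] >= min_count)
        if not left_ok and not right_ok:
            suspects.append(i)
    return suspects


def recall_and_precision(flagged: Set[int],
                         true_errors: Set[int]) -> Tuple[float, float]:
    """Recall = hits / true errors; precision = hits / flagged positions."""
    hits = len(flagged & true_errors)
    recall = hits / len(true_errors) if true_errors else 0.0
    precision = hits / len(flagged) if flagged else 0.0
    return recall, precision


# Toy usage: two proofread sentences, then a sentence with a wrong last character.
corpus = [["我", "们", "是", "学", "生"], ["他", "们", "是", "老", "师"]]
counts = train_neighbor_counts(corpus)
flagged = flag_suspects(["我", "们", "是", "学", "升"], counts)
recall, precision = recall_and_precision(set(flagged), {4})
```

In a sketch like this, raising `min_count` flags more positions, which tends to raise recall and lower precision; that trade-off is the kind of performance factor the abstract refers to.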
Supported by the Shanxi Province Natural Science Fund (Item 981031)
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, Y. (2000). Automatic Lexical Errors Detecting of Chinese Texts Based on the Orderly-Neighborship. In: Tan, T., Shi, Y., Gao, W. (eds) Advances in Multimodal Interfaces — ICMI 2000. ICMI 2000. Lecture Notes in Computer Science, vol 1948. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40063-X_36
DOI: https://doi.org/10.1007/3-540-40063-X_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41180-2
Online ISBN: 978-3-540-40063-9
eBook Packages: Springer Book Archive