Abstract
Anaphora is a common phenomenon in discourses as well as an important research issue in the applications of natural language processing. In this paper, both intra-sentential and inter-sentential zero anaphora in Chinese texts are addressed. Unlike general rule-based approaches, our resolution method is embedded with a case-based reasoning mechanism which has the benefit of knowledge acquisition if the case size varies. In addition, the presented approach employs informative features with the help of two outer knowledge resources. Compared to rule-based approaches, our resolution to 1047 zero anaphora instances achieved 82% recall and 77% precision.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Mitkov, R.: Robust pronoun resolution with limited knowledge. In: Proceedings of the 18th International Conference on Computational Linguistics, pp. 869–875 (1998)
Xu, J.J.: Anaphora in Chinese Texts. China social science, Beijing (2003)
Hobbs, J.: Pronoun Resolution. Research Report 76-1, Department of Computer Sciences, City College, City University of New York (1976)
Lappin, S., Leass, H.: An Algorithm for Pronominal Anaphora Resolution. Computational Linguistics 20, 535–561 (1994)
Kennedy, C., Boguraev, B.: Anaphora for everyone: Pronominal anaphora resolution without a parser. In: Proceedings of the 16th International Conference on Computational Linguistics, pp. 113–118 (1996)
Dagan, I., Itai, A.: Automatic processing of large corpora for the resolution of anaphora references. In: Proceedings of the 13th International Conference on Computational Linguistics, pp. 330–332 (1990)
Mitkov, R., Richard, E., Orasan, C.: A new, fully automatic version of Mitkov’s knowledge-poor pronoun resolution method. In: Proceedings of the 3rd International Conference on Computational Linguistics and Intelligent Text Processing, pp. 168–186 (2002)
Liang, T., Wu, D.S.: Automatic Pronominal Anaphora Resolution in English Texts. International journal of Computational Linguistics and Chinese Language Processing 9, 1–20 (2004)
Modjeska, N.N., Markert, K., Nissim, M.: Using the Web in Machine Learning for Other-Anaphora Resolution. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pp. 176–183 (2003)
Wang, Y.K., Chen, Y.S., Hsu, W.L.: Empirical Study of Mandarin Chinese Discourse Analysis: An event-based approach. In: Proceedings of the 10th IEEE International Conference on Tools with AI, pp. 466–473 (1998)
Wang, N., Yuan, C.F., Wang, K.F., Li, W.J.: Anaphora Resolution in Chinese Financial News for Information Extraction. In: Proceedings of the 4th World Congress on Intelligent Control and Automation, pp. 2422–2426 (2002)
Yeh, C.L., Chen, Y.C.: Zero anaphora resolution in Chinese with shallow parsing. Journal of Chinese Language and Computing (to appear, 2005)
Converse, S.P.: Resolving Pronominal References in Chinese with the Hobbs Algorithm. In: Proceedings of the 4th SIGHAN Workshop on Chinese Language Processing, pp. 116–122 (2005)
Aamodt, A., Plaza, E.: Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches. AI Communications 7, 39–59 (1994)
Chen, K.J., Liu, S.H.: Word identification for Mandarin Chinese sentences. In: Proceedings of the 14th Conference on Computational linguistics, pp. 101–107 (1992)
Wang, H.F., Mei, Z.: Robust Pronominal Resolution within Chinese Text. Journal of Software 16, 700–707 (2005)
CKIP.: A study of Chinese Word Boundaries and Segmentation Standard for Information processing. Technical Report, Taiwan, Taipei, Academia Sinica (1996)
Ding, B.G., Huang, C.N., Huang, D.G.: Chinese Main Verb Identification: From Specification to Realization. International journal of Computational Linguistics and Chinese Language Processing 10, 53–94 (2005)
Liu, Y.H., Pan, W.Y., Gu, W.: Shiyong Xiandai Hanyu Yufa (Practical Modern Chinese Grammar). The Commercial Press (2002)
CKIP.: The content and illustration of Sinica corpus of Academia Sinica. Technical Report no. 95–02, Institute of Information Science, Academia Sinica (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, DS., Liang, T. (2006). A Case-Based Reasoning Approach to Zero Anaphora Resolution in Chinese Texts. In: Matsumoto, Y., Sproat, R.W., Wong, KF., Zhang, M. (eds) Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead. ICCPOL 2006. Lecture Notes in Computer Science(), vol 4285. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11940098_55
Download citation
DOI: https://doi.org/10.1007/11940098_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49667-0
Online ISBN: 978-3-540-49668-7
eBook Packages: Computer ScienceComputer Science (R0)