Linguistic Theory Based Contextual Evidence Mining for Statistical Chinese Co-Reference Resolution

Zhao, Jun; Liu, Fei-Fan

doi:10.1007/s11390-007-9076-9

Regular Paper
Published: 13 September 2007

Journal of Computer Science and Technology, Volume 22, pages 608–617 (2007)

Abstract

Within a statistical learning framework, this paper examines how traditional linguistic findings on anaphora resolution can guide the mining and organization of contextual features for Chinese co-reference resolution. The main contributions are as follows. (1) To simulate the “syntactic and semantic parallelism” factor, we extract a “bag of word forms and POS tags” feature and a “bag of semes” feature from the contexts of the entity mentions and incorporate them into the baseline feature set. (2) Because bags of word forms, POS tags, and semes are too coarse to determine the syntactic and semantic parallelism between two entity mentions, we propose a method for reconstructing contextual features based on semantic similarity computation, so that the reconstructed features better approximate the “syntactic and semantic parallelism preferences” factor in anaphora resolution. (3) We use an entity-mention-based contextual feature representation instead of an isolated-word-based one, and also enlarge the contextual windows, in order to approximate the “selectional restriction” factor in anaphora resolution. Experiments show that the multi-level contextual features are useful for co-reference resolution and that the statistical system incorporating these features performs well on the standard ACE datasets.
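
As a rough illustration of points (1) and (2), the sketch below builds a bag of word forms and POS tags from a small window around each mention, compares two bags by exact overlap, and then softens the comparison with a word-similarity function. It is only a minimal sketch under stated assumptions: the function names, the window size, the overlap measure, and the toy similarity function are all hypothetical, and the abstract does not specify the paper's actual feature definitions or similarity computation.

```python
from collections import Counter
from typing import Callable, List, Sequence, Tuple

Token = Tuple[str, str]  # (word form, POS tag)


def context_bag(tokens: List[Token], start: int, end: int, window: int = 3) -> Counter:
    """Bag of word forms and POS tags within `window` tokens on each side
    of the mention tokens[start:end] (window size is an assumption)."""
    left = tokens[max(0, start - window):start]
    right = tokens[end:end + window]
    bag: Counter = Counter()
    for word, pos in left + right:
        bag["w=" + word] += 1
        bag["p=" + pos] += 1
    return bag


def bag_overlap(a: Counter, b: Counter) -> float:
    """Multiset intersection over multiset union of two bags (exact matching)."""
    union = sum((a | b).values())
    return sum((a & b).values()) / union if union else 0.0


def soft_overlap(words_a: Sequence[str], words_b: Sequence[str],
                 sim: Callable[[str, str], float]) -> float:
    """Average, over words_a, of the best similarity to any word in words_b.
    `sim` is a placeholder for whatever semantic similarity measure is used."""
    if not words_a or not words_b:
        return 0.0
    return sum(max(sim(x, y) for y in words_b) for x in words_a) / len(words_a)


# Toy English sentence used purely for illustration.
tokens = [("the", "DT"), ("president", "NN"), ("visited", "VBD"), ("Paris", "NNP"),
          ("and", "CC"), ("he", "PRP"), ("visited", "VBD"), ("Berlin", "NNP")]

bag_np = context_bag(tokens, 1, 2, window=1)  # context of "president"
bag_pr = context_bag(tokens, 5, 6, window=1)  # context of "he"
exact = bag_overlap(bag_np, bag_pr)           # shared: "w=visited", "p=VBD"


def toy_sim(x: str, y: str) -> float:
    """Hypothetical similarity: 1.0 for identical words, 0.5 for one listed pair."""
    if x == y:
        return 1.0
    return 0.5 if {x, y} == {"visited", "toured"} else 0.0


soft = soft_overlap(["toured"], ["visited"], toy_sim)
print(round(exact, 3), soft)
```

With exact matching, "toured" and "visited" would contribute nothing to the overlap; the similarity-based comparison gives them partial credit, which is the kind of coarseness that point (2) aims to address.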

Author information

Authors and Affiliations

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, 100080, China
Jun Zhao & Fei-Fan Liu

Corresponding author

Correspondence to Jun Zhao.

Additional information

Supported by the National Natural Science Foundation of China under Grant Nos. 60372016, 60121302, 60673042, the National High Technology Development 863 Program of China under Grant No. 2006AA01Z144, and the Natural Science Foundation of Beijing under Grant No. 4052027.

Electronic Supplementary Material

Supplementary material - Chinese Abstract (PDF 84.8 Kb).

About this article

Cite this article

Zhao, J., Liu, FF. Linguistic Theory Based Contextual Evidence Mining for Statistical Chinese Co-Reference Resolution. J Comput Sci Technol 22, 608–617 (2007). https://doi.org/10.1007/s11390-007-9076-9

Received: 04 July 2006
Revised: 19 March 2007
Published: 13 September 2007
Issue Date: July 2007
DOI: https://doi.org/10.1007/s11390-007-9076-9
