skip to main content
10.1145/2382936.2382951acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Anaphora resolution in biomedical literature: a hybrid approach

Published: 07 October 2012 Publication History

Abstract

While traditional work on anaphora resolution has focused on resolving anaphors in newspaper and newswire articles, the surge of interest in biomedical natural language processing in recent years has stimulated work on anaphora resolution in biomedical texts. Existing anaphora resolvers, whether applied to the biomedical domain or not, have adopted either a learning-based or a rule-based approach. We hypothesize that both approaches have their unique strengths, and propose in this paper a hybrid approach to anaphora resolution in biomedical texts that aims to combine their strengths. Our hybrid approach achieves an F-score of 60.9 on the BioNLP-2011 coreference dataset, which to our knowledge is the best result reported to date on this dataset.

References

[1]
D. Bean and E. Riloff. Unsupervised learning of contextual role knowledge for coreference resolution. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004, pages 297--304, 2004.
[2]
J. Castaño, J. Zhang, and J. Pustejovsky. Anaphora resolution in biomedical literature. In Proceedings of the 2002 International Symposium on Reference Resolution, 2002.
[3]
P. Denis and J. Baldridge. Specialized models and ranking for coreference resolution. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pages 660--669, 2008.
[4]
C. Gasperin and T. Briscoe. Statistical anaphora resolution in biomedical texts. In Proceedings of the 22nd International Conference on Computational Linguistics, pages 257--264, 2008.
[5]
B. J. Grosz, A. K. Joshi, and S. Weinstein. Centering: A framework for modeling the local coherence of discourse. Computational Linguistics, 21(2):203--226, 1995.
[6]
B. J. Grosz and C. L. Sidner. Attention, intentions, and the structure of discourse. Computational Linguistics, 12(3):175--204, 1986.
[7]
S. Harabagiu, R. Bunescu, and S. Maiorano. Text and knowledge mining for coreference resolution. In Proceedings of the 2nd Meeting of the North American Chapter of the Association for Computational Linguistics, pages 55--62.
[8]
C. Huang, Y. Wang, Y. Zhang, Y. Jin, and Z. Yu. Coreference resolution in biomedical full-text articles with domain dependent features. In Proceedings of the 2nd International Conference on Computer Technology and Development, 2010.
[9]
R. Iida, K. Inui, H. Takamura, and Y. Matsumoto. Incorporating contextual cues in trainable models for coreference resolution. In Proceedings of the EACL Workshop on The Computational Treatment of Anaphora, 2003.
[10]
J. jae Kim and J. C. Park. BioAR: Anaphora resolution for relating pronoun names to proteome database entries. In Proceedings of the ACL Workshop on Reference Resolution and its Applications, pages 79--86, 2004.
[11]
T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 133--142, 2002.
[12]
J.-D. Kim, T. Ohta, and J. Tsujii. Corpus annotation for mining biomedical events from literature. BMC Bioinformatics, 9(1):10, 2008.
[13]
Y. Kim, E. Riloff, and N. Gilbert. The taming of Reconcile as a biomedical coreference resolver. In Proceedings of the BioNLP Shared Task 2011 Workshop, pages 89--93, 2011.
[14]
Y.-H. Lin and T. Liang. Pronominal and sortal anaphora resolution for biomedical literature. In Proceedings of the 16th Conference on Computational Linguistics and Speech Processing, 2004.
[15]
D. McClosky, E. Charniak, and M. Johnson. Reranking and self-training for parser adaptation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pages 337--344, 2006.
[16]
R. Mitkov. Robust pronoun resolution with limited knowledge. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, pages 869--875, 1998.
[17]
M. Miwa, P. Thompson, and S. Ananiadou. Boosting automatic event extraction from the literature using domain adaptation and coreference resolution. Bioinformatics (Advance Access), 2012.
[18]
A. Moschitti. Making tree kernels practical for natural language processing. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, pages 113--120, 2006.
[19]
MUC-6. Proceedings of the Sixth Message Understanding Conference. Morgan Kaufmann, San Francisco, CA, 1995.
[20]
MUC-7. Proceedings of the Seventh Message Understanding Conference. Morgan Kaufmann, San Francisco, CA, 1998.
[21]
V. Ng. Semantic class induction and coreference resolution. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 536--543, 2007.
[22]
V. Ng. Supervised noun phrase coreference research: The first fifteen years. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 1396--1411, 2010.
[23]
V. Ng and C. Cardie. Improving machine learning approaches to coreference resolution. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 104--111, 2002.
[24]
N. Nguyen, J.-D. Kim, and J. Tsujii. Overview of the protein coreference task in BioNLP shared task 2011. In Proceedings of the BioNLP Shared Task 2011 Workshop, pages 74--82, 2011.
[25]
S. P. Ponzetto and M. Strube. Exploiting semantic role labeling, WordNet and Wikipedia for coreference resolution. In Proceedings of the Human Language Technology Conference and Conference of the North American Chapter of the Association for Computational Linguistics, pages 192--199, 2006.
[26]
A. Rahman and V. Ng. Supervised models for coreference resolution. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 968--977, 2009.
[27]
A. Rahman and V. Ng. Coreference resolution with world knowledge. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 814--824, 2011.
[28]
W. M. Soon, H. T. Ng, and D. C. Y. Lim. A machine learning approach to coreference resolution of noun phrases. Computational Linguistics, 27(4):521--544, 2001.
[29]
V. Stoyanov, C. Cardie, N. Gilbert, E. Riloff, D. Buttler, and D. Hysom. Reconcile: A coreference resolution research platform. In Proceedings of the ACL 2010 Conference Short Papers, 2010.
[30]
J. Su, X. Yang, H. Hong, Y. Tateisi, and J. Tsujii. Coreference resolution in biomedical texts: A machine learning approach. In Ontologies and Text Mining for Life Sciences: Current Status and Future Perspectives, 2008.
[31]
Y. Tateisi, A. Yakushiji, T. Ohta, and J. Tsujii. Syntax annotation for the Genia corpus. In Proceedings of the Second Interational Joint Conference on Natural Language Processing, pages 222--227, 2005.
[32]
M. Torii and K. Vijay-Shanker. Anaphora resolution of demonstrative noun phrases in medline abstracts. In Proceedings of 2005 Pacific-Asia Conference on Computational Linguistics, pages 332--339, 2005.
[33]
K. van Deemter and R. Kibble. On coreferring: Coreference in MUC and related annotation schemes. Computational Linguistics, 26(4):629--637, 2000.
[34]
X. Yang and J. Su. Coreference resolution using semantic relatedness information from automatically discovered patterns. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, pages 528--535, 2007.
[35]
X. Yang, J. Su, and C. L. Tan. Kernel based pronoun resolution with structured syntactic knowledge. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pages 41--48, 2006.
[36]
X. Yang, J. Su, G. Zhou, and C. L. Tan. An NP-cluster based approach to coreference resolution. In Proceedings of the 20th International Conference on Computational Linguistics, pages 226--232, 2004.
[37]
X. Yang, G. Zhou, J. Su, and C. L. Tan. Coreference resolution using competitive learning approach. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pages 176--183, 2003.

Cited By

View all
  • (2024)Integrating K+ Entities Into Coreference Resolution on Biomedical TextsIEEE/ACM Transactions on Computational Biology and Bioinformatics10.1109/TCBB.2024.344727321:6(2145-2155)Online publication date: Nov-2024
  • (2023)Development of the Co-reference Resolution Tagged Data set in Assamese @ A Semi-Automated Approach2023 IEEE Guwahati Subsection Conference (GCON)10.1109/GCON58516.2023.10183580(1-4)Online publication date: 23-Jun-2023
  • (2022)Distinguished representation of identical mentions in bio-entity coreference resolutionBMC Medical Informatics and Decision Making10.1186/s12911-022-01862-122:1Online publication date: 30-Apr-2022
  • Show More Cited By

Index Terms

  1. Anaphora resolution in biomedical literature: a hybrid approach

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    BCB '12: Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
    October 2012
    725 pages
    ISBN:9781450316705
    DOI:10.1145/2382936
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 October 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. anaphora resolution
    2. bioinformatics
    3. coreference resolution

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    BCB' 12
    Sponsor:

    Acceptance Rates

    BCB '12 Paper Acceptance Rate 33 of 159 submissions, 21%;
    Overall Acceptance Rate 254 of 885 submissions, 29%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)16
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 20 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Integrating K+ Entities Into Coreference Resolution on Biomedical TextsIEEE/ACM Transactions on Computational Biology and Bioinformatics10.1109/TCBB.2024.344727321:6(2145-2155)Online publication date: Nov-2024
    • (2023)Development of the Co-reference Resolution Tagged Data set in Assamese @ A Semi-Automated Approach2023 IEEE Guwahati Subsection Conference (GCON)10.1109/GCON58516.2023.10183580(1-4)Online publication date: 23-Jun-2023
    • (2022)Distinguished representation of identical mentions in bio-entity coreference resolutionBMC Medical Informatics and Decision Making10.1186/s12911-022-01862-122:1Online publication date: 30-Apr-2022
    • (2022)Arabic Anaphora Resolution System Using New Features: Pronominal and Verbal CasesAnalysis and Application of Natural Language and Speech Processing10.1007/978-3-031-11035-1_5(101-121)Online publication date: 4-Aug-2022
    • (2021)A Comparative Study of Linguistic and Computational Features Based on a Machine Learning for Arabic Anaphora ResolutionProcedia Computer Science10.1016/j.procs.2021.05.068189(37-47)Online publication date: 2021
    • (2020)Named Entity Recognition and Relation Detection for Biomedical Information ExtractionFrontiers in Cell and Developmental Biology10.3389/fcell.2020.006738Online publication date: 28-Aug-2020
    • (2016)Coreference resolution improves extraction of Biological Expression Language statements from textsDatabase10.1093/database/baw0762016(baw076)Online publication date: 3-Jul-2016
    • (2016)Towards a Procedure Model for Developing Anaphora Processing ApplicationsAnaphora Resolution10.1007/978-3-662-47909-4_16(457-484)Online publication date: 5-Aug-2016
    • (2014)EXPLORING A SUBGRAPH MATCHING APPROACH FOR EXTRACTING BIOLOGICAL EVENTS FROM LITERATUREComputational Intelligence10.1111/coin.1200930:3(600-635)Online publication date: 1-Aug-2014
    • (2014)Coreference resolution in biomedical texts2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM.2014.6999392(12-14)Online publication date: Nov-2014

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media