Abstract
Semantic information is one of the indispensable ingredients needed to raise the performance of anaphora resolution, both for pronominal anaphors and for anaphoric definite descriptions, beyond the baseline level. In contrast to hard criteria such as binding and agreement constraints, however, the question of semantic constraints and preferences, and of their operationalization in a system that performs anaphora resolution, is more complex, and a larger variety of solutions can be found in practice.
Notes
- 1.
TüBa-D/Z corpus, sentence 190.
- 2.
Translated from TüBa-D/Z corpus, sentence 2015.
- 3.
Clear is meant in the sense that it holds up to lexicographic criteria, as opposed to the contents of both terms being incomparable logically.
- 4.
A small subset of the Wall Street Journal section of the Penn Treebank.
- 5.
- 6.
ACE-2, document NWIRE/APW19980213.1305.
- 7.
Brants, Thorsten, and Alex Franz. Web 1T 5-gram Version 1 LDC2006T13. Web Download. Philadelphia: Linguistic Data Consortium, 2006.
- 8.
- 9.
Both the error analysis code and the code for Martschat’s CoRT system are publicly available at https://github.com/smartschat/cort
References
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, pp. 86–90 (1998)
Bansal, M., Klein, D.: Coreference semantics from web features. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012), Jeju Island, pp. 389–398 (2012)
Bean, D., Riloff, E.: Unsupervised learning of contextual role knowledge for coreference resolution. In: Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2004), Boston, pp. 297–304 (2004)
Bengtson, E., Roth, D.: Understanding the value of features for coreference resolution. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Honolulu, pp. 294–303 (2008)
Berland, M., Charniak, E.: Finding parts in very large corpora. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (ACL 1999), College Park, pp. 57–64 (1999)
Björkelund, A., Farkas, R.: Data-driven multilingual coreference resolution using resolver stacking. In: Joint Conference on EMNLP and CoNLL – Shared Task, Jeju Island, pp. 49–55 (2012)
Björkelund, A., Kuhn, J.: Learning structured perceptrons for coreference resolution with latent antecedents and non-local features. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), Baltimore, pp. 47–57 (2014)
Bollacker, K., Evans, C., Paritosh, P., Sturge, T., Taylor, J.: Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD ’08. ACM, New York, pp. 1247–1250 (2008). doi:10.1145/1376616.1376746
Bunescu, R.: Associative anaphora resolution: a web-based approach. In: Proceedings of the EACL 2003 Workshop on the Computational Treatment of Anaphora, Budapest, pp. 47–52 (2003)
Burnard, L. (ed.): Users Reference Guide British National Corpus Version 1.0. Oxford University Computing Service, Oxford (1995)
Carter, D.M.: Common sense inference in a focus-guided anaphor resolver. J. Semant. 4, 237–246 (1985)
Charniak, E.: Toward a model of children’s story comprehension. Ph.D. thesis, MIT Computer Science and Artificial Intelligence Lab (CSAIL) (1972)
Dagan, I., Itai, A.: Automatic processing of large corpora for the resolution of anaphora references. In: Papers Presented to the 13th International Conference on Computational Linguistics, Helsinki (1990)
Dagan, I., Justeson, J., Lappin, S., Leass, H., Ribak, A.: Syntax and lexical statistics in anaphora resolution. Appl. Artif. Intell. 9, 633–644 (1995)
Dalton, J., Dietz, L.: A neighborhood relevance model for entity linking. In: Proceedings of the 10th Conference on Open Research Areas in Information Retrieval, Lisbon, pp. 149–156 (2013)
Daumé III, H., Marcu, D.: A large-scale exploration of effective global features for a joint entity detection and tracking model. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP 2005), Vancouver, pp. 97–104 (2005)
Durrett, G., Klein, D.: Easy victories and uphill battles in coreference resolution. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013), Seattle, pp. 1971–1982 (2013)
Durrett, G., Klein, D.: A joint model for entity analysis: coreference, typing and linking. Trans. Assoc. Comput. Linguist. 2, 477–490 (2014)
Fernandes, E.R., dos Santos, C.N., Milidiu, R.L.: Latent trees for coreference resolution. Comput. Linguist. 40 (4), 801–835 (2014)
Fleischman, M., Hovy, E., Echihabi, A.: Offline strategies for online question answering: answering questions before they are asked. In: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL 2003), Sapporo, pp. 1–7 (2003)
Garera, N., Yarowsky, D.: Resolving and generating definite anaphora by modeling hypernymy using unlabeled corpora. In: Proceedings of the Tenth Conference on Computational Natural Language Learning (CoNLL-X), New York, pp. 37–44 (2006)
Gasperin, C., Vieira, R.: Using word similarity lists for resolving indirect anaphora. In: ACL’04 Workshop on Reference Resolution and Its Applications, Barcelona, pp. 40–46 (2004)
Gasperin, C., Gamallo, P., Augustini, A., Lopes, G., de Lima, V.: Using syntactic contexts for measuring word similarity. In: Proceedings of the ESSLLI 2001 Workshop on Knowledge Acquisition and Categorization, Helsinki, pp. 18–23 (2001)
Ge, N., Hale, J., Charniak, E.: A statistical approach to anaphora resolution. In: Proceedings of the Sixth Workshop on Very Large Corpora (WVLC/EMNLP 1998), Montreal, pp. 161–171 (1998)
Giles, J.: Internet encyclopedias go head to head. Nature 438 (7070), 900–901 (2005)
Hajishirzi, H., Zilles, L., Weld, D.S., Zettlemoyer, L.: Joint coreference resolution and named-entity linking with multi-pass sieves. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013), Seattle, pp. 289–299 (2013)
Harabagiu, S., Bunescu, R., Maiorano, S.: Text and knowledge mining for coreference resolution. In: Proceedings of the 2nd Meeting of the North American Chapter of the Association of Computational Linguistics (NAACL-2001), Pittsburgh, pp. 55–62 (2001)
Hearst, M.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th International Conference on Computational Linguistics (COLING 92), Nantes, pp. 539–545 (1992)
Ji, H., Westbrook, D., Grishman, R.: Using semantic relations to refine coreference decisions. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP 2005), Vancouver, pp. 17–24 (2005)
Ji, H., Grishman, R., Dang, H.T., Griffin, K., Ellis, J.: Overview of the TAC 2010 knowledge base population track. In: Text Analysis Conference (TAC 2010), Gaithersburg (2010)
Ji, H., Nothman, J., Hachey, B.: Overview of TAC-KBP2014 entity discovery and linking tasks. In: Proceedings of Text Analysis Conference (TAC 2014), Gaithersburg (2014)
Kameyama, M.: Recognizing referential links: an information extraction perspective. In: ACL Workshop on Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts, Madrid, pp. 46–53 (1997)
Kehler, A., Appelt, D., Taylor, L., Simma, A.: The (non)utility of predicate-argument frequencies for pronoun interpretation. In: Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2004), Boston, pp. 289–296 (2004)
Kipper, K., Dang, H.T., Palmer, M.: Class-based construction of a verb lexicon. In: Proceedings of the Seventeenth National Conference on Artificial Intelligence (AAAI 2000), Austin, pp. 691–696 (2000)
Klebanov, B., Wiemer-Hastings, P.M.: Using LSA for pronominal anaphora resolution. In: Proceedings of the Computational Linguistics and Intelligent Text Processing, Third International Conference (CICLing 2002), Mexico City, pp. 197–199 (2002)
Kummerfeld, J.K., Klein, D.: Error-driven analysis of challenges in coreference resolution. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013), Seattle, pp. 265–277 (2013)
Kunze, C., Lemnitzer, L.: GermaNet – representation, visualization, application. In: Proceedings of LREC 2002, Las Palmas (2002)
Landauer, T.K., Dumais, S.T.: A solution to Plato’s problem: the latent semantic analysis theory of acquisition. Psychol. Rev. 104 (2), 211–240 (1997)
Leacock, C., Chodorow, M.: Combining local context and WordNet similarity for word sense identification. In: Fellbaum, C. (ed.) WordNet, an Electronic Lexical Database, pp. 265–283. MIT, Cambridge (1998)
Lee, H., Chang, A., Peirsman, Y., Chambers, N., Surdeanu, M., Jurafsky, D.: Deterministic coreference resolution based on entity-centric, precision-ranked rules. Comput. Linguist. 39 (4), 885–916 (2013)
Lin, D.: University of Manitoba: description of the PIE system used for MUC-6. In: Proceedings of the 6th Message Understanding Conference (MUC-6), Columbia, pp. 113–126 (1995)
Lin, D.: Automatic retrieval and clustering of similar words. In: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (CoLing-ACL 1998), Montreal, pp. 768–774 (1998)
Lin, D., Church, K., Ji, H., Sekine, S., Yarowsky, D., Bergsma, S., Patil, K., Pitler, E., Lathbury, R., Rao, V., Dalwani, K., Narsale, S.: New tools for web-scale n-grams. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC 2010), Valletta (2010)
Luo, X., Ittycheriah, A., Jing, H., Kambhatla, N., Roukos, S.: A mention-synchronous coreference resolution algorithm based on the Bell tree. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL 2004), Barcelona, pp. 135–142 (2004)
Markert, K., Nissim, M.: Comparing knowledge sources for nominal anaphora resolution. Comput. Linguist. 31 (3), 367–402 (2005)
Markert, K., Nissim, M., Modjeska, N.N.: Using the web for nominal anaphora resolution. In: Proceedings of the 2003 EACL Workshop on the Computational Treatment of Anaphora, Budapest (2003)
Martschat, S.: Multigraph clustering for unsupervised coreference resolution. In: Proceedings of the ACL Student Research Workshop, Sofia (2013)
Martschat, S., Strube, M.: Recall error analysis for coreference resolution. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), Doha, pp. 2070–2081 (2014)
Mendes, P.N., Jakob, M., Bizer, C.: DBpedia: a multilingual cross-domain knowledge base. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC-2012), Istanbul, pp. 1813–1817 (2012)
Miller, G.A., Fellbaum, C.: Semantic networks of English. Cognition 41, 197–229 (1991)
Miller, G.A., Hristea, F.: WordNet nouns: classes and instances. Comput. Linguist. 32 (1), 1–3 (2006). doi:10.1162/coli.2006.32.1.1
Milne, D., Witten, I.H.: Learning to link with Wikipedia. In: Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), Napa Valley, pp. 509–518 (2008)
Nedoluzhko, A., Mírovský, J.: How dependency trees and tectogrammatics help annotating coreference and bridging relations in Prague Dependency Treebank. In: Proceedings of the Second International Conference on Dependency Linguistics (DepLing 2013), Prague, pp. 244–251 (2013)
Ng, V.: Shallow semantics for coreference resolution. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007), Hyderabad, pp. 1689–1694 (2007)
Ng, V., Cardie, C.: Improving machine learning approaches to coreference resolution. In: 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, pp. 104–111 (2002)
Padó, S., Lapata, M.: Dependency-based construction of semantic space models. Comput. Linguist. 33 (2), 161–199 (2007)
Poesio, M., Vieira, R., Teufel, S.: Resolving bridging descriptions in unrestricted text. In: ACL-97 Workshop on Operational Factors in Practical, Robust, Anaphora Resolution for Unrestricted Texts, Madrid, pp. 1–6 (1997)
Poesio, M., Schulte im Walde, S., Brew, C.: Lexical clustering and definite description interpretation. In: AAAI Spring Symposium on Learning for Discourse, Stanford, pp. 82–89 (1998)
Poesio, M., Mehta, R., Maroudas, A., Hitzeman, J.: Learning to resolve bridging references. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics (ACL 2004), Barcelona, pp. 143–150 (2004)
Ponzetto, S.P.: Knowledge Acquisition from a Collaboratively Generated Encyclopedia, Dissertations in Artificial Intelligence, vol. 327. IOS Press, Amsterdam (2010)
Ponzetto, S.P., Strube, M.: Exploiting semantic role labeling, WordNet and Wikipedia for coreference resolution. In: Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL 2006), New York, pp. 192–199 (2006)
Ponzetto, S.P., Strube, M.: Deriving a large-scale taxonomy from Wikipedia. In: Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence (AAAI 2007), pp. 1440–1445 (2007)
Pradhan, S., Ramshaw, L., Weischedel, R., MacBride, J., Micciulla, L.: Unrestricted coreference: identifying entities and events in OntoNotes. In: Proceedings of the IEEE International Conference on Semantic Computing (ICSC), Irvine (2007)
Pradhan, S., Ramshaw, L., Marcus, M., Palmer, M., Weischedel, R., Xue, N.: CoNLL-2011 shared task: modeling unrestricted coreference in OntoNotes. In: Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, Portland, pp. 1–27 (2011)
Pradhan, S., Moschitti, A., Xue, N., Uryupina, O., Zhang, Y.: CoNLL-2012 shared task: modeling multilingual unrestricted coreference in OntoNotes. In: Joint Conference on EMNLP and CoNLL – Shared Task, Jeju Island, pp. 1–40 (2012)
Rada, R., Mili, H., Bicknell, E., Blettner, M.: Development and application of a metric to semantic nets. IEEE Trans. Syst. Man Cybern. 19 (1), 17–30 (1989)
Rahman, A., Ng, V.: Supervised models for coreference resolution. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP09), Singapore, pp. 968–977 (2009)
Rahman, A., Ng, V.: Coreference resolution with world knowledge. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL 2011), Portland, pp. 814–824 (2011)
Ratinov, L., Roth, D.: Design challenges and misconceptions in named entity recognition. In: Proceedings of the Conference on Computational Natural Language Learning (CoNLL), Boulder, pp. 147–155 (2009)
Ratinov, L., Roth, D.: Learning-based multi-sieve coreference resolution with knowledge. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL 2012), Jeju Island, pp. 1234–1244 (2012)
Ratinov, L., Downey, D., Anderson, M., Roth, D.: Local and global algorithms for disambiguation to Wikipedia. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL 2011), Portland, pp. 1375–1384 (2011)
Ravichandran, D., Pantel, P., Hovy, E.: Randomized algorithms and NLP: using locality sensitive hash function for high speed noun clustering. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), Ann Arbor, pp. 622–629 (2005)
Recasens, M., Màrquez, L., Sapena, E., Martí, M.A., Taulé, M., Hoste, V., Poesio, M., Versley, Y.: SemEval task 1: coreference resolution in multiple languages. In: Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval 2010), Los Angeles, pp. 1–8 (2010)
Recasens, M., Can, M., Jurafsky, D.: Same referent, different words: unsupervised mining of opaque coreferent mentions. In: Proceedings of NAACL-HLT 2013, Atlanta (2013)
Seco, N., Veale, T., Hayes, J.: An intrinsic information content metric for semantic similarity in WordNet. In: Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004), Valencia, pp. 1089–1090 (2004)
Soon, W.M., Ng, H.T., Lim, D.C.Y.: A machine learning approach to coreference resolution of noun phrases. Comput. Linguist. 27 (4), 521–544 (2001)
Stoyanov, V., Gilbert, N., Cardie, C., Riloff, E.: Conundrums in noun phrase coreference resolution: making sense of the state of the art. In: Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the AFNLP (ACL/IJCNLP 2009), Singapore, pp. 656–664 (2009)
Stoyanov, V., Cardie, C., Gilbert, N., Riloff, E., Buttler, D., Hysom, D.: Coreference resolution with Reconcile. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, pp. 156–161 (2010)
Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: a core of semantic knowledge unifying WordNet and Wikipedia. In: Proceedings of the 16th World Wide Web Conference (WWW 2007), Banff, pp. 697–706 (2007)
Telljohann, H., Hinrichs, E.W., Kübler, S., Zinsmeister, H., Beck, K.: Stylebook for the Tübingen Treebank of Written German (TüBa-D/Z). Technical Report, Seminar für Sprachwissenschaft, Universität Tübingen (2009)
Uryupina, O., Poesio, M., Giuliano, C., Tymoshenko, K.: Disambiguation and filtering methods in using Web knowledge for coreference resolution. In: Proceedings of the Twenty-Fourth International FLAIRS Conference (FLAIRS 2011), Palm Beach (2011)
Versley, Y.: A constraint-based approach to noun phrase coreference resolution in German newspaper text. In: Konferenz zur Verarbeitung Natürlicher Sprache (KONVENS 2006), Konstanz, pp. 143–150 (2006)
Versley, Y.: Antecedent selection techniques for high-recall coreference resolution. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), Prague, pp. 496–505 (2007)
Versley, Y., Ponzetto, S., Poesio, M., Eidelman, V., Jern, A., Smith, J., Yang, X., Moschitti, A.: BART: a modular toolkit for coreference resolution. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Demo Session (ACL 2008 Demo), Columbus, pp. 9–12 (2008)
Vieira, R., Poesio, M.: An empirically based system for processing definite descriptions. Comput. Linguist. 26 (4), 539–593 (2000)
Wu, Z., Palmer, M.: Verb semantics and lexical selections. In: Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics (ACL 1994), Las Cruces, pp. 133–138 (1994)
Yang, X., Su, J.: Coreference resolution using semantic relatedness information from automatically discovered patterns. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), Prague, pp. 528–535 (2007)
Zheng, J., Vilnis, L., Singh, S., Choi, J., McCallum, A.: Dynamic knowledge-base alignment for coreference resolution. In: Proceedings of the Seventeenth Conference on Computational Natural Language Learning (CoNLL 2013), Sofia, pp. 153–162 (2013)
Copyright information
© 2016 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Versley, Y., Poesio, M., Ponzetto, S. (2016). Using Lexical and Encyclopedic Knowledge. In: Poesio, M., Stuckardt, R., Versley, Y. (eds) Anaphora Resolution. Theory and Applications of Natural Language Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-47909-4_14
DOI: https://doi.org/10.1007/978-3-662-47909-4_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-47908-7
Online ISBN: 978-3-662-47909-4
eBook Packages: Computer Science, Computer Science (R0)