Abstract
Named entity linking or named entity disambiguation is to link entity mentions to corresponding entities in a knowledge base for resolving the ambiguity of entity mentions. Recently, collective linking methods exploit document-level coherence of the referenced entities by computing a pairwise score between candidates of a pair of named entity mentions (e.g., Raytheon and Boeing) in a document. However, in a document, named entity mentions are significantly less frequent than anonymous entity mentions (e.g., defense contractor and the company). In this paper, we propose a method, DOCument-level Anonymous Entity Type words relatedness (DOC-AET), to exploit the document-level coherence between candidate entities and anonymous entity mentions. We use the anonymous entity type (AET) words to extract anonymous entity mentions. We learn embeddings of AET words from their inter-paragraph co-occurrence matrix; thus, the document-level entity-type relatedness is encoded in the AET word embeddings. Then, we compute the coherence scores between candidate entities and anonymous entity mentions using the AET entity embeddings and document context embeddings. By incorporating such coherence scores for candidates ranking, DOC-AET has achieved new state-of-the-art results on two of the five out-domain test sets for named entity linking.




Similar content being viewed by others
Notes
We use italic font to represent anonymous entity mentions.
References
Arora S, Li Y, Liang Y, Ma T, Risteski A (2018) Linear algebraic structure of word senses, with applications to polysemy. Trans Assoc Comput Linguist 6:483–495. https://doi.org/10.1162/tacl_a_00034
Bhowmik R, de Melo G (2018) Generating fine-grained open vocabulary entity type descriptions. In: Proceedings of the 56th annual meeting of the association for computational linguistics. https://doi.org/10.18653/v1/P18-1081
Chen S, Wang J, Jiang F, Lin CY (2020) Improving entity linking by modeling latent entity type information. In: Proceedings of the 34th AAAI conference on artificial intelligence, pp 7529–7537 . https://doi.org/10.1609/aaai.v34i05.6251
Cheng X, Roth D (2013) Relational inference for wikification. In: Proceedings of the 2013 conference on empirical methods in natural language processing, pp 1787–1796. https://www.aclweb.org/anthology/D13-1184
Chisholm A, Hachey B (2015) Entity disambiguation with web links. Trans Assoc Comput Linguist 3:145–156. https://doi.org/10.1162/tacl_a_00129
Choi E, Levy Omer, Choi Yejin, Zettlemoyer Luke (2018) Ultra-fine entity typing. In: Proceedings of the 56th annual meeting of the association for computational linguistics. Assoc Comput Linguist, pp 87–96. https://doi.org/10.18653/v1/P18-1009
Cui H, Peng T, Feng L, Bao T, Liu L (2021) Simple question answering over knowledge graph enhanced by question pattern classification. Knowl Inf Syst 63(10):2741–2761
Durrett G, Klein D (2014) A joint model for entity analysis: coreference, typing, and linking. Trans Assoc Comput Linguist
Ensan F, Du W (2019) Ad hoc retrieval via entity linking and semantic similarity. Knowl Inf Syst 58(3):551–583
Gabrilovich Evgeniy, Ringgaard Michael, Subramanya Amarnag (2013) FACC1: Freebase annotation of ClueWeb corpora, version 1 (release date 2013-06-26, format version 1, correction level 0)
Fang W, Zhang J, Wang D, Chen Z, Li M (2016) Entity disambiguation by knowledge and text jointly embedding. In: Proceedings of The 20th SIGNLL conference on computational natural language learning, pp 260–269. Association for computational linguistics, Berlin, Germany. https://doi.org/10.18653/v1/K16-1026.https://www.aclweb.org/anthology/K16-1026
Ganea OE, Ganea M, Lucchi A, Eickhof, C, Hofmann T (2016) Probabilistic bag-of-hyperlinks model for entity linking. In: Proceedings of the 25th international conference on world wide web, pp 927–938. International World Wide Web Conferences Steering Committee
Ganea OE, Hofmann T (2017) Deep joint entity disambiguation with local neural attention. In: Proceedings of the 2017 conference on empirical methods in natural language processing, pp 2619–2629. Association for Computational Linguistics, Copenhagen, Denmark . https://doi.org/10.18653/v1/D17-1277.https://www.aclweb.org/anthology/D17-1277
Gillick D, Lazic N, Ganchev K, Kirchner J, Huynh D (2014) Contextdependent fine-grained entity type tagging. arXiv preprint arXiv:1412.1820
Globerson A, Lazic N, Chakrabarti S, Subramanya A, Ringaard M, Pereira F (2016) Collective entity resolution with multi-focal attention. In: Proceedings of the 54th annual meeting of the association for computational linguistics (vol 1: Long Papers), pp 621–631. https://doi.org/10.18653/v1/P16-1059.https://www.aclweb.org/anthology/P16-1059
Guo Z, Barbosa D (2018) Robust named entity disambiguation with random walks. Sem Web (Preprint) 9(4):459–479
Gupta N, Singh S, Roth D (2017) Entity linking via joint encoding of types, descriptions, and context. In: Proceedings of the 2017 conference on empirical methods in natural language processing, pp 2681–2690. Association for Computational Linguistics, Copenhagen, Denmark . https://doi.org/10.18653/v1/D17-1284.https://www.aclweb.org/anthology/D17-1284
Hoffart J, Yosef MA, Bordino I, Fürstenau H, Pinkal M, Spaniol M, Taneva B, Thater S, Weikum G (2011) Robust disambiguation of named entities in text. In: Proceedings of the 2011 conference on empirical methods in natural language processing, pp 782–792. Association for Computational Linguistics . http://www.aclweb.org/anthology/D11-1072
Hoffmann R, Zhang C, Ling X, Zettlemoyer L, Weld DS (2011) Knowledge-based weak supervision for information extraction of overlapping relations. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, pp 541–550. Association for Computational Linguistics, Portland, Oregon, USA. https://www.aclweb.org/anthology/P11-1055
Hou F, Wang R, He J, Zhou Y (2020) Improving entity linking through semantic reinforced entity embeddings. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 6843–6848. Association for Computational Linguistics, Online . https://doi.org/10.18653/v1/2020.acl-main.612.https://www.aclweb.org/anthology/2020.acl-main.612
Hou F, Wang R, Zhou Y (2021) Transfer learning for fine-grained entity typing. Knowl Inf Syst 63(4):845–866
Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980
Kouki P, Pujara J, Marcum C, Koehly L, Getoor L (2019) Collective entity resolution in multi-relational familial networks. Knowl Inf Syst 61(3):1547–1581
Lazic N, Subramanya A, Ringgaard M, Pereira F (2015) Plato: a selective context model for entity resolution. Trans Assoc Comput Linguist 3:503–515. https://doi.org/10.1162/tacl_a_00154https://www.aclweb.org/anthology/Q15-1036
Le P, Titov I (2018) Improving entity linking by modeling latent relations between mentions. In: Proceedings of the 56th annual meeting of the association for computational linguistics (vol 1: Long Papers), pp 1595–1604. Association for Computational Linguistics, Melbourne, Australia . https://doi.org/10.18653/v1/P18-1148.https://www.aclweb.org/anthology/P18-1148
Le P, Titov I (2019) Boosting entity linking performance by leveraging unlabeled documents. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 1935–1945. Association for Computational Linguistics, Florence, Italy. https://www.aclweb.org/anthology/P19-1187
Ling X, Weld DS (2012) Fine-grained entity recognition. In: Proceedings of association for the advancement of artificial intelligence
Liu M, Zhao Y, Qin B, Liu T (2019) Collective entity linking: a random walk-based perspective. Knowl Inf Syst 60(3):1611–1643
Mikolov T, Sutskever Ilya, Chen Kai, Corrado Greg, Dean Jeffrey (2013) Distributed representations of words and phrases and their compositionality. In: Adv Neural Inf Process Syst, pp 3111–3119
Milne D, Witten IH (2008) Learning to link with wikipedia. In: Proceedings of the 17th ACM conference on information and knowledge management, pp 509–518. ACM
Mu J, Bhat S, Viswanath P (2017) Geometry of polysemy. In: Proceedings of the 5th international conference on learning representations
Murphy KP (2012) Machine learning: a probabilistic perspective. MIT press
Pennington J, Socher R, Manning C (2014) GloVe: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543. Association for Computational Linguistics, Doha, Qatar. https://doi.org/10.3115/v1/D14-1162.https://www.aclweb.org/anthology/D14-1162
Phan MC, Sun A, Tay Y, Han J, Li C (2019) Pair-linking for collective entity disambiguation: two could be better than all. IEEE Trans Knowl Data Eng 31(7):1383–1396
Ratinov L, Roth D, Downey D, Anderson M (2011) Local and global algorithms for disambiguation to wikipedia. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, vol 1, pp 1375–1384. Association for Computational Linguistics. https://www.aclweb.org/anthology/P11-1138
Shen W, Wang J, Han J (2015) Entity linking with a knowledge base: issues, techniques, and solutions. IEEE Trans Knowl Data Eng 27(2):443–460
Yaghoobzadeh Y, Adel H, Schutze H (2018) Corpus-level fine-grained entity typing. J Artif Intell Res 61:835–862
Yaghoobzadeh Y, Kann K, Hazen TJ, Agirre E, Schütze H (2019) Probing for semantic classes: Diagnosing the meaning content of word embeddings. In: Proceedings of the 57th Annual meeting of the association for computational linguistics, pp 5740–5753. Association for Computational Linguistics, Florence, Italy. https://www.aclweb.org/anthology/P19-1574
Yamada I, Shindo H, Takeda H, Takefuji Y (2016) Joint learning of the embedding of words and entities for named entity disambiguation. In: Proceedings of The 20th SIGNLL conference on computational natural language learning, pp 250–259. Association for Computational Linguistics, Berlin, Germany. https://doi.org/10.18653/v1/K16-1025.https://www.aclweb.org/anthology/K16-1025
Yamada I, Shindo H, Takeda H, Takefuji Y (2017) Learning distributed representations of texts and entities from knowledge base. Trans Assoc Comput Linguist 5:397–411
Yamada I, Washio K, Shindo H, Matsumoto Y (2020) Global entity disambiguation with pretrained contextualized embeddings of words and entities. arXiv preprint arXiv:1909.00426
Yang X, Gu X, Lin S, Tang S, Zhuang Y, Wu F, Chen Z, Hu G, Ren X (2019) Learning dynamic context augmentation for global entity linking. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 271–281. Association for Computational Linguistics, Hong Kong, China. https://doi.org/10.18653/v1/D19-1026.https://www.aclweb.org/anthology/D19-1026
Yih Wt, Chang MW, He X, Gao J (2015) Semantic parsing via staged query graph generation: Question answering with knowledge base. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing (vol 1: Long Papers), pp 1321–1331. Association for Computational Linguistics, Beijing, China. https://doi.org/10.3115/v1/P15-1128.https://www.aclweb.org/anthology/P15-1128
Zhou X, Miao Y, Wang W, Qin J (2020) A recurrent model for collective entity linking with adaptive features. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence, vol. 34, pp. 329–336
Zwicklbauer S, Seifert C, Granitzer M (2016) Robust and collective entity disambiguation through semantic embeddings. In: Proceedings of the 39th international ACM SIGIR conference, pp 425–434. ACM
Acknowledgements
This work is supported by the 2020 Catalyst: Strategic NZ-Singapore Data Science Research Programme Fund, MBIE, New Zealand.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hou, F., Wang, R., Ng, SK. et al. Exploiting anonymous entity mentions for named entity linking. Knowl Inf Syst 65, 1221–1242 (2023). https://doi.org/10.1007/s10115-022-01793-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-022-01793-3