research-article

A Large Test Collection for Entity Aspect Linking

Authors:
Jordan Ramsdell

University of New Hampshire, Durham, NH, USA

University of New Hampshire, Durham, NH, USA
View Profile

,
Laura Dietz

University of New Hampshire, Durham, NH, USA

University of New Hampshire, Durham, NH, USA
View Profile

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge ManagementOctober 2020Pages 3109–3116https://doi.org/10.1145/3340531.3412875

Published:19 October 2020Publication History

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management

Pages 3109–3116

ABSTRACT

Given a text with entity links, the task of entity aspect linking is to identify which aspect of an entity is referred to in the context. For example, if a text passage mentions the entity "USA'', is USA mentioned in the context of the 2008 financial crisis, American cuisine, or else? Complementing efforts of Nanni et al (2018), we provide a large-scale test collection which is derived from Wikipedia hyperlinks in a dump from 01/01/2020. Furthermore, we offer strong baselines with results and broken-out feature sets to stimulate more research in this area.

Data, code, feature sets, runfiles and results are released under a CC-SA license and offered on our aspect linking resource web page http://www.cs.unh.edu/~dietz/eal-dataset-2020/

Supplemental Material

3340531.3412875.mp4

mp4

7 MB

Download

References

Sebastian Arnold, Rudolf Schneider, Philippe Cudré-Mauroux, Felix A Gers, and Alexander Löser. 2019. SECTOR: A Neural Model for Coherent Topic Segmentation and Classification. Transactions of the Association for Computational Linguistics, Vol. 7 (2019), 169--184.Google ScholarCross Ref
Sebastian Arnold, Betty van Aken, Paul Grundmann, Felix A Gers, and Alexander Löser. 2020. Learning Contextualized Document Representations for Healthcare Answer Retrieval. In Proceedings of The Web Conference 2020. 1332--1a343.Google ScholarDigital Library
Niranjan Balasubramanian and Silviu Cucerzan. 2009. Automatic generation of topic pages using query-based aspect models. In Proceedings of the 18th ACM conference on Information and knowledge management. 2049--2052.Google ScholarDigital Library
Siddhartha Banerjee and Prasenjit Mitra. 2015. Wikikreator: Improving Wikipedia Stubs Automatically. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 867--877.Google ScholarCross Ref
Jeffrey Dalton, Laura Dietz, and James Allan. 2014. Entity Query Feature Expansion Using Knowledge Base Links. In Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval (Gold Coast, Queensland, Australia) (SIGIR '14). Association for Computing Machinery, New York, NY, USA, 365--374. https://doi.org/10.1145/2600428.2609628Google ScholarDigital Library
Laura Dietz and Jeff Dalton. 2020. Humans Optional? Automatic Large-Scale Test Collections for Entity, Passage, and Entity-Passage Retrieval. Datenbank-Spektrum (2020), 1--12.Google Scholar
Laura Dietz and John Foley. 2019. TREC CAR Y3: Complex Answer Retrieval Overview.. In Proceedings of Text REtrieval Conference (TREC).Google Scholar
Laura Dietz and Ben Gamari. 2020. TREC CAR 2.4: A Machine-Readable Wikipedia Dump.Google Scholar
Besnik Fetahu, Katja Markert, and Avishek Anand. 2015. Automated News Suggestions for Populating Wikipedia Entity Pages. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (Melbourne, Australia) (CIKM '15). Association for Computing Machinery, New York, NY, USA, 323--332. https://doi.org/10.1145/2806416.2806531Google ScholarDigital Library
Xitong Liu and Hui Fang. 2015. Latent entity space: a novel retrieval approach for entity-bearing queries. Information Retrieval Journal , Vol. 18, 6 (2015), 473--503.Google ScholarDigital Library
Mike Mintz, Steven Bills, Rion Snow, and Dan Jurafsky. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. 1003--1011.Google ScholarDigital Library
Federico Nanni, Simone Paolo Ponzetto, and Laura Dietz. 2018. Entity-aspect linking: providing fine-grained semantics of entities in context. In Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries. 49--58.Google ScholarDigital Library
Federico Nanni, Jingyi Zhang, Ferdinand Betz, and Kiril Gashteovski. 2019. EAL: A Toolkit and Dataset for Entity-Aspect Linking. (2019).Google Scholar
Ridho Reinanda, Edgar Meij, and Maarten de Rijke. 2016. Document Filtering for Long-Tail Entities. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management (Indianapolis, Indiana, USA) (CIKM '16). Association for Computing Machinery, New York, NY, USA, 771--780. https://doi.org/10.1145/2983323.2983728Google ScholarDigital Library
Sebastian Riedel, Limin Yao, Andrew McCallum, and Benjamin M Marlin. 2013. Relation extraction with matrix factorization and universal schemas. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 74--84.Google Scholar
Christina Sauper and Regina Barzilay. 2009. Automatically generating wikipedia articles: A structure-aware approach. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1-Volume 1. Association for Computational Linguistics, 208--216.Google ScholarCross Ref
Chenyan Xiong and Jamie Callan. 2015. Query expansion with freebase. In Proceedings of the 2015 international conference on the theory of information retrieval. 111--120.Google ScholarDigital Library

Index Terms

A Large Test Collection for Entity Aspect Linking
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction
2. Information systems
  1. Information retrieval
    1. Evaluation of retrieval results
      1. Test collections

Recommendations

Predicting Guiding Entities for Entity Aspect Linking
CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

Entity linking can disambiguate mentions of an entity in text. However, there are many different aspects of an entity that could be discussed but are not differentiable by entity links, for example, the entity "oyster'' in the context of "food'' or "...
Read More
Entity-Aspect Linking: Providing Fine-Grained Semantics of Entities in Context
JCDL '18: Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries

The availability of entity linking technologies provides a novel way to organize, categorize, and analyze large textual collections in digital libraries. However, in many situations a link to an entity offers only relatively coarse-grained semantic ...
Read More
Entity linking leveraging: automatically generated annotation
COLING '10: Proceedings of the 23rd International Conference on Computational Linguistics

Entity linking refers entity mentions in a document to their representations in a knowledge base (KB). In this paper, we propose to use additional information sources from Wikipedia to find more name variations for entity linking task. In addition, as ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
October 2020
3619 pages
ISBN:9781450368599
DOI:10.1145/3340531
General Chairs:
Mathieu d'Aquin
DSI, Insight, NUI Galway, Ireland
,
Stefan Dietze
GESIS, Cologne, Germany, Heinrich-Heine-University Düsseldorf, Germany, L3S Research Center, Germany
,
Program Chairs:
Claudia Hauff
TU Delft, The Netherlands
,
Edward Curry
DSI, Insight, NUI Galway, Ireland
,
Philippe Cudre Mauroux
eXascale, University of Fribourg, Switzerland
Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 19 October 2020
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
dataset
entity aspect linking
reference method
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,861of8,427submissions,22%
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 9
  Total Citations
  View Citations
- 197
  Total Downloads
- Downloads (Last 12 months)19
- Downloads (Last 6 weeks)4
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

A Large Test Collection for Entity Aspect Linking

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

Predicting Guiding Entities for Entity Aspect Linking

Entity-Aspect Linking: Providing Fine-Grained Semantics of Entities in Context

Entity linking leveraging: automatically generated annotation