skip to main content
10.1145/3340531.3412875acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

A Large Test Collection for Entity Aspect Linking

Published:19 October 2020Publication History

ABSTRACT

Given a text with entity links, the task of entity aspect linking is to identify which aspect of an entity is referred to in the context. For example, if a text passage mentions the entity "USA'', is USA mentioned in the context of the 2008 financial crisis, American cuisine, or else? Complementing efforts of Nanni et al (2018), we provide a large-scale test collection which is derived from Wikipedia hyperlinks in a dump from 01/01/2020. Furthermore, we offer strong baselines with results and broken-out feature sets to stimulate more research in this area.

Data, code, feature sets, runfiles and results are released under a CC-SA license and offered on our aspect linking resource web page http://www.cs.unh.edu/~dietz/eal-dataset-2020/

Skip Supplemental Material Section

Supplemental Material

3340531.3412875.mp4

mp4

7 MB

References

  1. Sebastian Arnold, Rudolf Schneider, Philippe Cudré-Mauroux, Felix A Gers, and Alexander Löser. 2019. SECTOR: A Neural Model for Coherent Topic Segmentation and Classification. Transactions of the Association for Computational Linguistics, Vol. 7 (2019), 169--184.Google ScholarGoogle ScholarCross RefCross Ref
  2. Sebastian Arnold, Betty van Aken, Paul Grundmann, Felix A Gers, and Alexander Löser. 2020. Learning Contextualized Document Representations for Healthcare Answer Retrieval. In Proceedings of The Web Conference 2020. 1332--1a343.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Niranjan Balasubramanian and Silviu Cucerzan. 2009. Automatic generation of topic pages using query-based aspect models. In Proceedings of the 18th ACM conference on Information and knowledge management. 2049--2052.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Siddhartha Banerjee and Prasenjit Mitra. 2015. Wikikreator: Improving Wikipedia Stubs Automatically. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 867--877.Google ScholarGoogle ScholarCross RefCross Ref
  5. Jeffrey Dalton, Laura Dietz, and James Allan. 2014. Entity Query Feature Expansion Using Knowledge Base Links. In Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval (Gold Coast, Queensland, Australia) (SIGIR '14). Association for Computing Machinery, New York, NY, USA, 365--374. https://doi.org/10.1145/2600428.2609628Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Laura Dietz and Jeff Dalton. 2020. Humans Optional? Automatic Large-Scale Test Collections for Entity, Passage, and Entity-Passage Retrieval. Datenbank-Spektrum (2020), 1--12.Google ScholarGoogle Scholar
  7. Laura Dietz and John Foley. 2019. TREC CAR Y3: Complex Answer Retrieval Overview.. In Proceedings of Text REtrieval Conference (TREC).Google ScholarGoogle Scholar
  8. Laura Dietz and Ben Gamari. 2020. TREC CAR 2.4: A Machine-Readable Wikipedia Dump.Google ScholarGoogle Scholar
  9. Besnik Fetahu, Katja Markert, and Avishek Anand. 2015. Automated News Suggestions for Populating Wikipedia Entity Pages. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (Melbourne, Australia) (CIKM '15). Association for Computing Machinery, New York, NY, USA, 323--332. https://doi.org/10.1145/2806416.2806531Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Xitong Liu and Hui Fang. 2015. Latent entity space: a novel retrieval approach for entity-bearing queries. Information Retrieval Journal , Vol. 18, 6 (2015), 473--503.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Mike Mintz, Steven Bills, Rion Snow, and Dan Jurafsky. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. 1003--1011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Federico Nanni, Simone Paolo Ponzetto, and Laura Dietz. 2018. Entity-aspect linking: providing fine-grained semantics of entities in context. In Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries. 49--58.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Federico Nanni, Jingyi Zhang, Ferdinand Betz, and Kiril Gashteovski. 2019. EAL: A Toolkit and Dataset for Entity-Aspect Linking. (2019).Google ScholarGoogle Scholar
  14. Ridho Reinanda, Edgar Meij, and Maarten de Rijke. 2016. Document Filtering for Long-Tail Entities. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management (Indianapolis, Indiana, USA) (CIKM '16). Association for Computing Machinery, New York, NY, USA, 771--780. https://doi.org/10.1145/2983323.2983728Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Sebastian Riedel, Limin Yao, Andrew McCallum, and Benjamin M Marlin. 2013. Relation extraction with matrix factorization and universal schemas. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 74--84.Google ScholarGoogle Scholar
  16. Christina Sauper and Regina Barzilay. 2009. Automatically generating wikipedia articles: A structure-aware approach. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1-Volume 1. Association for Computational Linguistics, 208--216.Google ScholarGoogle ScholarCross RefCross Ref
  17. Chenyan Xiong and Jamie Callan. 2015. Query expansion with freebase. In Proceedings of the 2015 international conference on the theory of information retrieval. 111--120.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A Large Test Collection for Entity Aspect Linking

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
        October 2020
        3619 pages
        ISBN:9781450368599
        DOI:10.1145/3340531

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 19 October 2020

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate1,861of8,427submissions,22%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader