DOI: 10.1145/3068839.3068847

"Tell me more" using Ladders in Wikipedia

Published: 14 May 2017

Abstract

We focus on the problem of providing "tell me more" information related to a given fact in Wikipedia. We use the novel notion of a role to link information in an infobox with different places in the text of the same Wikipedia page (space), as well as with information across different revisions of the page (time). In this way, it is possible to link together pieces of information that may not represent the same real-world entity, yet have served in the same role. To achieve this, we introduce a novel structure called a ladder that allows such spatial and temporal linking, and we show how to construct such structures effectively and efficiently from Wikipedia data.
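
As an illustration only, the idea of a ladder can be sketched as a small data structure keyed by role. The abstract does not specify the actual representation or the construction algorithm, so everything below is an assumption: the names `Mention`, `Ladder`, `spatial_links`, and `temporal_links`, the location strings, and the placeholder values are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Mention:
    """One occurrence of a value on a page (hypothetical representation)."""
    revision: int   # index of the page revision (time)
    location: str   # "infobox" or a made-up text position such as "text:para2" (space)
    value: str      # the value observed at that place


@dataclass
class Ladder:
    """Groups mentions that served in the same role, one rung per revision."""
    role: str
    rungs: dict = field(default_factory=lambda: defaultdict(list))

    def add(self, mention: Mention) -> None:
        self.rungs[mention.revision].append(mention)

    def spatial_links(self, revision: int) -> list:
        """Mentions linked within a single revision (same page, different places)."""
        return list(self.rungs.get(revision, []))

    def temporal_links(self) -> list:
        """Distinct values that held the role, ordered by first revision seen."""
        seen, history = set(), []
        for rev in sorted(self.rungs):
            for mention in self.rungs[rev]:
                if mention.value not in seen:
                    seen.add(mention.value)
                    history.append((rev, mention.value))
        return history


# Placeholder data, not real Wikipedia content.
ladder = Ladder(role="ceo")
ladder.add(Mention(1, "infobox", "Alice Example"))
ladder.add(Mention(1, "text:para2", "Alice Example"))
ladder.add(Mention(2, "infobox", "Bob Example"))

print(ladder.spatial_links(1))
print(ladder.temporal_links())
```

In this sketch the two placeholder values name different entities, yet both appear in `temporal_links()` because they filled the same role, which is the kind of linking the abstract describes. How mentions are actually matched to roles is left out here, since the abstract gives no details on it.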

Published In

WebDB'17: Proceedings of the 20th International Workshop on the Web and Databases
May 2017
52 pages
ISBN:9781450349833
DOI:10.1145/3068839

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

SIGMOD/PODS'17

Acceptance Rates

WebDB'17 Paper Acceptance Rate 7 of 21 submissions, 33%;
Overall Acceptance Rate 30 of 100 submissions, 30%
