EDIUM: Improving Entity Disambiguation via User Modeling

Bansal, Romil; Panem, Sandeep; Gupta, Manish; Varma, Vasudeva

doi:10.1007/978-3-319-06028-6_35

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8416)

Abstract

Entity Disambiguation is the task of associating entity name mentions in text with the correct referent entities in a knowledge base, with the goal of understanding and extracting useful information from the document. Entity disambiguation is a critical component of systems designed to harness information shared by users on microblogging sites like Twitter. However, noise and lack of context in tweets make disambiguation a difficult task. In this paper, we describe an Entity Disambiguation system, EDIUM, which uses User interest Models to disambiguate the entities in the user’s tweets. Our system jointly models the user’s interest scores and the context disambiguation scores, thus compensating for the sparse context in a given user’s tweets. We evaluated the system’s entity linking capabilities on tweets from multiple users and showed that improvement can be achieved by combining the user models and the context-based models.
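
To illustrate the idea of combining a user interest score with a context disambiguation score, the following is a minimal sketch only. The abstract does not state how EDIUM combines the two scores, so the linear interpolation, the weight `alpha`, the function names, and the example scores below are all assumptions made for illustration, not the authors' method.

```python
from typing import Dict


def combine_scores(
    context_scores: Dict[str, float],
    interest_scores: Dict[str, float],
    alpha: float = 0.5,
) -> Dict[str, float]:
    """Blend per-candidate context and user-interest scores.

    ASSUMPTION: a linear interpolation with weight `alpha` on the context
    score. The paper's abstract does not specify the actual combination.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return {
        entity: alpha * ctx + (1.0 - alpha) * interest_scores.get(entity, 0.0)
        for entity, ctx in context_scores.items()
    }


def disambiguate(
    context_scores: Dict[str, float],
    interest_scores: Dict[str, float],
    alpha: float = 0.5,
) -> str:
    """Return the candidate entity with the highest combined score."""
    combined = combine_scores(context_scores, interest_scores, alpha)
    return max(combined, key=combined.get)


# Hypothetical scores for the mention "Apple" in a short tweet:
# the sparse context barely separates the two candidates, while the
# (made-up) user interest model strongly favours the company.
context = {"Apple_Inc.": 0.48, "Apple_(fruit)": 0.52}
interest = {"Apple_Inc.": 0.90, "Apple_(fruit)": 0.10}

best_joint = disambiguate(context, interest, alpha=0.5)
best_context_only = disambiguate(context, interest, alpha=1.0)
```

With these made-up numbers, the context-only setting (`alpha=1.0`) picks the fruit, while the blended setting picks the company, which is the kind of compensation for sparse context that the abstract describes.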

Author information

Authors and Affiliations

International Institute of Information Technology, Hyderabad, India
Romil Bansal, Sandeep Panem, Manish Gupta & Vasudeva Varma

Editor information

Editors and Affiliations

University of Amsterdam, Amsterdam, The Netherlands
Maarten de Rijke & Tom Kenter
Centrum Wiskunde en Informatica, Amsterdam, The Netherlands and Delft University of Technology, Delft, The Netherlands
Arjen P. de Vries
University of Illinois at Urbana-Champaign, Urbana, IL, USA
ChengXiang Zhai
University of Twente, Twente, The Netherlands and Erasmus University Rotterdam, Rotterdam, The Netherlands
Franciska de Jong
SalesPredict, Haifa, Israel
Kira Radinsky
Microsoft Research, Cambridge, UK
Katja Hofmann

About this paper

Cite this paper

Bansal, R., Panem, S., Gupta, M., Varma, V. (2014). EDIUM: Improving Entity Disambiguation via User Modeling. In: de Rijke, M., et al. Advances in Information Retrieval. ECIR 2014. Lecture Notes in Computer Science, vol 8416. Springer, Cham. https://doi.org/10.1007/978-3-319-06028-6_35

DOI: https://doi.org/10.1007/978-3-319-06028-6_35
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06027-9
Online ISBN: 978-3-319-06028-6
eBook Packages: Computer Science, Computer Science (R0)