Skip to main content

EDIUM: Improving Entity Disambiguation via User Modeling

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8416))

Abstract

Entity Disambiguation is the task of associating entity name mentions in text to the correct referent entities in the knowledge base, with the goal of understanding and extracting useful information from the document. Entity disambiguation is a critical component of systems designed to harness information shared by users on microblogging sites like Twitter. However, noise and lack of context in tweets makes disambiguation a difficult task. In this paper, we describe an Entity Disambiguation system, EDIUM, which uses User interest Models to disambiguate the entities in the user’s tweets. Our system jointly models the user’s interest scores and the context disambiguation scores, thus compensating the sparse context in the tweets for a given user. We evaluated the system’s entity linking capabilities on tweets from multiple users and showed that improvement can be achieved by combining the user models and the context based models.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Murnane, E.L., Haslhofer, B., Lagoze, C.: RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short Text. In: Proc. of the 22nd Intl. Conf. on World Wide Web (WWW), Republic and Canton of Geneva, Switzerland, pp. 81–82 (2013)

    Google Scholar 

  2. Shen, W., Wang, J., Luo, P., Wang, M.: Linking Named Entities in Tweets with Knowledge Base via User Interest Modeling. In: Proc. of the 19th ACM Conf. on Knowledge Discovery and Data Mining (KDD), pp. 68–76. ACM, New York (2013)

    Google Scholar 

  3. Yerva, S.R., Catasta, M., Demartini, G., Aberer, K.: Entity Disambiguation in Tweets Leveraging User Social Profiles. In: Proc. of the 2013 Intl. Conf. on Information Reuse and Integration (IRI), pp. 120–128. IEEE (2013)

    Google Scholar 

  4. Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBpedia Spotlight: Shedding Light on the Web of Documents. In: Proc. of the 7th Intl. Conf. on Semantic Systems, pp. 1–8. ACM, New York (2011)

    Google Scholar 

  5. Milne, D., Witten, I.H.: An Open-source Toolkit for Mining Wikipedia. Artificial Intelligence 194, 222–239 (2013)

    Article  MathSciNet  Google Scholar 

  6. Meij, E., Weerkamp, W., de Rijke, M.: Adding Semantics to Microblog Posts. In: Proc. of the 5th ACM Intl. Conf. on Web Search and Data Mining (WSDM), pp. 563–572. ACM, New York (2012)

    Chapter  Google Scholar 

  7. Qureshi, M.A., O’Riordan, C., Pasi, G.: Short-text Domain Specific Key Terms/Phrases Extraction Using an N-gram Model with Wikipedia. In: Proc. of the 21st ACM Conf. on Information and Knowledge Management (CIKM), pp. 2515–2518. ACM, New York (2012)

    Google Scholar 

  8. Michelson, M., Macskassy, S.A.: Discovering Users’ Topics of Interest on Twitter: A First Look. In: Proc. of the 4th Workshop on Analytics for Noisy Unstructured Text Data, pp. 73–80. ACM (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Bansal, R., Panem, S., Gupta, M., Varma, V. (2014). EDIUM: Improving Entity Disambiguation via User Modeling. In: de Rijke, M., et al. Advances in Information Retrieval. ECIR 2014. Lecture Notes in Computer Science, vol 8416. Springer, Cham. https://doi.org/10.1007/978-3-319-06028-6_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-06028-6_35

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-06027-9

  • Online ISBN: 978-3-319-06028-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics