Exploring Wikipedia and Text Features for Named Entity Disambiguation

  • Conference paper
Intelligent Information and Database Systems (ACIIDS 2010)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5991)

Abstract

Precisely identifying entities is essential for semantic annotation. This paper addresses the problem of named entity disambiguation that aims at mapping entity mentions in a text onto the right entities in Wikipedia. The aim of this paper is to explore and evaluate various combinations of features extracted from Wikipedia and texts for the disambiguation task, based on a statistical ranking model of candidate entities. Through experiments, we show which combinations of features are the best choices for disambiguation.
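
The abstract does not specify the ranking model or the feature sets, so the following is only a minimal sketch of the general idea of ranking candidate entities by feature overlap. It assumes bag-of-words features and cosine similarity, which may differ from what the paper actually uses. The function names, the candidate descriptions, and the example sentence are all hypothetical and stand in for features that would be extracted from Wikipedia and from the text around a mention.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Toy feature extractor: lowercase, split on whitespace, count tokens."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(context, candidates):
    """Score each candidate entity against the mention context, best first.

    `candidates` maps an entity title to a text standing in for features
    taken from that entity's Wikipedia page (an assumption of this sketch).
    """
    ctx = bag_of_words(context)
    scored = [(title, cosine(ctx, bag_of_words(desc)))
              for title, desc in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical candidates for the ambiguous mention "Georgia".
candidates = {
    "Georgia (country)": "country in the caucasus region capital tbilisi",
    "Georgia (U.S. state)": "state in the southeastern united states capital atlanta",
}
context = "The governor of Georgia spoke in Atlanta about the state budget"

ranking = rank_candidates(context, candidates)
best_entity = ranking[0][0]
print(best_entity)
```

In this toy setup the context shares more tokens with the U.S. state description, so that candidate ranks first. A real system would combine several feature types and weight them, which is the kind of comparison the paper reports on.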

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nguyen, H.T., Cao, T.H. (2010). Exploring Wikipedia and Text Features for Named Entity Disambiguation. In: Nguyen, N.T., Le, M.T., Świątek, J. (eds) Intelligent Information and Database Systems. ACIIDS 2010. Lecture Notes in Computer Science, vol 5991. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12101-2_2

  • DOI: https://doi.org/10.1007/978-3-642-12101-2_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12100-5

  • Online ISBN: 978-3-642-12101-2

  • eBook Packages: Computer Science, Computer Science (R0)
