Abstract
Precisely identifying entities is essential for semantic annotation. This paper addresses the problem of named entity disambiguation that aims at mapping entity mentions in a text onto the right entities in Wikipedia. The aim of this paper is to explore and evaluate various combinations of features extracted from Wikipedia and texts for the disambiguation task, based on a statistical ranking model of candidate entities. Through experiments, we show which combinations of features are the best choices for disambiguation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bunescu, R., Paşca, M.: Using encyclopedic knowledge for named entity disambiguation. In: Proc. of the 11th Conference of EACL, pp. 9–16 (2006)
Bontcheva, K., et al.: Shallow methods for named entity coreference resolution. In: Proc. of TALN 2002 Workshop (2002)
Cucerzan, S.: Large-scale named entity disambiguation based on Wikipedia data. In: Proc. of EMNLP-CoNLL Joint Conference (2007)
Cohen, W., Ravikumar, P., Fienberg, S.: A comparison of string metrics for name-matching tasks. In: IJCAI-03 II-Web Workshop (2003)
Cunningham, H., et al.: GATE: A framework and graphical development environment for robust NLP tools and applications. In: Proc. of ACL 2002 (2002)
Gooi, C.H., Allan, J.: Cross-document coreference on a large-scale corpus. In: Proc. of HLT/NAACL 2004 (2004)
Mihalcea, R.: Using Wikipedia for automatic word sense disambiguation. In: Proc. of HLT/NAACL 2007 (2007)
Mihalcea, R., Csomai, A.: Wikify!: Linking documents to encyclopedic knowledge. In: Proc. of CIKM 2007, pp. 233–242 (2007)
Medelyan, O., et al.: Mining meaning from Wikipedia. International Journal of Human-Computer Studies 67(9), 716–754 (2009)
Medelyan, O., et al.: Topic indexing with Wikipedia. In: Proc. of WIKIAI 2008 (2008)
Milne, D., Witten, I.H.: Learning to link with Wikipedia. In: Proc. of CIKM 2008, pp. 509–518 (2008)
Overell, S., Rüger, S.: Using co-occurrence models for placename disambiguation. The IJGIS. Taylor and Francis, Abington (2008)
Nguyen, H.T., Cao, T.H.: A Knowledge-based approach to named entity disambiguation in news articles. In: Orgun, M.A., Thornton, J. (eds.) AI 2007. LNCS (LNAI), vol. 4830, pp. 619–624. Springer, Heidelberg (2007)
Chen, Y., Martin, J.: Towards robust unsupervised personal name disambiguation. In: Proc. of EMNLP-CoNLL Joint Conference (2007)
Zesch, T., Gurevych, I., Mühlhäuser, M.: Analyzing and accessing Wikipedia as a lexical semantic resource. In: Rehm, G., Witt, A., Lemnitzer, L. (eds.) Data Structures for Linguistic Resources and Applications, pp. 197–205 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nguyen, H.T., Cao, T.H. (2010). Exploring Wikipedia and Text Features for Named Entity Disambiguation. In: Nguyen, N.T., Le, M.T., ÅšwiÄ…tek, J. (eds) Intelligent Information and Database Systems. ACIIDS 2010. Lecture Notes in Computer Science(), vol 5991. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12101-2_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-12101-2_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12100-5
Online ISBN: 978-3-642-12101-2
eBook Packages: Computer ScienceComputer Science (R0)