ABSTRACT
Named Entity Disambiguation is the task of disambiguating named entity mentions in unstructured text and linking them to their corresponding entries in a large knowledge base such as Freebase. In practice, each text match in a given document should be mapped to the correct entity among the candidate entities in the knowledge base, or to none of them if no correct entity is found (an Empty Entry). Handling empty entries makes the problem more complex, but solving it makes it possible to cope with missing and erroneous data as well as unknown entities. In this work we present AOL's Named Entity Resolver, which was designed to handle real-life scenarios, including empty entries. As part of the automated news analysis platform, it processes over 500K news articles a day; entities from each article are extracted and disambiguated. According to our experiments, AOL's resolver shows much better results in disambiguating entities mapped to Wikipedia or Freebase than industry-leading products.
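The task definition above, including the Empty Entry case, can be illustrated with a minimal sketch. This is not AOL's resolver: the function name `resolve_mention`, the `min_score` threshold, and the idea of a single numeric score per candidate are all assumptions made only to show what "map to one candidate or to none" means.

```python
from typing import Dict, Optional


def resolve_mention(candidates: Dict[str, float],
                    min_score: float = 0.5) -> Optional[str]:
    """Pick one knowledge-base entity for a mention, or None (Empty Entry).

    `candidates` maps hypothetical entity ids to hypothetical confidence
    scores; how such scores would be computed is outside this sketch.
    """
    if not candidates:
        # No knowledge-base entry matches the mention text at all.
        return None
    best_id = max(candidates, key=lambda entity_id: candidates[entity_id])
    if candidates[best_id] < min_score:
        # Even the best candidate is too weak: treat the mention as an
        # unknown entity instead of forcing a wrong link.
        return None
    return best_id
```

Returning `None` rather than always taking the top candidate is what lets a system of this kind tolerate missing knowledge-base entries and unknown entities, at the cost of having to choose a rejection criterion.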
Index Terms
- AOL's Named Entity Resolver: Solving Disambiguation via Document Strongly Connected Components and Ad-Hoc Edges Construction