ABSTRACT
We consider the problem of entity tagging: given one or more named entities from a specific domain, the goal is to automatically associate descriptive phrases, referred to as etags (entity tags), to each entity. Consider a product catalog containing product names and possibly short descriptions. For a product in the catalog, say Ricoh G600 Digital Camera, we want to associate etags such as "water resistant", "rugged" and "outdoor" to it, even though its name or description does not mention those phrases. Entity tagging can enable more effective search over entities. We propose to leverage signals in web documents to perform such tagging. We develop techniques to perform such tagging in a domain independent manner while ensuring high precision and high recall.
- S. Agrawal et al. Exploiting web search engines to search structured databases. In WWW, 2009. Google ScholarDigital Library
- R. Kumar and A. Tomkins. A characterization of online search behavior. Data Engineering Bulletin, 2009Google Scholar
Index Terms
- EntityTagger: automatically tagging entities with descriptive phrases
Recommendations
A Relation Proposal Network for End-to-End Information Extraction
Natural Language Processing and Chinese ComputingAbstractInformation extraction is an important task in natural language processing. In this paper, we introduce our solution on NLPCC 2019 shared task 3 Information Extraction which has provided with the largest industry Schema based Knowledge Extraction (...
Comparison of Methods to Annotate Named Entity Corpora
The authors compared two methods for annotating a corpus for the named entity (NE) recognition task using non-expert annotators: (i) revising the results of an existing NE recognizer and (ii) manually annotating the NEs completely. The annotation time, ...
Web personal name disambiguation based on reference entity tables mined from the web
WIDM '09: Proceedings of the eleventh international workshop on Web information and data managementAmbiguous personal names are common on the Web, which pose a challenge for many different tasks. The traditional disambiguation employs the clustering methods. However, without reference entity tables, the clustering method can only identify whether two ...
Comments