ABSTRACT
An entity on the web can be referred by numerous morphs that are always ambiguous, implicit and informal, which makes it challenging to accurately identify all the morphs corresponding to a specific entity. In this paper, we introduce a novel method based on knowledge graph, which takes advantage of both knowledge reasoning and statistic learning. First, we present a model to build a knowledge graph for the given entity. The knowledge graph integrates the fragmented knowledge on how humans create morphs. Then, the candidate morphs are generated based on the rules summarized from the knowledge graph. At last, we use a classification method to filter the useless candidates and identify the target morphs. The experiments conducted on real world dataset demonstrate efficiency of our proposed method in terms of precision and recall.
- H. Huang, Z. Wen, D. Yu, H. Ji, Y. Sun, J. Han, and H. Li. 2013. Resolving Entity Morphs in Censored Data. Meeting of the Association for Computational Linguistics. (Aug. 2013). 1083--1093.Google Scholar
- L. Chen, C. Zhang, and C. Wilson. 2013. Tweeting under pressure: analyzing trending topics and evolving word choice on sina weibo. ACM Conference on Online Social Networks. (Oct. 2013). 89--100. Google ScholarDigital Library
- B. Zhang, H. Huang, X. Pan, S. Li, C. Y. Lin, H. Ji, K. Knight, Z. Wen, Y. Sun, and J. Han. 2015. Context-aware Entity Morph Decoding. Meeting of the Association for Computational Linguistics. (Aug. 2015). 586--595.Google ScholarCross Ref
- D. Bollegala, Y. Matsuo, and M. Ishizuka. 2011. Automatic Discovery of Personal Name Aliases from the Web. IEEE Transactions on Knowledge & Data Engineering, 2011, 23(6): 831--844. Google ScholarDigital Library
- D. Bollegala, T. Honma, Y. Matsuo, and M. Ishizuka. 2008. Mining for personal name aliases on the web. International Conference on World Wide Web. (April 2008). 1107--1108. Google ScholarDigital Library
- B. Zhang, H. Huang, X. Pan, H. Ji, K. Knight, Z. Wen, Y. Sun, J. Han, and B. Yener. 2014. Be Appropriate and Funny: Automatic Entity Morph Encoding. Meeting of the Association for Computational Linguistics. (Aug. 2014). 706--711.Google Scholar
- K. S. Dave, and V. Varma. 2010. Pattern based keyword extraction for contextual advertising. ACM International Conference on Information and Knowledge Management. (Oct. 2010). 1885--1888. Google ScholarDigital Library
- C. Fellbaum, and G. Miller. 1998. WordNet:An Electronic Lexical Database. MIT Press. 1998.Google Scholar
- Z. Dong, Q. Dong, and C. Hao. 2010. HowNet and its computation of meaning. International Conference on Computational Linguistics: Demonstrations. (Aug. 2010). 53--56. Google ScholarDigital Library
- Hou, L., Li, J., Wang, Z., Tang, J., Zhang, P., and Yang, R. (2015). Newsminer: multifaceted news analysis for event search. Knowledge-Based Systems, 2015, 76: 17--29. Google ScholarDigital Library
Index Terms
- KIEM: A Knowledge Graph based Method to Identify Entity Morphs
Recommendations
AUTOMATIC ANNOTATION OF AMBIGUOUS PERSONAL NAMES ON THE WEB
Personal name disambiguation is an important task in social network extraction, evaluation and integration of ontologies, information retrieval, cross-document coreference resolution and word sense disambiguation. We propose an unsupervised method to ...
MeKG: Building a Medical Knowledge Graph by Data Mining from MEDLINE
Brain InformaticsAbstractMining data on a knowledge level can help to achieve a higher performance of a decision support system. This study built a knowledge graph based on MEDLINE that has a large number of articles in the medical domain. MEDLINE uses Medical Subject ...
Knowledge graph for TCM health preservation
We construct a knowledge graph for TCM Health Preservation, which integrates terms, databases and other resources.This knowledge graph facilitates knowledge services such as visualization, knowledge retrieval, and recommendation.This knowledge graph ...
Comments