Abstract
Facing lots of name mentions appearing on the web, entity linking is essential for many information processing applications. To improve linking accuracy, the relations between entities are usually considered in the linking process. This kind of method is called collective entity linking and can obtain high-quality results. There are two kinds of information helpful to reveal the relations between entities, i.e., contextual information and structural information of entities. Most traditional collective entity linking methods consider them separately. In fact, these two kinds of information represent entities from specific and diverse views and can enhance each other, respectively. Besides, if we look into each view closely, it can be separated into sub-views that are more meaningful. For this reason, this article proposes a multi-view–based collective entity linking algorithm, which combines several views of entities into an objective function for entity linking. The importance of each view can be valued and the linking results can be obtained along with resolving this objective function. Experimental results demonstrate that our linking algorithm can acquire higher accuracy than many state-of-the-art entity linking methods. Besides, since we simplify the entity's structure and change the entity linking to a sub-matrix searching problem, our algorithm also obtains high efficiency.
- O. Benjelloun, H. Garcia-Molina, D. Menestrina, Q. Su, S. E. Whang, and J. Widom. 2009. Swoosh: A generic approach to entity resolution. Int. J. Very Large Data Bases 18, 1 (2009), 255--276. Google ScholarDigital Library
- R. Blanco, G. Ottaviano, and E. Meij. 2015. Fast and space-efficient entity linking in queries. In Proceedings of the 8th ACM International Conference on Web Search and Data Mining. ACM, Shanghai, China, 179--188. Google ScholarDigital Library
- D. Burdick, R. Fagin, P. G. Kolaitis, L. Popa, and W.-C. Tan. 2016. A declarative framework for linking entities. ACM Trans. Database Syst. 41, 3 (2016), 17. Google ScholarDigital Library
- D. Ceccarelli, C. Lucchese, S. Orlando, R. Perego, and S. Trani. 2013. Learning relatedness measures for entity linking. In Proceedings of the 22nd ACM International Conference on Information and Knowledge Management. ACM, New York, NY, 139--148. Google ScholarDigital Library
- X. Dong, A. Y. Halevy, and J. Madhavan. 2005. Reference reconciliation in complex information spaces. In Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data. ACM, New York, 85--96. Google ScholarDigital Library
- J. Daiber, M. Jakob, C. Hokamp, and P. N. Mendes. 2013. Improving efficiency and accuracy in multilingual entity extraction. In Proceedings of the 9th International Conference on Semantic Systems. ACM, New York, 121--124. Google ScholarDigital Library
- G. David, T. Manos, W. Wouter, M. Edgar, and R. D. Maarten. 2016. Dynamic collective entity representations for entity ranking. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining. ACM, New York, 595--604. Google ScholarDigital Library
- M. Erwan, Y. Francois, and C. Olivier. 2008. Robust similarity measures for named entities matching. In Proceedings of the 22nd International Conference on Computational Linguistics. ACL, Manchester, UK, 593--600. Google ScholarDigital Library
- Z. Guo and D. Barbosa. 2014. Robust entity linking via random walks. In Proceedings of the 23rd ACM International Conference on Information and Knowledge Management. ACM, 499--508. Google ScholarDigital Library
- Z. Guo and D. Barbosa. 2014. Entity linking with a unified semantic representation. In Proceedings of the 23rd International Conference on World Wide Web. ACM, New York, 1305--1310. Google ScholarDigital Library
- D. Graus, M. Tsagkias, W. Weerkamp, E. Meij, and M. D. Rijke. 2016. Dynamic collective entity representations for entity ranking. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining. ACM, New York, 595--604. Google ScholarDigital Library
- B. Hachey, W. Radford, and J. R. Curran. 2011. Graph-based named entity linking with Wikipedia. In Proceedings of the 12th International Conference on Web Information System Engineering. Springer, Berlin, Germany, 213--226. Google ScholarDigital Library
- X. P. Han, L. Sun, and J. Zhao. 2011. Collective entity linking in web text: A graph-based method. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, 765--774. Google ScholarDigital Library
- J. Hoffart, M. A. Yosef, I. Bordino, H. Fürstenau, M. Pinkal, M. Spaniol, B. Taneva, S. Thater, and G. Weikum. 2011. Robust disambiguation of named entities in text. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. ACL, Edinburgh, Scotland, UK, 782--792. Google ScholarDigital Library
- J. Hoffart, S. Seufert, D. B. Nguyen, M. Theobald, and G. Weikum. 2012. KORE: Keyphrase overlap relatedness for entity disambiguation. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management. ACM, New York, 545--554. Google ScholarDigital Library
- B. Hachey, W. Radford, J. Nothman, M. Honnibal, and J. R. Curran. 2013. Evaluating entity linking with Wikipedia. Artificial Intelligence 194, 1 (2013), 130--150. Google ScholarDigital Library
- Z. Y. He, S. J. Liu, M. Li, M. Zhou, L. K. Zhang, and H. F. Wang. 2013. Learning entity representation for entity disambiguation. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. ACL, Stroudsburg, PA, 30--34.Google Scholar
- Z. Y. He, S. J. Liu, Y. Song, M. Li, M. Zhou, and H. F. Wang. 2013. Efficient collective entity linking with stacking. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. ACL, Stroudsburg, PA, 426--435.Google Scholar
- J. Hoffart, Y. Altun, and G. Weikum. 2014. Discovering emerging entities with ambiguous names. In Proceedings of the 23rd International Conference on World Wide Web. ACM, New York, 385--396. Google ScholarDigital Library
- H. Ji, J. Nothman, and B. Hachey. 2014. Overview of TAC-KBP 2014 entity discovery and linking tasks. In Proceedings of the Text Analysis Conference. NIST, Gaithersburg, MD, 1--15.Google Scholar
- S. Kulkarni, A. Singh, G. Ramakrishnan, and S. Chakrabarti. 2009. Collective annotation of Wikipedia entities in web text. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, 457--466. Google ScholarDigital Library
- Y. Li, D. McLean, Z. A. Bandar, J. D. O'Shea, and K. Crockett. 2006. Sentence similarity based on semantic nets and corpus statistics. IEEE Trans. Knowl. Data Eng. 18, 8 (2006), 1138--1150. Google ScholarDigital Library
- G. Luo, X. J. Huang, C. Y. Lin, and Z. Q. Nie. 2015. Joint entity recognition and disambiguation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. ACL, 879--888.Google Scholar
- E. Meij, K. Balog, and D. Odijk. 2013. Entity linking and retrieval. In Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, 1127--1127. Google ScholarDigital Library
- A. Moro, A. Raganato, R. Navigli, and A. Raganato. 2014. Entity linking meets word sense disambiguation: A unified approach. Trans. Assoc. Comput. Ling. 2, 1 (2014), 231--244.Google ScholarCross Ref
- F. L. Matthew, D. Greg, and K. Dan. 2016. Capturing semantic similarity for entity linking with convolutional neural networks. In Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. ACL, San Diego, CA, 1256--1261.Google Scholar
- A.-C. N. Ngomo and S. Auer. 2011. LIMES: A time-efficient approach for large-scale link discovery on the web of data. In Proceedings of the 22nd International Joint Conference on Artificial Intelligence. AAAI, Palo Alto, CA. 2312--2317. Google ScholarDigital Library
- Z. Q. Nie, J. R. Wen, and W. Y. Ma. 2012. Statistical entity extraction from the web. Proceedings of the IEEE 100, 9 (2012), 2675--2687.Google ScholarCross Ref
- E. S. Ristad and P. N. Yianilos. 1998. Learning string-edit distance. IEEE Trans. Pattern Anal. Mach. Intell. 20, 5 (1998), 522--532. Google ScholarDigital Library
- B. Steffen and S. Tobias. 2004. Multi-view clustering. In Proceedings of the 4th IEEE International Conference on Data Mining. IEEE, 19--26.Google Scholar
- P. Sen. 2012. Collective context-aware topic models for entity disambiguation. In Proceedings of the 21st International Conference on World Wide Web. ACM, New York, 729--738. Google ScholarDigital Library
- W. Shen, J. W. Han, and J. Y. Wang. 2014. A probabilistic model for linking named entities in web text with heterogeneous information networks. In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data. ACM, New York, 1199--1210. Google ScholarDigital Library
- Y. M. Sun, L. Lin, D. Y. Tang, N. Yang, Z. Z. Ji, and X. L. Wang. 2015. Modeling mention, context and entity with neural networks for entity disambiguation. In Proceedings of the 24th International Joint Conference on Artificial Intelligence. AAAI, 1333--1339. Google ScholarDigital Library
- W. Shen, J. Y. Wang, and J. W. Han. 2015. Entity linking with a knowledge base: Issues, techniques, and solutions. IEEE Trans. Knowl. Data Eng. 27, 2 (2015), 443--460.Google ScholarCross Ref
- R. Usbeck, A.-C. N. Ngomo, M. Röder, D. Gerber, S. A. Coelho, S. Auer, and A. Both. 2014. AGDISTIS-Agnostic disambiguation of named entities using linked open data. In Proceedings of the 21st European Conference on Artificial Intelligence. AAAI, 1113--1114. Google ScholarDigital Library
- J. Volz, C. Bizer, M. Gaedke, and G. Kobilarov. 2009. Discovering and maintaining links on the web of data. In Proceedings of the 8th International Semantic Web Conference. SWSA, Karlsruhe, Germany, 650--665. Google ScholarDigital Library
- Q. Wang, M. Y. Cui, and H. Z. Liang. 2016. Semantic-aware blocking for entity resolution. IEEE Trans. Knowl. Data Eng. 28, 1 (2016), 166--180. Google ScholarDigital Library
- Z. C. Zheng, F. T. Li, M. L. Huang, and X. Y. Zhu. 2010. Learning to link entities with knowledge base. In Proceedings of the 2010 Annual Conference of the North American Chapter of the ACL. ACL, Stroudsburg, PA, 483--491. Google ScholarDigital Library
- L. Zhang, Y. Dong, and A. Rettinger. 2015. Towards entity correctness, completeness and emergence for entity recognition. In Proceedings of the 24th International Conference on World Wide Web. ACM, New York, 143--144. Google ScholarDigital Library
- S. Zwicklbauer, C. Seifert, and M. Granitzer. 2016. Robust and collective entity disambiguation through semantic embeddings. In Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, 425--434. Google ScholarDigital Library
Index Terms
- A Multi-View–Based Collective Entity Linking Method
Recommendations
Collective entity linking in web text: a graph-based method
SIGIR '11: Proceedings of the 34th international ACM SIGIR conference on Research and development in Information RetrievalEntity Linking (EL) is the task of linking name mentions in Web text with their referent entities in a knowledge base. Traditional EL methods usually link name mentions in a document by assuming them to be independent. However, there is often additional ...
Entity Difference Modeling Based Entity Linking for Question Answering over Knowledge Graphs
Natural Language Processing and Chinese ComputingAbstractEntity linking plays a vital role in Question Answering over Knowledge Graphs (KGQA), and the representation of entities is a fundamental component of entity linking for user questions. In order to alleviate the problem of entity descriptions that ...
Entity linking and retrieval
SIGIR '13: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrievalThis full-day tutorial presents a comprehensive introduction to entity linking and retrieval. Part I provides a detailed overview of entity linking: identifying and disambiguating entity occurrences in unstructured text. Part II focuses on entity ...
Comments