Abstract
A knowledge graph is a structured knowledge system which contains a huge amount of entities and relations. It plays an important role in the field of named entity query. DBpedia, YAGO and other English knowledge graphs provide open access to huge amounts of high-quality named entities. However, Chinese knowledge graphs are still in the development stage, and contain fewer entities. The relations between entities are not rich. A natural question is: how to use mature English knowledge graphs to query Chinese named entities, and to obtain rich relation networks. In this paper, we propose a Chinese entity query system based on English knowledge graphs. For entities we build up links between Chinese entities and English knowledge graphs. The basic idea is to build a cross-lingual entity linking model, RSVM, between Chinese and English Wikipedia. RSVM is used to build cross-lingual links between Chinese entities and English knowledge graphs. The experiments show that our approach can achieve a high precision of 82.3 % for the task of finding cross-lingual entities on a test dataset. Our experiments for the sub task of finding missing cross-lingual links show that our approach has a precision of 89.42 % with a recall of 80.47 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Adafre, S.F., de Rijke, M.: Finding similar sentences across multiple languages in wikipedia. In: Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, ECAL 2006, 3 April - 7 April 2006, Trento, Italy, pp. 62–69 (2006)
Albitar, S., Fournier, S., Espinasse, B.: An effective TF/IDF-based text-to-text semantic similarity measure for text classification. In: Benatallah, B., Bestavros, A., Manolopoulos, Y., Vakali, A., Zhang, Y. (eds.) WISE 2014, Part I. LNCS, vol. 8786, pp. 105–114. Springer, Heidelberg (2014)
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G.: DBpedia: a nucleus for a Web of open data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007)
Bagga, A., Baldwin, B.: Entity-based cross-document coreferencing using the vector space model. In: Proceedings of the Conference on 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, COLING-ACL 1998, 10–14 August, 1998, Université de Montréal, Montréal, pp. 79–85. Quebec, Canada (1998)
Bunescu, R.C., Pasca, M.: Using encyclopedic knowledge for named entity disambiguation. In: Proceedings on 11th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2006, 3–7 April, 2006, Trento, Italy (2006)
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge University Press, New York (2010)
Jiang, L., Wang, J., An, N., Wang, S., Zhan, J., Li, L.: GRAPE: a graph-based framework for disambiguating people appearances in web search. In: ICDM 2009, The Ninth IEEE International Conference on Data Mining, Miami, Florida, USA, 6–9 December 2009, pp. 199–208 (2009)
Joachims, T.: Optimizing search engines using clickthrough data. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 23–26 July, 2002, Edmonton, Alberta, Canada, pp. 133–142 (2002)
Mahdisoltani, F., Biega, J., Suchanek, F.M.: YAGO3: a knowledge base from multilingual wikipedias. In: Seventh Biennial Conference on Innovative Data Systems Research, CIDR 2015, Asilomar, CA, USA, January 4–7, 2015, Online Proceedings (2015)
Miller, G.A.: Wordnet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)
Shen, W., Wang, J., Luo, P., Wang, M.: LINDEN: linking named entities with knowledge base via semantic knowledge. In: Proceedings of the 21st World Wide Web Conference 2012, WWW 2012, Lyon, France, 16–20 April, 2012, pp. 449–458 (2012)
Sorg, P., Cimiano, P.: Enriching the crosslingual link structure of wikipedia - a classification-based approach. In: Proceedings of the Aaai Workshop on Wikipedia and Artifical Intelligence (2008)
Su, Y., Zhang, C., Cheng, W., Qian, W.: Cleqs: a cross-lingual entity query system based on knowledge graphs. In: NDBC 2015, Chengdu, China (2015)
Wang, C., Gao, M., He, X., Zhang, R.: Challenges in chinese knowledge graph construction. In: 31st IEEE International Conference on Data Engineering Workshops, ICDE Workshops 2015, Seoul, South Korea, 13–17 April, 2015, pp. 59–61 (2015)
Wang, Z., Li, J., Wang, Z., Tang, J.: Cross-lingual knowledge linking across wiki knowledge bases. In: Proceedings of the 21st World Wide Web Conference 2012, WWW 2012, Lyon, France, 16–20 April, 2012, pp. 459–468 (2012)
Wentland, W., Knopp, J., Silberer, C., Hartung, M.: Building a multilingual lexical resource for named entity disambiguation, translation and transliteration. In: Proceedings of the International Conference on Language Resources and Evaluation, LREC 2008, 26 May - 1 June 2008, Marrakech, Morocco (2008)
Witten, I.H., Milne, D.N.: An effective, low-cost measure of semantic relatedness obtained from wikipedia links. Proceedings of Aaai (2008)
Wu, W., Li, H., Wang, H., Zhu, K.Q.: Probase: a probabilistic taxonomy for text understanding. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2012, Scottsdale, AZ, USA, 20–24 May, 2012, pp. 481–492 (2012)
Acknowledgement
This work is supported by National Science Foundation of China under grant No. 61170086. The authors would also like to thank Ping An Technology (Shenzhen) Co., Ltd. for the support of this research.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Su, Y., Zhang, C., Li, J., Wang, C., Qian, W., Zhou, A. (2015). Cross-Lingual Entity Query from Large-Scale Knowledge Graphs. In: Cai, R., Chen, K., Hong, L., Yang, X., Zhang, R., Zou, L. (eds) Web Technologies and Applications. APWeb 2015. Lecture Notes in Computer Science(), vol 9461. Springer, Cham. https://doi.org/10.1007/978-3-319-28121-6_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-28121-6_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28120-9
Online ISBN: 978-3-319-28121-6
eBook Packages: Computer ScienceComputer Science (R0)