Abstract
Entity alignment refers to discovering two entities in different knowledge bases that represent the same thing in reality. Existing methods generally only adopt TransE or TransE-like knowledge graph representation learning models, which usually assume that there are enough training triples for each entity, and entities appearing in few triples are easily misaligned. In this paper, we propose a multi-information embedding based entity alignment method (MEEA), which utilizes embedding methods based on multi-information, including triple embedding and neighbor information embedding, to obtain the vector representations of each entity, which are then used for aligning entities. In addition, we propose a weighted neighbor information encoding method to make the neighbor information based vector representation suitable for entity alignment, which measures the effects of different neighbors on entity alignment from three aspects (i.e., the mapping cardinality of the neighbor, the relation association of the neighbor, and the attention mechanism) and gives corresponding weights. Experiments are conducted on two cross-lingual knowledge bases, and the experimental results show that MEEA is able to yield a better performance compared to the state-of-the-art methods.
Similar content being viewed by others
References
Bhattacharya I, Getoor L (2007) Collective entity resolution in relational data. ACM Trans Knowl Discov Data 1(1):5–41
Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge, in: Proceedings of the 27th ACM SIGMOD International Conference on Management of Data, pp. 1247–1250
Bordes A, Usunier N, Garcia-Duran A, Weston J, Yakhnenko O (2013) Translating embeddings for modeling multi relational data, in: Proceedings of the 27th Conference on Neural Information Processing Systems, pp. 2787–2795
Chen L, Gu W, Tian X, Chen G (2019) AHAB: aligning heterogeneous knowledge bases via iterative blocking. Inf Process Manag 56(1):1–13
Chen M, Tian Y, Yang M, Zaniolo C (2017) Multilingual knowledge graph embeddings for cross-lingual knowledge alignment, in: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 1511–1517
Cochinwala M, Kurien V, Lalk G, Shasha D (2001) Efficient data reconciliation. Inf Sci 137(1):1–15
Elfeky MG, Verykios VS, Elmagarmid AK (2002) TAILOR: a record linkage tool box, in: Proceedings of the 18th IEEE International Conference on Data Engineering, pp. 17–28
El-Roby A, Aboulnaga A (2015) ALEX: automatic link exploration in linked data, in: proceedings of the 34th ACM SIGMOD international conference on Management of Data, pp. 1839-1853
Han B, Chen L, Tian X (2018) Knowledge based collection selection for distributed information retrieval. Inf Process Manag 54(1):116–128
Huang C, Zhu J, Huang X, Yang M, Fung G, Hu Q (2018) A novel approach for entity resolution in scientific documents using context graphs. Inf Sci 432(5):431–441
Jiang Y, Wang X, Zheng H (2014) A semantic similarity measure based on information distance for ontology alignment. Inf Sci 278(10):76–87
Kejriwal M, Miranker DP (2015) Semi-supervised instance matching using boosted classifiers, in: Proceedings of the 12th Extended Semantic Web Conference, pp. 388–402
Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, Mendes PN, Hellmann S, Morsey M, Kleef P, Auer S, Bizer C (2015) DBpedia: a large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web 6(2):167–195
Lacoste-Julien S, Palla K, Davies A, Kasneci G, Graepel T, Ghahramani Z (2013) Sigma: simple greedy matching for aligning large knowledge bases, in: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 572–580
Lin Y, Liu Z, Luan H, Sun M, Rao S, Liu S (2013) Modeling relation paths for representation learning of knowledge bases, in: Proceedings of the 20th Conference on Empirical Methods in Natural Language Processing, pp. 705–714
Lin Y, Liu Z, Sun M, Liu Y, Zhu X (2015) Learning entity and relation embeddings for knowledge graph completion, in: Proceedings of the 29th AAAI Conference on Artificial Intelligence, pp. 2181–2187
Lin Y, Shen S, Liu Z, Luan H, Sun M (2016) Neural relation extraction with selective attention over instances, in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 2124–2133
Nezhadi AH, Shadgar B, Osareh A (2011) Ontology alignment using machine learning techniques. Int J Comput Sci Inform Technol 3(2):139–150
Nickel M, Rosasco L, Poggio T (2016) Holographic embeddings of knowledge graphs. in: Proceedings of the 30th AAAI Conference on Artificial Intelligence, pp. 1955-1961
Niu X, Rong S, Wang H, Yu Y (2012) An effective rule miner for instance matching in a web of data, in: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 1085–1094
Porter EH, Winkler WE (1997) Approximate string comparison and its effect on an advanced record linkage system. U.S. Bureau of the Census, Technical Report pp. 190-199
Papadakis G, Alexiou G, Papastefanatos G, Koutrika G (2015) Schema-agnostic vs Schema-based configurations for blocking methods on homogeneous data, in: Proceedings of the 41st VLDB Endowment, pp. 312–323
Pujara J, Augustine E, Getoor L (2017) Sparsity and noise: where knowledge graph embeddings fall short, in: proceedings of the 22th conference on empirical methods in natural language processing, pp. 1751-1756
Raimond Y, Sutton C, Sandler MB (2008) Automatic interlinking of music datasets on the semantic web, in: Proceedings of the 17th WWW Workshop on Linked Data on the Web, pp. 1–8
Suchanek FM, Kasneci G, Weikum G (2007) Yago: a core of semantic knowledge, in: Proceedings of the 16th International Conference on World Wide Web, pp. 697–706
Song D, Heflin J (2011) Automatically generating data linkages using a domain-independent candidate selection approach, in: Proceedings of the 10th International Semantic Web Conference, pp. 649–664
Sun Z, Hu W, Li C (2017) Cross-lingual entity alignment via joint attribute-preserving embedding, in: Proceedings of the 16th International Semantic Web Conference, pp. 628–644
Sun Z, Hu W, Zhang Q, Qu Y (2018) Bootstrapping entity alignment with knowledge graph embedding, in: Proceedings of the 27th International Joint Conference on Artificial Intelligence, pp. 4396–4402
Schlichtkrull M, Kipf TN, Bloem P, Berg R, Titov I, Welling M (2018) Modeling relational data with graph convolutional networks, in: Proceedings of the 15th Extended Semantic Web Conference, pp. 1–9
Trsedya BD, Qi J, Zhang R (2019) Entity alignment between knowledge graphs using attribute embeddings, in: Proceedings of the 33rd AAAI Conference on Artificial Intelligence, pp. 1–8
Théo T, Johannes W, Sebastian R, Éric G, Guillaume B (2016) Complex embeddings for simple link prediction. in: Proceedings of the 33th International Conference on Machine Learning, pp. 2071-2080
Tang X, Chen L, Cui J, Wei B (2019) Knowledge representation learning with entity descriptions, hierarchical types, and textual relations. Inf Process Manag 56(3):809–822
Wang Z, Lv Q, Lan X, Zhang Y (2018) Cross-lingual knowledge graph alignment via graph convolutional networks, in: Proceedings of the 23th Conference on Empirical Methods in Natural Language Processing, pp. 349–357
Romadhony A, Widyantoro D, Purwarianti A (2019) Utilizing structured knowledge bases in open IE based event template extraction. Appl Intell 49:206–219
Winkler WE, Thibaudeau Y (1991) An application of the Fellegi-Sunter model of record linkage to the 1990 U.S. decennial census, Technical Report, U.S. Bureau of the Census, pp. 1–22
Wang J, Kraska T, Franklin MJ, Feng J (2012) Crowder: crowdsourcing entity resolution, in: Proceedings of the 38th VLDB Endowment, pp. 1483–1494
Wang Z, Zhang J, Feng J, Chen Z (2014) Knowledge graph embedding by translating on hyperplanes, in: Proceedings of the 28th AAAI Conference on Artificial Intelligence, 2014, pp. 1112–1119
Xie R, Liu Z, Sun M (2017) Representation learning of knowledge graphs with hierarchical types, in: Proceedings of the 25th International Joint Conference on Artificial Intelligence, pp. 2965–2971
Xiong W, Yu M, Chang S, Guo X, Wang WY (2018) One-shot relational learning for knowledge graphs, in: Proceedings of the 23th Conference on Empirical Methods in Natural Language Processing, pp. 1980-1990
Zheng W, Cheng H, Yu J, Zou L, Zhao K (2019) Interactive natural language question answering over knowledge graphs. Inf Sci 481(1):141–159
Zhuang Y, Li G, Feng J (2016) A survey on entity alignment of knowledge base. J Comput Res Develop 53(1):165–192
Zhu H, Xie R, Liu Z, Sun M (2017) Iterative entity alignment via joint knowledge embeddings, in: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 4258–4264
Lin L, Liu J, Lv Y, Guo F (2020) A similarity model based on reinforcement local maximum connected same destination structure oriented to disordered fusion of knowledge graphs. Appl Intell 50:2867–2886
Zhang Z, Chen J, Chen X, Liu H, Xiang Y, Liu B, Zheng Y (2020) An industry evaluation of embedding-based entity alignment, in: Proceedings of the 28th International Conference on Computational Linguistics: Industry Track, pp. 179–189
Acknowledgements
This work was funded by the National Key Research and Development Program of China (No. 2018YFB0505000) and the Fundamental Research Funds for the Central Universities (No.2020QNA5017).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chen, L., Tian, X., Tang, X. et al. Multi-information embedding based entity alignment. Appl Intell 51, 8896–8912 (2021). https://doi.org/10.1007/s10489-021-02400-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-02400-8