Skip to main content
Log in

Multi-information embedding based entity alignment

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Entity alignment refers to discovering two entities in different knowledge bases that represent the same thing in reality. Existing methods generally only adopt TransE or TransE-like knowledge graph representation learning models, which usually assume that there are enough training triples for each entity, and entities appearing in few triples are easily misaligned. In this paper, we propose a multi-information embedding based entity alignment method (MEEA), which utilizes embedding methods based on multi-information, including triple embedding and neighbor information embedding, to obtain the vector representations of each entity, which are then used for aligning entities. In addition, we propose a weighted neighbor information encoding method to make the neighbor information based vector representation suitable for entity alignment, which measures the effects of different neighbors on entity alignment from three aspects (i.e., the mapping cardinality of the neighbor, the relation association of the neighbor, and the attention mechanism) and gives corresponding weights. Experiments are conducted on two cross-lingual knowledge bases, and the experimental results show that MEEA is able to yield a better performance compared to the state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Bhattacharya I, Getoor L (2007) Collective entity resolution in relational data. ACM Trans Knowl Discov Data 1(1):5–41

    Article  Google Scholar 

  2. Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge, in: Proceedings of the 27th ACM SIGMOD International Conference on Management of Data, pp. 1247–1250

  3. Bordes A, Usunier N, Garcia-Duran A, Weston J, Yakhnenko O (2013) Translating embeddings for modeling multi relational data, in: Proceedings of the 27th Conference on Neural Information Processing Systems, pp. 2787–2795

  4. Chen L, Gu W, Tian X, Chen G (2019) AHAB: aligning heterogeneous knowledge bases via iterative blocking. Inf Process Manag 56(1):1–13

    Article  Google Scholar 

  5. Chen M, Tian Y, Yang M, Zaniolo C (2017) Multilingual knowledge graph embeddings for cross-lingual knowledge alignment, in: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 1511–1517

  6. Cochinwala M, Kurien V, Lalk G, Shasha D (2001) Efficient data reconciliation. Inf Sci 137(1):1–15

    Article  Google Scholar 

  7. Elfeky MG, Verykios VS, Elmagarmid AK (2002) TAILOR: a record linkage tool box, in: Proceedings of the 18th IEEE International Conference on Data Engineering, pp. 17–28

  8. El-Roby A, Aboulnaga A (2015) ALEX: automatic link exploration in linked data, in: proceedings of the 34th ACM SIGMOD international conference on Management of Data, pp. 1839-1853

  9. Han B, Chen L, Tian X (2018) Knowledge based collection selection for distributed information retrieval. Inf Process Manag 54(1):116–128

    Article  Google Scholar 

  10. Huang C, Zhu J, Huang X, Yang M, Fung G, Hu Q (2018) A novel approach for entity resolution in scientific documents using context graphs. Inf Sci 432(5):431–441

    Article  MathSciNet  Google Scholar 

  11. Jiang Y, Wang X, Zheng H (2014) A semantic similarity measure based on information distance for ontology alignment. Inf Sci 278(10):76–87

    Article  Google Scholar 

  12. Kejriwal M, Miranker DP (2015) Semi-supervised instance matching using boosted classifiers, in: Proceedings of the 12th Extended Semantic Web Conference, pp. 388–402

  13. Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, Mendes PN, Hellmann S, Morsey M, Kleef P, Auer S, Bizer C (2015) DBpedia: a large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web 6(2):167–195

    Article  Google Scholar 

  14. Lacoste-Julien S, Palla K, Davies A, Kasneci G, Graepel T, Ghahramani Z (2013) Sigma: simple greedy matching for aligning large knowledge bases, in: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 572–580

  15. Lin Y, Liu Z, Luan H, Sun M, Rao S, Liu S (2013) Modeling relation paths for representation learning of knowledge bases, in: Proceedings of the 20th Conference on Empirical Methods in Natural Language Processing, pp. 705–714

  16. Lin Y, Liu Z, Sun M, Liu Y, Zhu X (2015) Learning entity and relation embeddings for knowledge graph completion, in: Proceedings of the 29th AAAI Conference on Artificial Intelligence, pp. 2181–2187

  17. Lin Y, Shen S, Liu Z, Luan H, Sun M (2016) Neural relation extraction with selective attention over instances, in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 2124–2133

  18. Nezhadi AH, Shadgar B, Osareh A (2011) Ontology alignment using machine learning techniques. Int J Comput Sci Inform Technol 3(2):139–150

    Google Scholar 

  19. Nickel M, Rosasco L, Poggio T (2016) Holographic embeddings of knowledge graphs. in: Proceedings of the 30th AAAI Conference on Artificial Intelligence, pp. 1955-1961

  20. Niu X, Rong S, Wang H, Yu Y (2012) An effective rule miner for instance matching in a web of data, in: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 1085–1094

  21. Porter EH, Winkler WE (1997) Approximate string comparison and its effect on an advanced record linkage system. U.S. Bureau of the Census, Technical Report pp. 190-199

    Google Scholar 

  22. Papadakis G, Alexiou G, Papastefanatos G, Koutrika G (2015) Schema-agnostic vs Schema-based configurations for blocking methods on homogeneous data, in: Proceedings of the 41st VLDB Endowment, pp. 312–323

  23. Pujara J, Augustine E, Getoor L (2017) Sparsity and noise: where knowledge graph embeddings fall short, in: proceedings of the 22th conference on empirical methods in natural language processing, pp. 1751-1756

  24. Raimond Y, Sutton C, Sandler MB (2008) Automatic interlinking of music datasets on the semantic web, in: Proceedings of the 17th WWW Workshop on Linked Data on the Web, pp. 1–8

  25. Suchanek FM, Kasneci G, Weikum G (2007) Yago: a core of semantic knowledge, in: Proceedings of the 16th International Conference on World Wide Web, pp. 697–706

  26. Song D, Heflin J (2011) Automatically generating data linkages using a domain-independent candidate selection approach, in: Proceedings of the 10th International Semantic Web Conference, pp. 649–664

  27. Sun Z, Hu W, Li C (2017) Cross-lingual entity alignment via joint attribute-preserving embedding, in: Proceedings of the 16th International Semantic Web Conference, pp. 628–644

  28. Sun Z, Hu W, Zhang Q, Qu Y (2018) Bootstrapping entity alignment with knowledge graph embedding, in: Proceedings of the 27th International Joint Conference on Artificial Intelligence, pp. 4396–4402

  29. Schlichtkrull M, Kipf TN, Bloem P, Berg R, Titov I, Welling M (2018) Modeling relational data with graph convolutional networks, in: Proceedings of the 15th Extended Semantic Web Conference, pp. 1–9

  30. Trsedya BD, Qi J, Zhang R (2019) Entity alignment between knowledge graphs using attribute embeddings, in: Proceedings of the 33rd AAAI Conference on Artificial Intelligence, pp. 1–8

  31. Théo T, Johannes W, Sebastian R, Éric G, Guillaume B (2016) Complex embeddings for simple link prediction. in: Proceedings of the 33th International Conference on Machine Learning, pp. 2071-2080

  32. Tang X, Chen L, Cui J, Wei B (2019) Knowledge representation learning with entity descriptions, hierarchical types, and textual relations. Inf Process Manag 56(3):809–822

    Article  Google Scholar 

  33. Wang Z, Lv Q, Lan X, Zhang Y (2018) Cross-lingual knowledge graph alignment via graph convolutional networks, in: Proceedings of the 23th Conference on Empirical Methods in Natural Language Processing, pp. 349–357

  34. Romadhony A, Widyantoro D, Purwarianti A (2019) Utilizing structured knowledge bases in open IE based event template extraction. Appl Intell 49:206–219

    Article  Google Scholar 

  35. Winkler WE, Thibaudeau Y (1991) An application of the Fellegi-Sunter model of record linkage to the 1990 U.S. decennial census, Technical Report, U.S. Bureau of the Census, pp. 1–22

  36. Wang J, Kraska T, Franklin MJ, Feng J (2012) Crowder: crowdsourcing entity resolution, in: Proceedings of the 38th VLDB Endowment, pp. 1483–1494

  37. Wang Z, Zhang J, Feng J, Chen Z (2014) Knowledge graph embedding by translating on hyperplanes, in: Proceedings of the 28th AAAI Conference on Artificial Intelligence, 2014, pp. 1112–1119

  38. Xie R, Liu Z, Sun M (2017) Representation learning of knowledge graphs with hierarchical types, in: Proceedings of the 25th International Joint Conference on Artificial Intelligence, pp. 2965–2971

  39. Xiong W, Yu M, Chang S, Guo X, Wang WY (2018) One-shot relational learning for knowledge graphs, in: Proceedings of the 23th Conference on Empirical Methods in Natural Language Processing, pp. 1980-1990

  40. Zheng W, Cheng H, Yu J, Zou L, Zhao K (2019) Interactive natural language question answering over knowledge graphs. Inf Sci 481(1):141–159

    Article  MathSciNet  Google Scholar 

  41. Zhuang Y, Li G, Feng J (2016) A survey on entity alignment of knowledge base. J Comput Res Develop 53(1):165–192

    Google Scholar 

  42. Zhu H, Xie R, Liu Z, Sun M (2017) Iterative entity alignment via joint knowledge embeddings, in: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 4258–4264

  43. Lin L, Liu J, Lv Y, Guo F (2020) A similarity model based on reinforcement local maximum connected same destination structure oriented to disordered fusion of knowledge graphs. Appl Intell 50:2867–2886

    Article  Google Scholar 

  44. Zhang Z, Chen J, Chen X, Liu H, Xiang Y, Liu B, Zheng Y (2020) An industry evaluation of embedding-based entity alignment, in: Proceedings of the 28th International Conference on Computational Linguistics: Industry Track, pp. 179–189

Download references

Acknowledgements

This work was funded by the National Key Research and Development Program of China (No. 2018YFB0505000) and the Fundamental Research Funds for the Central Universities (No.2020QNA5017).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ling Chen.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, L., Tian, X., Tang, X. et al. Multi-information embedding based entity alignment. Appl Intell 51, 8896–8912 (2021). https://doi.org/10.1007/s10489-021-02400-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-021-02400-8

Keywords

Navigation