Abstract
Recently, similar entity searching over knowledge graph (KG) has gained much attentions by researchers. However, in rich-semantic KGs with multi-typed entities and relations, also known as heterogeneous information network, relevant entity search is considered as a challenging task due to the ambiguity as well as complexity of user’s queries in realistic applications, such as QA chatbot and KG-based information retrieval. In this paper, we propose a novel approach, called W-KG2Vec which enables to automatically learn the semantic representations of entities in KG by applying the meta-path. The proposed W-KG2Vec is a meta-path-specific model which supports to evaluate both semantic relations as well as the text-based similarity between entities. The combination of text- and structure-based embedding mechanism of W-KG2Vec is promising to achieve better representations of entities in given KGs for handling complex user’s queries. To effectively learn the sequential textual representations of entities’ descriptions, we propose a combination of BERT pre-trained model with LTSM encoder, called BERT-Text2Vec. Then, the text-based similarity between entities is used to leverage our weighted meta-path-based random walk mechanism in W-KG2Vec model. Extensive experiences on real-world KGs (YAGO and Freebase) demonstrate the effectiveness of our proposed model against recent state-of-the-art KG embedding baselines.






















Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Bordes A, Usunier N, Garcia-Duran A, Weston J, Yakhnenko O (2013) Translating embeddings for modeling multi-relational data. Adv Neural Inf Process Syst 2:2787–2795
Bordes A, Weston J, Usunier N (2014) Open question answering with weakly supervised embedding models. In: Joint European conference on machine learning and knowledge discovery in databases
Bordes A, Glorot X, Weston J, Bengio Y (2012) Joint learning of words and meaning representations for open-text semantic parsing. Artif Intell Stat, pp 127–135
Bordes A, Chopra S, Weston J (2014) Question answering with subgraph embeddings. arXiv preprint https://arxiv.org/abs/1406.3676.
Bordes A, Weston J, Collobert R, Bengio Y (2011) Learning structured embeddings of knowledge bases. In: Twenty-fifth AAAI conference on artificial intelligence
Cao X, Shi C, Zheng Y, Ding J, Li X, Wu B (2018) A heterogeneous information network method for entity set expansion in knowledge graph. In Pacific-Asia conference on knowledge discovery and data mining
Conneau A, Kiela D, Schwenk H, Barrault L, Bordes A (2017) Supervised learning of universal sentence representations from natural language inference data. arXiv preprint https://arxiv.org/abs/1705.02364
Dettmers T, Minervini P, Stenetorp P, Riedel S (2017) Convolutional 2d knowledge graph embeddings. arXiv preprint https://arxiv.org/abs/1707.01476
Fang Y, Wang H, Zhao L, Yu F, Wang C (2020) Dynamic knowledge graph based fake-review detection. Appl Intell 50:4281–4295
Grover A, Leskovec J (2016) node2vec: Scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining
Han B, Chen L, Tian X (2018) Knowledge based collection selection for distributed information retrieval. Inf Process Manage 54(1):116–128
Han X, Liu Z, Sun M (2016) Joint representation learning of text and knowledge for knowledge graph completion. arXiv preprint https://arxiv.org/abs/1611.04125
He S, Liu K, Ji G, Zhao J (2015) Learning to represent knowledge graphs with Gaussian embedding. In: Proceedings of the 24th ACM international on conference on information and knowledge management
Hussein R, Yang D, Cudré-Mauroux P (2018) Are meta-paths necessary? revisiting heterogeneous graph embeddings. In: Proceedings of the 27th ACM international conference on information and knowledge management
Li B, Pi D (2020) Network representation learning: a systematic literature review. Neural Comput Appl 32:1–33
Lin X, Liang Y, Giunchiglia F, Feng X, Guan R (2019) Relation path embedding in knowledge graphs. Neural Comput Appl 31(9):5629–5639
Lin J, Zhao Y, Huang W, Liu C, Pu H (2020) Domain knowledge graph-based research progress of knowledge representation. Neural Comput Appl 33:1–10
Lin Y, Liu Z, Sun M, Liu Y, Zhu X (2015) Learning entity and relation embeddings for knowledge graph completion. In: Twenty-ninth AAAI conference on artificial intelligence
Liu W, Zhou P, Zhao Z, Wang Z, Ju Q, Deng H, Wang P (2020) K-BERT: enabling language representation with knowledge graph. In: AAAI
Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv preprint https://arxiv.org/abs/1301.3781
Nguyen DQ, Sirts K, Qu L, Johnson M (2016) Stranse: a novel embedding model of entities and relationships in knowledge bases. arXiv preprint https://arxiv.org/abs/1606.08140
Nickel M, Tresp V, Kriegel HP (2011) A three-way model for collective learning on multi-relational data. In ICML
Nie A, Bennett E, Goodman N (2019) DisSent: learning sentence representations from explicit discourse relations. In Proceedings of the 57th annual meeting of the association for computational linguistics
Pham PDP (2019) W-MetaPath2Vec: the topic-driven meta-path-based model for large-scaled content-based heterogeneous information network representation learning. Expert Syst Appl 123:328–344
Pham P, Do P (2020) W-Metagraph2Vec: a novel approval of enriched schematic topic-driven heterogeneous information network embedding. Int J Mach Learn Cybern 11:1–20
Pham P, Do P (2020) W-Com2Vec: A topic-driven meta-path-based intra-community embedding for content-based heterogeneous information network. Intell Data Anal 24(5):1207–1233
Pham P, Do P, Ta CD (2018) W-PathSim: novel approach of weighted similarity measure in content-based heterogeneous information networks by applying LDA topic modeling. In: Asian conference on intelligent information and database systems
Socher R, Chen D, Manning CD, Ng A (2013) Reasoning with neural tensor networks for knowledge base completion. In: Advances in neural information processing systems
Sun Z, Deng ZH, Nie JY, Tang J (2019) Rotate: knowledge graph embedding by relational rotation in complex space. arXiv preprint https://arxiv.org/abs/1902.10197
Sun Y, Han J, Yan X, Yu PS, Wu T (2011) Pathsim: meta path-based top-k similarity search in heterogeneous information networks. In: Proceedings of the VLDB endowment
Toutanova K, Chen D, Pantel P, Poon H, Choudhury P, Gamon M (2015) Toutanova, Representing text for joint embedding of text and knowledge bases. In: Proceedings of the 2015 conference on empirical methods in natural language processing
Wang H, Jiang S, Yu Z (2020) Modeling of complex internal logic for knowledge base completion. Appl Intell 50:3336–3349
Wang Q, Mao Z, Wang B, Guo L (2017) Knowledge graph embedding: A survey of approaches and applications. IEEE Trans Knowl Data Eng 29(12):2724–2743
Wang X, He X, Cao Y, Liu M, Chua TS (2019) Kgat: knowledge graph attention network for recommendation. In: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining
Wang Z, Zhang J, Feng J, Chen Z (2014) Knowledge graph embedding by translating on hyperplanes. In: Twenty-Eighth AAAI conference on artificial intelligence
Wang Z, Zhang J, Feng J, Chen Z (2014) Knowledge graph and text jointly embedding. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP)
Wu Y, Pan J, Lu P, Lin K, Yu Z (2017) Knowledge graph embedding translation based on constraints. J Inf Hiding Multimedia Signal Process Ubiquitous Int 8(5):1119–1131
Xiao H, Huang M, Zhu X (2016) TransG: A generative model for knowledge graph embedding. In: Proceedings of the 54th annual meeting of the association for computational linguistics (Volume 1: Long Papers)
Xu J, Chen K, Qiu X, Huang X (2016) Knowledge graph representation with jointly structural and textual encoding. arXiv preprint https://arxiv.org/abs/1611.08661
Yao L, Mao C, Luo Y (2019) KG-BERT: BERT for knowledge graph completion. arXiv preprint https://arxiv.org/abs/1909.03193
Zhang D, Yuan B, Wang D, Liu R (2015) Joint semantic relevance learning with text data and graph knowledge. In: Proceedings of the 3rd workshop on continuous vector space models and their compositionality
Zhong H, Zhang J, Wang Z, Wan H, Chen Z (2015) Aligning knowledge and text embeddings by entity descriptions. In: Proceedings of the 2015 conference on empirical methods in natural language processing
Acknowledgements
This research is funded by Vietnam National University HoChiMinh City (VNU-HCM) under grant number DS2020-26-01.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Do, P., Pham, P. W-KG2Vec: a weighted text-enhanced meta-path-based knowledge graph embedding for similarity search. Neural Comput & Applic 33, 16533–16555 (2021). https://doi.org/10.1007/s00521-021-06252-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-06252-8