DOI: 10.1145/3587828.3587830
research-article

Improving the Efficiency of Link Prediction on Handling Incomplete Knowledge Graph Using Clustering

Published: 20 June 2023

Abstract

A knowledge graph (KG) stores knowledge in the form of connected facts. Each fact in a KG is represented as a triple (subject, predicate, object) or, equivalently, (head, relation, tail). KGs are widely used in question answering, information retrieval, classification, recommender systems, and other applications. A common problem, however, is incompleteness: a KG is called incomplete if a relationship between two entities is missing. An incomplete KG can reduce the accuracy of any task that relies on it. One solution is link prediction, which aims to predict the missing relationship between two entities in a KG. A second problem is size: a KG can be large, consisting of hundreds to millions of entities and relationships, so link prediction on a large KG must also be efficient. This paper discusses link prediction using embedding to overcome the incomplete-KG problem and proposes clustering to increase the efficiency of the link prediction process. Clustering is used to group the embedding results. Once the embedding results are grouped, the scoring and loss function calculations used to predict missing links are carried out only in the groups considered appropriate. With this grouping, the link prediction process is expected to take less time because there is no need to check all the vectors in the embedding space.
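The idea of scoring only inside an appropriate group can be illustrated with a minimal sketch. The abstract does not name a clustering algorithm or a scoring function, so everything here is an assumption made for illustration: plain k-means with a deterministic farthest-first initialization, a TransE-style distance ||h + r - t|| as the score, and the hypothetical helper names `kmeans` and `predict_tail`. It is not the authors' implementation.

```python
import numpy as np

def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    d = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    return d.argmin(axis=1)

def kmeans(points, k, iters=20):
    """Plain Lloyd's k-means (assumed here; the abstract names no algorithm)."""
    # Deterministic farthest-first initialization, starting from point 0.
    chosen = [0]
    for _ in range(k - 1):
        d = np.linalg.norm(points[:, None, :] - points[chosen][None, :, :], axis=2)
        chosen.append(int(d.min(axis=1).argmax()))
    centroids = points[chosen].copy()
    for _ in range(iters):
        labels = assign(points, centroids)
        for c in range(k):
            members = points[labels == c]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                centroids[c] = members.mean(axis=0)
    return centroids, assign(points, centroids)

def predict_tail(head_vec, rel_vec, entity_vecs, centroids, labels):
    """Rank candidate tails only inside the cluster nearest to head + relation."""
    target = head_vec + rel_vec
    nearest = int(np.linalg.norm(centroids - target, axis=1).argmin())
    candidates = np.flatnonzero(labels == nearest)
    # TransE-style distance (an assumption): smaller means more plausible.
    dists = np.linalg.norm(entity_vecs[candidates] - target, axis=1)
    order = dists.argsort()
    return candidates[order], dists[order]

# Toy 2-D "embeddings": two well-separated groups of three entities each.
entity_vecs = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
                        [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]])
rel_vec = np.array([5.0, 5.0])  # hypothetical relation vector

centroids, labels = kmeans(entity_vecs, k=2)
ranked, dists = predict_tail(entity_vecs[0], rel_vec, entity_vecs, centroids, labels)
print("best tail:", ranked[0], "candidates scored:", len(ranked))
```

On this toy data only the three entities in the selected cluster are scored instead of all six, which is the source of the expected time saving. The trade-off is that a correct tail lying in a different cluster would be missed, so the choice of which groups count as appropriate matters.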

Cited By

  • (2024) Improving embedding-based link prediction performance using clustering. Journal of King Saud University - Computer and Information Sciences 36:8. DOI: 10.1016/j.jksuci.2024.102181. Online publication date: 1-Oct-2024

Published In

ICSCA '23: Proceedings of the 2023 12th International Conference on Software and Computer Applications
February 2023
385 pages
ISBN: 9781450398589
DOI: 10.1145/3587828

Publisher

Association for Computing Machinery
New York, NY, United States

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICSCA 2023
