Adaptive knowledge subgraph ensemble for robust and trustworthy knowledge graph completion

Wan, Guojia; Du, Bo; Pan, Shirui; Wu, Jia

doi:10.1007/s11280-019-00711-y

Adaptive knowledge subgraph ensemble for robust and trustworthy knowledge graph completion

Published: 15 October 2019

Volume 23, pages 471–490, (2020)
Cite this article

World Wide Web Aims and scope Submit manuscript

Guojia Wan¹,
Bo Du¹,
Shirui Pan² &
…
Jia Wu³

1600 Accesses
14 Citations
3 Altmetric
Explore all metrics

Abstract

Knowledge graph (KG) embedding approaches are widely used to infer underlying missing facts based on intrinsic structure information. However, the presence of noisy facts in automatically extracted or crowdsourcing KGs significantly reduces the reliability of various embedding learners. In this paper, we thoroughly study the underlying reasons for the performance drop in dealing with noisy knowledge graphs, and we propose an ensemble framework, Adaptive Knowledge Subgraph Ensemble (AKSE), to enhance the robustness and trust of knowledge graph completion. By employing an effective knowledge subgraph extraction approach to re-sample the sub-components from the original knowledge graph, AKSE generates different representations for learning diversified base learners (e.g., TransE and DistMult), which substantially alleviates the noise effect of KG embedding. All embedding learners are integrated into a unified framework to reduce generalization errors via our simple or adaptive weighting schemes, where the weight is allocated based on each individual learner’s prediction capacity. Experimental results show that the robustness of our ensemble framework outperforms exiting knowledge graph embedding approaches on manually injected noise as well as inherent noisy extracted KGs.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

What Affects the Performance of Models? Sensitivity Analysis of Knowledge Graph Embedding

GCE: Global Contextual Information for Knowledge Graph Embedding

Start Small, Think Big: On Hyperparameter Optimization for Large-Scale Knowledge Graph Embeddings

Notes

References

Bollacker, K.D., Evans, C., Paritosh, P., Sturge, T., Taylor, J.: Freebase: a collaboratively created graph database for structuring human knowledge. In: SIGMOD, pp. 1247–1250 (2008)
Bordes, A., Usunier, N., García-durán, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: NIPS, pp. 2787–2795 (2013)
Breiman, L.: Bagging predictors. Mach. Learn. 24(2), 123–140 (1996)
MATH Google Scholar
Dettmers, T., Minervini, P., Stenetorp, P., Riedel, S.: Convolutional 2D knowledge graph embeddings. In: AAAI, pp. 1811–1818 (2018)
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)
Article MathSciNet MATH Google Scholar
Fumera, G., Roli, F., Serrau, A.: A theoretical analysis of bagging as a linear combination of classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 30(7), 1293–1299 (2008)
Article Google Scholar
Galar, M., Fernández, A., Tartas, E.B., Sola, H.B., Herrera, F.: A review on ensembles for the class imbalance problem: Bagging-, boosting-, and hybrid-based approaches. IEEE Trans. Syst. Man Cybern. Part C 42(4), 463–484 (2012)
Article Google Scholar
Han, X., Cao, S., Lv, X., Lin, Y., Liu, Z., Sun, M., Li, J.: Openke: an open toolkit for knowledge embedding. In: EMNLP, pp. 139–144 (2018)
Kadlec, R., Bajgar, O., Kleindienst, J.: Knowledge base completion: baselines strike back. In: Rep4NLP@ACL, pp. 69–74 (2017)
Kolomiyets, O., Moens, M.F.: A survey on question answering technology from an information retrieval perspective. Inform. Sci. 181(24), 5412–5434 (2011)
Article MathSciNet Google Scholar
Lin, Y., Liu, Z., Sun, M., Liu, Y., Zhu, X.: Learning entity and relation embeddings for knowledge graph completion. In: AAAI, pp. 2181–2187 (2015)
Liu, A., Wang, W., Shang, S., Li, Q., Zhang, X.: Efficient task assignment in spatial crowdsourcing with worker and task privacy protection. GeoInformatica 22 (2), 335–362 (2018)
Article Google Scholar
Liu, G., Liu, Y., Liu, A., Li, Z., Zheng, K., Wang, Y., Zhou, X.: Context-aware trust network extraction in large-scale trust-oriented social networks. World Wide Web 21(3), 713–738 (2018)
Article Google Scholar
Liu, G., Liu, Y., Zheng, K., Liu, A., Li, Z., Wang, Y., Zhou, X.: Mcs-gpm: multi-constrained simulation based graph pattern matching in contextual social graphs. IEEE Trans. Knowl. Data Eng. 30(6), 1050–1064 (2017)
Article Google Scholar
Liu, G., Wang, Y., Orgun, M.A.: Optimal social trust path selection in complex social networks. In: Twenty-Fourth AAAI Conference on Artificial Intelligence (2010)
Liu, G., Wang, Y., Orgun, M.A., Lim, E.P.: Finding the optimal social trust path for the selection of trustworthy service providers in complex social networks. IEEE Trans. Serv. Comput. 6(2), 152–167 (2011)
Article Google Scholar
Melville, P., Shah, N., Mihalkova, L., Mooney, R.J.: Experiments on ensembles with missing and noisy data. In: MCS, pp. 293–302 (2004)
Chapter Google Scholar
Mikolov, T., Yih, W.t., Zweig, G.: Linguistic regularities in continuous space word representations. In: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 746–751 (2013)
Miller, G.A.: Wordnet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)
Article Google Scholar
Mitchell, T.M., et al., W.W.C.: Never-ending learning. Commun. ACM 61 (5), 103–115 (2018)
Nickel, M., Murphy, K., Tresp, V., Gabrilovich, E.: A review of relational machine learning for knowledge graphs. Proc. IEEE 104(1), 11–33 (2016)
Article Google Scholar
Nickel, M., Rosasco, L., Poggio, T.A.: Holographic embeddings of knowledge graphs. In: AAAI, pp. 1955–1961 (2016)
Nickel, M., Tresp, V., Kriegel, H.: A three-way model for collective learning on multi-relational data. In: ICML, pp. 809–816 (2011)
Pan, S., Hu, R., Fung, S.f., Long, G., Jiang, J., Zhang, C.: Learning graph embedding with adversarial training methods. arXiv:1901.01250 (2019)
Paulheim, H.: Knowledge graph refinement: a survey of approaches and evaluation methods. Semantic Web 8(3), 489–508 (2017)
Article Google Scholar
Pujara, J., Augustine, E., Getoor, L.: Sparsity and noise: where knowledge graph embeddings fall short. In: EMNLP, pp. 1751–1756 (2017)
Schlichtkrull, M.S., Kipf, T.N., Bloem, P., van den Berg, R., Titov, I., Welling, M.: Modeling relational data with graph convolutional networks. In: ESWC, pp. 593–607 (2018)
Speer, R., Chin, J., Havasi, C.: Conceptnet 5.5: an open multilingual graph of general knowledge. In: AAAI, pp. 4444–4451 (2017)
Trouillon, T., Dance, C.R., Gaussier, É., Welbl, J., Riedel, S., Bouchard, G.: Knowledge graph completion via complex tensor factorization. J. Mach. Learn. Res. 18, 130:1–130:38 (2017)
MathSciNet MATH Google Scholar
Ueda, N., Nakano, R.: Generalization error of ensemble estimators. In: ICNN, vol. 1, pp. 90–95. https://doi.org/10.1109/ICNN.1996.548872 (1996)
Wang, Q., Mao, Z., Wang, B., Guo, L.: Knowledge graph embedding: a survey of approaches and applications. IEEE Trans. Knowl. Data Eng. 29(12), 2724–2743 (2017)
Article Google Scholar
Wang, Z., Zhang, J., Feng, J., Chen, Z.: Knowledge graph embedding by translating on hyperplanes. In: AAAI, pp. 1112–1119 (2014)
Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., Yu, P.S.: A comprehensive survey on graph neural networks. arXiv:1901.00596(2019)
Xiao, M., Ma, K., Liu, A., Zhao, H., Li, Z., Zheng, K., Zhou, X.: Sra: secure reverse auction for task assignment in spatial crowdsourcing. IEEE Transactions on Knowledge and Data Engineering (2019)
Xiao, M., Wu, J., Huang, L., Cheng, R., Wang, Y.: Online task assignment for crowdsensing in predictable mobile social networks. IEEE Trans. Mob. Comput. 16(8), 2306–2320 (2016)
Article Google Scholar
Xie, R., Liu, Z., Lin, F., Lin, L.: Does William Shakespeare REALLY Write Hamlet? Knowledge Representation Learning with Confidence. In: AAAI, pp. 4954–4961 (2018)
Yang, B., Yih, W., He, X., Gao, J., Deng, L.: Embedding entities and relations for learning and inference in knowledge bases. In: ICLR (2015)
Zhai, D., Sun, Y., Liu, A., Li, Z., Liu, G., Zhao, L., Zheng, K.: Towards secure and truthful task assignment in spatial crowdsourcing. World Wide Web pp. 1–24 (2018)
Zhao, Y., Liu, G., Zheng, K., Liu, A., Li, Z., Zhou, X.: A context-aware approach for trustworthy worker selection in social crowd. World Wide Web 20(6), 1211–1235 (2017)
Article Google Scholar
Zhu, F., Wang, Y., Chen, C., Liu, G., Orgun, M.A., Wu, J.: A deep framework for cross-domain and cross-system recommendations. In: IJCAI, pp. 3711–3717 (2018)
Zhu, J., Jiang, W., Liu, A., Liu, G., Zhao, L.: Effective and efficient trajectory outlier detection based on time-dependent popular route. World Wide Web 20(1), 111–134 (2017)
Article Google Scholar
Zhu, X., Wu, X.: Class noise vs. attribute noise: a quantitative study. Artif. Intell. Rev. 22(3), 177–210 (2004)
Article MATH Google Scholar

Download references

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China under Grants 61822113, the Natural Science Foundation of Hubei Province under Grants 2018CFA050.

Author information

Authors and Affiliations

National Engineering Research Center for MultimediaSoftware, School of Computer Science, Wuhan University, Wuhan, China
Guojia Wan & Bo Du
Faculty of Information Technology, Monash University, Melbourne, Australia
Shirui Pan
Department of Computing, Macquarie University, Sydney, Australia
Jia Wu

Authors

Guojia Wan
View author publications
You can also search for this author in PubMed Google Scholar
Bo Du
View author publications
You can also search for this author in PubMed Google Scholar
Shirui Pan
View author publications
You can also search for this author in PubMed Google Scholar
Jia Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Bo Du or Shirui Pan.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article belongs to the Topical Collection: Special Issue on Trust, Privacy, and Security in Crowdsourcing Computing

Guest Editors: An Liu, Guanfeng Liu, Mehmet A. Orgun, and Qing Li

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wan, G., Du, B., Pan, S. et al. Adaptive knowledge subgraph ensemble for robust and trustworthy knowledge graph completion. World Wide Web 23, 471–490 (2020). https://doi.org/10.1007/s11280-019-00711-y

Download citation

Received: 23 May 2019
Revised: 11 July 2019
Accepted: 22 July 2019
Published: 15 October 2019
Issue Date: January 2020
DOI: https://doi.org/10.1007/s11280-019-00711-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Adaptive knowledge subgraph ensemble for robust and trustworthy knowledge graph completion

Abstract

Access this article

Similar content being viewed by others

What Affects the Performance of Models? Sensitivity Analysis of Knowledge Graph Embedding

GCE: Global Contextual Information for Knowledge Graph Embedding

Start Small, Think Big: On Hyperparameter Optimization for Large-Scale Knowledge Graph Embeddings

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Adaptive knowledge subgraph ensemble for robust and trustworthy knowledge graph completion

Abstract

Access this article

Similar content being viewed by others

What Affects the Performance of Models? Sensitivity Analysis of Knowledge Graph Embedding

GCE: Global Contextual Information for Knowledge Graph Embedding

Start Small, Think Big: On Hyperparameter Optimization for Large-Scale Knowledge Graph Embeddings

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation