Abstract
We consider the task of knowledge fusion, an important aspect of cognitive intelligence, with the goal of combining part-of knowledge drawn from different sources. For this, entities and relations are cast into matrix-based representations. Unlike previous work on relation prediction, we consider the challenging setting of graphs with large amounts of completely separate connected components and no overlap between the training and test set entities. In order to address these challenges, we propose a novel cognitively inspired factorization method that jointly factorizes a subject–predicate–object tensor via RESCAL and a similarity matrix via matrix factorization. Our experimental results show that our method significantly outperforms several strong baseline models, including RESCAL and several TransE-style models. The proposed joint factorization of a subject–predicate–object tensor while applying matrix factorization to a similarity matrix obtains substantially higher average accuracy rates than previous approaches. This shows that it can successfully address the challenge of knowledge fusion of disconnected data.
Similar content being viewed by others
References
Acar E, Rasmussen M, Savorani F, Næs T., Bro R. 2013. Understanding data fusion within the framework of coupled matrix and tensor factorizations 129, 53-63.
Bordes A, Glorot X, Weston J, Bengio Y. 2013. A semantic matching energy function for learning with multi-relational data. Machine Learning. To appear.
Bordes A, Usunier N, García-Durán A, Weston J, Yakhnenko O. Translating embeddings for modeling multi-relational data. NIPS; 2013. p. 2787–2795.
Boyd SP, Parikh N, Chu E, Peleato B, Eckstein J. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning 2011;3(1): 1–122.
Chew PA, Bader BW, Kolda TG, Abdelali A. Cross-language information retrieval using PARAFAC2. SIGKDD; 2007. p. 143–152.
Correa FE, Oliveira MDB, Gama J, Corrêa PLP, Rady J. Analyzing the behavior dynamics of grain price indexes using tucker tensor decomposition and spatio-temporal trajectories. Comput Electron Agric 2016; 120:72-78.
Dong X, Gabrilovich E, Heitz G, Horn W, Lao N, Murphy K, Strohmann T, Sun S, Zhang W. Knowledge vault: a web-scale approach to probabilistic knowledge fusion. SIGKDD; 2014. p. 601–610.
Dong XL, Gabrilovich E, Heitz G, Horn W, Murphy K, Sun S, Zhang W. From data fusion to knowledge fusion. PVLDB 2014;7(10):881–892.
Dong XL, Srivastava D. Knowledge curation and knowledge fusion: challenges, models and applications. Proceedings of the 2015 ACM SIGMOD international conference on management of data, Melbourne, Victoria, Australia, May 31 - June 4, 2015; 2015. p. 2063–2066.
Du J, Yuan C, Tian P, Lin H. Channel estimation for multi-input multi-output relay systems using the PARATUCK2 tensor model. IET Commun 2016;10(9):995–1002.
Fellbaum C, (ed). 1998. WordNet: an electronic lexical database. http://www.amazon.ca/exec/obidos/redirect?tag=citeulike04-20%&path=ASIN/026206197X. Cambridge: The MIT Press.
He L, Liu B, Li G, Sheng Y, Wang Y, Xu Z. Knowledge base completion by variational Bayesian neural tensor decomposition. Cogn Comput 2018;10(6):1075–1084.
He S, Liu K, Ji G, Zhao J. Learning to represent knowledge graphs with gaussian embedding. CIKM. ACM; 2015. p. 623–632.
Ji G, Liu K, He S, Zhao J. Knowledge graph completion with adaptive sparse transfer matrix. AAAI; 2016. p. 985–991.
Jiang JJ, Conrath DW. 1997. Semantic similarity based on corpus statistics and lexical taxonomy. arXiv:cmp-lg/9709008.
Kiers H. Towards a standardized notation and terminology in multiway analysis. 2000;14, 105-122.
Kolda TG, Bader BW. Tensor decompositions and applications. SIAM Rev 2009;51(3):455–500.
Krishna R, Zhu Y, Groth O, Johnson J, Hata K, Kravitz J, Chen S, Kalantidis Y, Li LJ, Shamma DA, Bernstein M, Fei-Fei L. 2016. Visual genome: connecting language and vision using crowdsourced dense image annotations. arXiv:1602.07332 .
Leacock C, Chodorow M. Combining local context and wordnet similarity for word sense identification. WordNet: an electronic lexical database 1998;49(2):265–283.
Lin D. An information-theoretic definition of similarity. ICML; 1998.
Lin Y, Liu Z, Luan H, Sun M, Rao S, Liu S. 2015. Modeling relation paths for representation learning of knowledge bases. Computer Science.
Lin Y, Liu Z, Sun M, Liu Y, Zhu X. Learning entity and relation embeddings for knowledge graph completion. AAAI; 2015. p. 2181–2187.
Lin Y, Liu Z, Zhu X, Zhu X, Zhu X. Learning entity and relation embeddings for knowledge graph completion. AAAI; 2015. p. 2181–2187.
Liu B, He L, Li Y, Zhe S, Xu Z. Neuralcp: Bayesian multiway data analysis with neural tensor decomposition. Cogn Comput 2018;10(6):1051–1061.
Mikolov T, Chen K, Corrado G, Dean J. 2013. Efficient estimation of word representations in vector space. arXiv:1301.3781.
Nengfu X, Wensheng W, Xiaorong Y, Lihua J. Rule-based agricultural knowledge fusion in web information integration. NJAS - Wageningen Journal of Life Sciences 2012;10(1):635–638(4).
Nickel M, Tresp V, Kriegel H. A three-way model for collective learning on multi-relational data. ICML; 2011. p. 809–816.
Nickel M, Tresp V, Kriegel H. Factorizing YAGO: scalable machine learning for linked data. WWW; 2012. p. 271–280.
Pennington J, Socher R, Manning CD. Glove: Global vectors for word representation. EMNLP; 2014. p. 1532–1543.
Pilehvar MT, Jurgens D, Navigli R. Align, disambiguate and walk: a unified approach for measuring semantic similarity. ACL; 2013. p. 1341–1351.
Preece AD, Hui K, Gray WA, Marti P, Bench-Capon TJM, Jones DM, Cui Z. The KRAFT architecture for knowledge fusion and transformation. Knowl.-Based Syst 2000;13(2-3):113–120.
Ragusa E, Gastaldo P, Zunino R, Cambria E. Learning with similarity functions: a tensor-based framework. Cogn Comput 2019;11(1):31–49.
Resnik P. 1995. Using information content to evaluate semantic similarity in a taxonomy. arXiv:cmp-lg/9511007.
Solé-Casals J, Caiafa CF, Zhao Q, Cichocki AS. Brain-computer interface with corrupted EEG data: a tensor completion approach. Cogn Comput 2018;10(6):1062–1074.
Stegeman A, Berge JT, Psychometrika LDL. Sufficient conditions for uniqueness in candecomp/parafac and indscal with random component matrices. Psychometrika 2006;71(2):219–229.
Tandon N, Hariman C, Urbani J, Rohrbach A, Rohrbach M, Weikum G. Commonsense in parts: mining part-whole relations from the web and image tags. Proceedings of the thirtieth AAAI conference on artificial intelligence, February 12–17, 2016, Phoenix, Arizona, USA; 2016. p. 243–250.
Thoma S, Rettinger A, Both F. 2017. Knowledge fusion via embeddings from text, knowledge graphs, and images. arXiv:1704.06084.
Wang Y, Widrow B, Zadeh LA, Howard N, Wood S, Bhavsar VC, Budin G, Chan CW, Fiorini RA, Gavrilova ML, Shell DF. Cognitive intelligence: deep learning, thinking, and reasoning by brain-inspired systems. IJCINI 2016;10(4):1–20.
Wang Z, Zhang J, Feng J, Chen Z. Knowledge graph embedding by translating on hyperplanes. AAAI; 2014. p. 1112–1119.
Wu Z, Palmer MS. Verb semantics and lexical selection. ACL; 1994. p. 133–138.
Yu Xl, Qiao L. 2017. Knowledge fusion methods: a survey. DEStech Transactions on Computer Science and Engineering (smce).
Zhang J, Han Y, Jiang J. Tucker decomposition-based tensor learning for human action recognition. Multimedia Syst 2016;22(3):343–353.
Funding
The authors would like to acknowledge the financial support provided by the National Natural Science Foundation of China (no. 61503217)
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare that they have no conflict of interest.
Ethical Approval
This article does not describe any studies with human participants or animals performed by any of the authors.
Informed Consent
Informed consent was not required as no humans or animals were involved.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Hao, Z., Wang, Y., Liu, Z. et al. Knowledge Fusion via Joint Tensor and Matrix Factorization. Cogn Comput 12, 642–653 (2020). https://doi.org/10.1007/s12559-019-09686-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12559-019-09686-4