摘要
提出一种多重知识表示框架, 探讨了其对推动大数据人工智能技术在各个领域中发展的重要意义及深远影响. 传统知识表达和现代基于深度学习的知识表达通常着眼于利用特定变换方式, 将输入转换为符号编码或者向量. 例如, 知识图谱关注于描述各个概念之间的语义联系, 而深度神经网络更像是感知原始信号输入的工具. 多重知识表达是一种更为先进的人工智能表征框架, 具备更完整的智能功能, 比如原始信号感知、 特征提取及向量化、 知识符号化和逻辑推断. 多重知识表达有如下两点优势: (1) 与现有以深度学习为主导的人工智能技术相比, 具有更强的解释性以及更好的泛化能力; (2) 将多重知识表达集成于现有人工智能技术, 有利于各种表征 (例如原始信号感知以及符号化编码) 发挥互补优势. 我们希望多重知识表达相关研究以及应用能够驱动新一代人工智能蓬勃发展.
References
Amodei D, Olah C, Steinhardt J, et al., 2016. Concrete problems in AI safety. https://arxiv.org/abs/1606.06565v2
Arrieta AB, díaz-Rodríguez N, Del Ser J, et al., 2020. Explainable artificial intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI. Inform Fus, 58:82–115. https://doi.org/10.1016/j.inffus.2019.12.012
Auer S, Bizer C, Kobilarov G, et al., 2007. DBpedia: a nucleus for a web of open data. Proc 6th Int Semantic Web Conf and 2nd Asian Semantic Web Conf the Semantic Web, p.722–735. https://doi.org/10.1007/978-3-540-76298-0_52
de Souza CR, Gaidon A, Cabon Y, et al., 2017. Procedural generation of videos to train deep action recognition networks. IEEE Conf on Computer Vision and Pattern Recognition, p.2594–2604. https://doi.org/10.1109/CVPR.2017.278
Fan HH, Zhu LC, Yang Y, et al., 2020. Recurrent attention network with reinforced generator for visual dialog. ACM Trans Multim Comput Commun Appl, 16(3):78. https://doi.org/10.1145/3390891
Ferrada S, Bustos B, Hogan A, 2017. IMGpedia: a linked dataset with content-based analysis of Wikimedia images. Proc 16th Int Semantic Web Conf on the Semantic Web, p.84–93. https://doi.org/10.1007/978-3-319-68204-4_8
França MVM, Zaverucha G, d’Avila Garcez AS, 2014. Fast relational learning using bottom clause propositionalization with artificial neural networks. Mach Learn, 94(1): 81–104. https://doi.org/10.1007/s10994-013-5392-1
Gogoglou A, Bruss CB, Hines KE, 2019. On the interpretability and evaluation of graph representation learning. https://arxiv.org/abs/1910.03081
Goodfellow IJ, Pouget-Abadie J, Mirza M, et al., 2014. Generative adversarial nets. Proc 27th Int Conf on Neural Information Processing Systems, p.2672–2680.
He KM, Zhang XY, Ren SQ, et al., 2016. Deep residual learning for image recognition. IEEE Conf on Computer Vision and Pattern Recognition, p.770–778. https://doi.org/10.1109/CVPR.2016.90
He KM, Fan HQ, Wu YX, et al., 2020. Momentum contrast for unsupervised visual representation learning. IEEE/CVF Conf on Computer Vision and Pattern Recognition, p.9726–9735. https://doi.org/10.1109/CVPR42600.2020.00975
Johnson J, Gupta A, Li FF, 2018. Image generation from scene graphs. IEEE/CVF Conf on Computer Vision and Pattern Recognition, p.1219–1228. https://doi.org/10.1109/CVPR.2018.00133
Krizhevsky A, Sutskever I, Hinton GE, 2012. ImageNet classification with deep convolutional neural networks. Proc 25th Int Conf on Neural Information Processing Systems, p.1097–1105.
Miao JX, Wu Y, Yang Y, 2021. Identifying visible parts via pose estimation for occluded person re-identification. IEEE Trans Neur Netw Learn Syst, early access. https://doi.org/10.1109/TNNLS.2021.3059515
Pan YH, 2019. On visual knowledge. Front Inform Technol Electron Eng, 20(8):1021–1025. https://doi.org/10.1631/FITEE.1910001
Pan YH, 2020a. Miniaturized five fundamental issues about visual knowledge. Front Inform Technol Electron Eng, early access. https://doi.org/10.1631/FITEE.2040000
Pan YH, 2020b. Multiple knowledge representation of artificial intelligence. Engineering, 6(3):216–217. https://doi.org/10.1016/j.eng.2019.12.011
Pan YH, 2021. On visual understanding. Front Inform Technol Electron Eng, early access. https://doi.org/10.1631/FITEE.2130000
Serafini L, d’Avila Garcez A, 2016. Logic tensor networks: deep learning and logical reasoning from data and knowledge. Proc 11th Int Workshop on Neural-Symbolic Learning and Reasoning Co-located with the Joint Multi-conf on Human-Level Artificial Intelligence.
Simonyan K, Zisserman A, 2014. Two-stream convolutional networks for action recognition in videos. Proc 27th Int Conf on Neural Information Processing Systems, p.568–576.
Singh J, Zheng L, 2020. Combining semantic guidance and deep reinforcement learning for generating human level paintings. https://arxiv.org/abs/2011.12589
Sun YF, Cheng CM, Zhang YH, et al., 2020. Circle loss: a unified perspective of pair similarity optimization. IEEE/CVF Conf on Computer Vision and Pattern Recognition, p.6397–6406. https://doi.org/10.1109/CVPR42600.2020.00643
Tang KH, Niu YL, Huang JQ, et al., 2020. Unbiased scene graph generation from biased training. IEEE/CVF Conf on Computer Vision and Pattern Recognition, p.3716–3725. https://doi.org/10.1109/CVPR42600.2020.00377
Vaswani A, Shazeer N, Parmar N, et al., 2017. Attention is all you need. Proc 31st Int Conf on Neural Information Processing Systems, p.6000–6010.
Veeravasarapu V, Rothkopf C, Visvanathan R, 2017. Adversarially tuned scene generation. IEEE Conf on Computer Vision and Pattern Recognition, p.6441–6449. https://doi.org/10.1109/CVPR.2017.682
Vrandečić D, Krötzsch M, 2014. Wikidata: a free collaborative knowledge base. Commun ACM, 57(10):78–85. https://doi.org/10.1145/2629489
Wang XH, Zhu LC, Wu Y, et al., 2020. Symbiotic attention for egocentric action recognition with object-centric alignment. IEEE Trans Patt Anal Mach Intell, early access. https://doi.org/10.1109/TPAMI.2020.3015894
Xu DF, Zhu YK, Choy CB, et al., 2017. Scene graph generation by iterative message passing. IEEE Conf on Computer Vision and Pattern Recognition, p.3097–3106. https://doi.org/10.1109/CVPR.2017.330
Yan Y, Nie FP, Li W, et al., 2016. Image classification by cross-media active learning with privileged information. IEEE Trans Multim, 18(12):2494–2502. https://doi.org/10.1109/TMM.2016.2602938
Zhu LC, Fan HH, Luo YW, et al., 2021. Temporal cross-layer correlation mining for action recognition. IEEE Trans Multim, early access. https://doi.org/10.1109/TMM.2021.3057503
Acknowledgements
The authors thank Drs. Yifan SUN, Linchao ZHU, Xiaohan WANG, and Yu WU for constructive discussions.
Author information
Authors and Affiliations
Contributions
Yunhe PAN conceptualized the main idea and led the research. Yi YANG and Yueting ZHUANG surveyed the relevant materials. Yi YANG, Yueting ZHUANG, and Yunhe PAN had in-depth discussions; they drafted, revised, and finalized the paper.
Corresponding author
Ethics declarations
Yi YANG, Yueting ZHUANG, and Yunhe PAN declare that they have no conflict of interest.
Additional information
Project supported by the National Key R&D Program of China (No. 2020AAA0108800)
Rights and permissions
About this article
Cite this article
Yang, Y., Zhuang, Y. & Pan, Y. Multiple knowledge representation for big data artificial intelligence: framework, applications, and case studies. Front Inform Technol Electron Eng 22, 1551–1558 (2021). https://doi.org/10.1631/FITEE.2100463
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1631/FITEE.2100463