skip to main content
10.1145/3653081.3653117acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiotaaiConference Proceedingsconference-collections
research-article

A fusion inference method for large language models and knowledge graphs based on structured injection and causal inference

Published:03 May 2024Publication History

ABSTRACT

In this paper, we propose a large language model and knowledge graph fusion reasoning method based on structured injection and causal reasoning (LKFSC) to address the limitations of existing large language models and knowledge graphs in practical applications. The approach effectively mitigates the problems of long-distance dependency and limited contextual information, and improves the reasoning capability of the large language model. Meanwhile, by fusing the generative ability of the large language model and the inference ability of the knowledge graph, the method realizes intelligent reasoning for complex problems. The main contributions of this paper include proposing a structured injection method that introduces causality for reasoning, and constructing a fusion reasoning framework that effectively mitigates the illusory problem of large language models and provides powerful and intelligent decision support for practical applications.

References

  1. Zhao, W.X., Zhou, K., Li, J., Tang, T., Wang, X., Hou, Y., Min, Y., Zhang, B., Zhang, J., Dong, Z., Du, Y., Yang, C., Chen, Y., Chen, Z., Jiang, J., Ren, R., Li, Y. (2023). ..., Tang, X., Liu, Z., Liu, P., Nie, J., & Wen, J. 2023. A Survey of Large Language Models. arXiv, abs/2303.18223.Google ScholarGoogle Scholar
  2. Katherine Lee, Daphne Ippolito, Andrew Nystrom, Chiyuan Zhang, Douglas Eck, Chris Callison-Burch, and Nicholas Carlini. 2021. Deduplicating training data makes language models better. arXiv preprint arXiv:2107.06499.Google ScholarGoogle Scholar
  3. Nayeon Lee, Wei Ping, Peng Xu, Mostofa Patwary, Mohammad Shoeybi, and Bryan Catanzaro. 2022. Factuality enhanced language models for open-ended text generation. arXiv preprint arXiv:2206.04624.Google ScholarGoogle Scholar
  4. Ren, R., Wang, Y., Qu, Y., Zhao, W.X., Liu, J., Tian, H., Wu, H., Wen, J., & Wang, H. 2023. Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation. arXiv, abs/2307.11019.Google ScholarGoogle Scholar
  5. Chen, F., & Feng, Y. 2023. Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction.Google ScholarGoogle Scholar
  6. Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C.L., Mishkin, P., Zhang, C., Agarwal, S., Slama, K., Ray, A., Schulman, J., Hilton, J., Kelton, F. , Miller, L.E., Simens, M., Askell, A., Welinder, P., Christiano, P.F., Leike, J., & Lowe, R.J. 2022. Training language models to follow instructions with human feedback. arXiv, abs/2203.02155.Google ScholarGoogle Scholar
  7. Girshick, R.B., Donahue, J., Darrell, T., & Malik, J. 2013. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition,. 580-587.Google ScholarGoogle Scholar
  8. Shi, X., Chen, Z., Wang, H., Yeung, D.Y., Wong, W., & Woo, W. 2015. Convolutional LSTM Network: a Machine Learning Approach for Precipitation Nowcasting. neural Information Processing Systems.Google ScholarGoogle Scholar
  9. Kipf, T., & Welling, M. 2016. Semi-Supervised Classification with Graph Convolutional Networks. arXiv, abs/1609.02907.Google ScholarGoogle Scholar
  10. Fukushima, K. 1980. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics,. 36, 193-202.Google ScholarGoogle Scholar
  11. Vaswani, A., Shazeer, N.M., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., & Polosukhin, I. 2017. Attention is All you Need. neural Information Processing Systems.Google ScholarGoogle Scholar
  12. Burr, T.L. 2003. Causation, Prediction, and Search. technometrics, 45, 272 - 273.Google ScholarGoogle Scholar
  13. Ji, S., Pan, S., Cambria, E., Marttinen, P., & Yu, P.S. 2020. A Survey on Knowledge Graphs: Representation, Acquisition, and Applications. IEEE Transactions on Neural Networks and Learning Systems, 33, 494-514.Google ScholarGoogle ScholarCross RefCross Ref
  14. Zhang, J. 2023. Graph-ToolFormer: To Empower LLMs with Graph Reasoning Ability via Prompt Augmented by ChatGPT. arXiv, abs/2304.11116.Google ScholarGoogle Scholar
  15. Choudhary, N., & Reddy, C.K. 2023. Complex Logical Reasoning over Knowledge Graphs using Large Language Models. arXiv, abs/2305.01157.Google ScholarGoogle Scholar
  16. Zhu, Y., Wang, X., Chen, J., Qiao, S., Ou, Y., Yao, Y., Deng, S., Chen, H., & Zhang, N. 2023. LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities. arXiv, abs/2305.13168.Google ScholarGoogle Scholar
  17. Wen, Y., Wang, Z., & Sun, J. 2023. MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models. arXiv, abs/2308.09729.Google ScholarGoogle Scholar
  18. Shazeer, N.M. 2019. Fast Transformer Decoding: One Write-Head is All You Need. arXiv, abs/1911.02150.Google ScholarGoogle Scholar
  19. Jin, Zhijing, 2023 "Can Large Language Models Infer Causation from Correlation?". arXiv preprint arXiv:2306.05836Google ScholarGoogle Scholar
  20. Wei, Yinwei, 2022 "Causal inference for knowledge graph based recommendation." IEEE Transactions on Knowledge and Data Engineering (2022).Google ScholarGoogle Scholar
  21. Xu, Shuyuan, 2022 "Dynamic causal collaborative filtering." Proceedings of the 31st ACM International Conference on Information & Knowledge ManagementGoogle ScholarGoogle Scholar
  22. Kıcıman, Emre, 2023 "Causal reasoning and large language models: opening a new frontier for causality." arXiv preprint arXiv:2305.00050Google ScholarGoogle Scholar
  23. Wang, Wenjie, 2023 "Causal recommendation: progresses and future directions." Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2023.Google ScholarGoogle Scholar
  24. Zheng, Yu, 2021 "Disentangling user interest and conformity for recommendation with causal embedding." Proceedings of the Web Conference 2021.Google ScholarGoogle Scholar
  25. Chickering, David Maxwell. 2002 "Optimal structure identification with greedy search." journal of machine learning research 3. Nov: 507-554.Google ScholarGoogle Scholar
  26. Shimizu, Shohei, 2006 "A linear non-Gaussian acyclic model for causal discovery." Journal of Machine Learning Research 7.10.Google ScholarGoogle Scholar
  27. Zeng, A., Liu, X., Du, Z., Wang, Z., Lai, H., Ding, M., Yang, Z., Xu, Y., Zheng, W., Xia, X., Tam, W.L., Ma, Z., Xue, Y., Zhai, J., Chen, W., Zhang, P., Dong, Y., & amp; Tang, J. 2022. GLM-130B: An Open Bilingual Pre-trained Model. arXiv, abs/2210.02414.Google ScholarGoogle Scholar
  28. Xu, M. 2022 Text2vec: Text to vector toolkit (Version 1.1.2) [Computer software]. https://github.com/shibing624/text2vecGoogle ScholarGoogle Scholar

Index Terms

  1. A fusion inference method for large language models and knowledge graphs based on structured injection and causal inference

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      IoTAAI '23: Proceedings of the 2023 5th International Conference on Internet of Things, Automation and Artificial Intelligence
      November 2023
      902 pages
      ISBN:9798400716485
      DOI:10.1145/3653081

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 May 2024

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)2

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format