Skip to main content

KSRL: Knowledge Selection Based Reinforcement Learning for Knowledge-Grounded Dialogue

  • Conference paper
  • First Online:
Knowledge Science, Engineering and Management (KSEM 2023)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14120))

  • 634 Accesses

Abstract

In the domain of multi-turn knowledge-grounded dialogues, the sequential coherence among knowledge elements chosen across various conversational turns presents potential cues for knowledge selection. However, this aspect has been largely overlooked in preceding studies. To tackle this issue, the present study introduces an innovative methodology that employs reinforcement learning to enhance knowledge selection in open-domain dialogue systems. By recasting the knowledge selection challenge as a sequential decision-making task and implementing reinforcement learning, the dialogue system is capable of discerning which knowledge to choose based on the conversational context and preceding dialogue turns, thereby generating high-quality responses. The system acquires a reward signal contingent upon the quality of the generated responses and subsequently updates its policy to maximize the expected reward over time. Harnessing the capabilities of reinforcement learning, our proposed method effectively learns to identify the most pertinent knowledge, thereby generating superior-quality responses. The study assesses the proposed approach using multiple open-domain dialogue datasets, demonstrating that it surpasses the performance of prior methodologies.

The research work is supported by National Key R &D Program of China (No.2022YFB3904700), Key Research and Development Program of in Shandong Province (2019JZZY020102), Key Research and Development Program of Jiangsu Province (No.BE2018084), Industrial Internet Innovation and Development Project in 2021 (TC210A02M, TC210804D), Opening Project of Beijing Key Laboratory of Mobile Computing and Pervasive Device.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dinan, E., Roller, S., Shuster, K., Fan, A., Auli, M., Weston, J.: Wizard of wikipedia: Knowledge-powered conversational agents. In: International Conference on Learning Representations

    Google Scholar 

  2. Fan, L., et al.: MineDojo: building open-ended embodied agents with internet-scale knowledge (2022). https://doi.org/10.48550/ARXIV.2206.08853, https://arxiv.org/abs/2206.08853

  3. Feng, R., Chen, M.: Multi-sensor data fusion for short-term traffic flow prediction: a novel multi-channel data structure integrated with mixed-pointwise convolution and channel attention mechanism. In: Artificial Neural Networks and Machine Learning-ICANN 2022: 31st International Conference on Artificial Neural Networks, Bristol, UK, September 6–9, 2022, Proceedings; Part IV, pp. 731–742. Springer (2022). https://doi.org/10.1007/978-3-031-15937-4_61

  4. Fu, D., Zhang, C., Yu, J., Sun, Q., Zhan, Z.: Improving dialogue generation with commonsense knowledge fusion and selection. In: Knowledge Science, Engineering and Management: 15th International Conference, KSEM 2022, Singapore, August 6–8, 2022, Proceedings, Part I, pp. 93–108. Springer (2022). https://doi.org/10.1007/978-3-031-10983-6_8

  5. Holtzman, A., Buys, J., Forbes, M., Choi, Y.: The curious case of neural text degeneration. In: International Conference on Learning Representations (ICLR) (2020)

    Google Scholar 

  6. Kim, B., Ahn, J., Kim, G.: Sequential latent knowledge selection for knowledge-grounded dialogue. arXiv preprint arXiv:2002.07510 (2020)

  7. Li, Z., Niu, C., Meng, F., Feng, Y., Li, Q., Zhou, J.: Incremental transformer with deliberation decoder for document grounded conversations. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 12–21 (2019)

    Google Scholar 

  8. Lian, R., Xie, M., Wang, F., Peng, J., Wu, H.: Learning to select knowledge for response generation in dialog systems. In: IJCAI International Joint Conference on Artificial Intelligence, p. 5081 (2019)

    Google Scholar 

  9. Ma, Z., Ye, J., Yang, X., Liu, J.: HCLD: a hierarchical framework for zero-shot cross-lingual dialogue system. In: Proceedings of the 29th International Conference on Computational Linguistics, pp. 4492–4498 (2022)

    Google Scholar 

  10. Mirchandani, S., Karamcheti, S., Sadigh, D.: ELLA: exploration through learned language abstraction. arXiv preprint arXiv:2103.05825 (2021)

  11. Mu, J., et al.: Improving intrinsic exploration with language abstractions (2022). https://doi.org/10.48550/ARXIV.2202.08938, https://arxiv.org/abs/2202.08938

  12. Radford, A., et al.: Learning transferable visual models from natural language supervision. arXiv preprint arXiv:2103.00020 (2021)

  13. Tam, A.C., et al.: Semantic exploration from language abstractions and pretrained representations (2022). https://doi.org/10.48550/ARXIV.2204.05080, https://arxiv.org/abs/2204.05080

  14. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems. vol. 30 (2017)

    Google Scholar 

  15. Wang, C., Li, Y., Fei, C., Huang, X.: Labeled knowledge-based decision making with assumption-based argumentation. In: Knowledge Science, Engineering and Management: 15th International Conference, KSEM 2022, Singapore, August 6–8, 2022, Proceedings, Part I, pp. 450–465. Springer (2022). https://doi.org/10.1007/978-3-031-10983-6_35

  16. Wang, Y., Zhu, X., Zhang, H.: Relation prediction based on source-entity behavior preference modeling via heterogeneous graph pooling. In: Knowledge Science, Engineering and Management: 15th International Conference, KSEM 2022, Singapore, August 6–8, 2022, Proceedings, Part I, pp. 425–436. Springer (2022). https://doi.org/10.1007/978-3-031-10983-6_33

  17. Xu, J., Wang, H., Niu, Z.Y., Wu, H., Che, W.: Knowledge graph grounded goal planning for open-domain conversation generation. In: AAAI Conference on Artificial Intelligence (AAAI) (2020)

    Google Scholar 

  18. Zhang, T., Huang, M., Zhao, L.: Learning structured representation for text classification via reinforcement learning. In: Proceedings of the AAAI Conference on Artificial Intelligence. vol. 32 (2018)

    Google Scholar 

  19. Zou, P., Teng, Y., Niu, T.: Multi-scale feature extraction and fusion for online knowledge distillation. In: Artificial Neural Networks and Machine Learning-ICANN 2022: 31st International Conference on Artificial Neural Networks, Bristol, UK, September 6–9, 2022, Proceedings; Part IV, pp. 126–138. Springer (2022). https://doi.org/10.1007/978-3-031-15937-4_11

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jian Ye .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ma, Z., Ye, J., Cheng, S. (2023). KSRL: Knowledge Selection Based Reinforcement Learning for Knowledge-Grounded Dialogue. In: Jin, Z., Jiang, Y., Buchmann, R.A., Bi, Y., Ghiran, AM., Ma, W. (eds) Knowledge Science, Engineering and Management. KSEM 2023. Lecture Notes in Computer Science(), vol 14120. Springer, Cham. https://doi.org/10.1007/978-3-031-40292-0_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-40292-0_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-40291-3

  • Online ISBN: 978-3-031-40292-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics