Discourse Relation-Aware Multi-turn Dialogue Response Generation

Wang, Huijie; He, Ruifang; Jia, Yungang; Xu, Jing; Wang, Bo

doi:10.1007/978-3-031-44693-1_66

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 14302)

Included in the following conference series:

CCF International Conference on Natural Language Processing and Chinese Computing

Abstract

Multi-turn dialogue response generation aims to generate a response that takes the dialogue context into account. Because each response depends on the context, the task is not equivalent to a series of independent single-turn dialogues. Many existing models achieve strong results on response generation, but they still struggle to model the contextual semantics of the dialogue history. Sequence models have difficulty capturing the interactive relations between contextual utterances, which harms the coherence of generated responses. To address this issue, we propose a discourse relation-aware model that encodes the contextual utterances with a directed acyclic graph neural network (DAGNN) constrained by dialogue-specific discourse relations, so as to better model the intrinsic structure of the dialogue. In addition, we introduce an auxiliary discourse relation recognition task to strengthen the model’s ability to represent the context. Extensive experimental results show that our proposed model outperforms the baselines.
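
To make the idea of encoding utterances over a relation-labelled directed acyclic graph more concrete, here is a minimal sketch in plain Python. It is not the authors' model: the relation names, the scalar weights in `RELATION_WEIGHT`, the function `encode_dag`, and the simple weighted-sum update are all assumptions chosen for illustration. A real DAGNN would use learned parameters and a gated or attention-based aggregation.

```python
from collections import defaultdict

# Hypothetical relation inventory with one fixed scalar weight each.
# Both the labels and the weights are illustrative assumptions.
RELATION_WEIGHT = {"QAP": 1.0, "Elaboration": 0.5, "Comment": 0.25}


def encode_dag(utterance_vecs, edges):
    """Propagate information along a relation-labelled DAG of utterances.

    utterance_vecs: list of equal-length float lists, in dialogue order.
    edges: list of (src, dst, relation) tuples with src < dst, so every
           edge points forward in time and the graph is acyclic.
    Returns one context-aware vector per utterance.
    """
    preds = defaultdict(list)
    for src, dst, rel in edges:
        if not src < dst:
            raise ValueError("edges must point from earlier to later utterances")
        preds[dst].append((src, rel))

    hidden = []
    # Dialogue order is a valid topological order because src < dst.
    for i, vec in enumerate(utterance_vecs):
        agg = [0.0] * len(vec)
        total = sum(RELATION_WEIGHT[rel] for _, rel in preds[i])
        for src, rel in preds[i]:
            w = RELATION_WEIGHT[rel] / total  # normalise over predecessors
            for k in range(len(vec)):
                agg[k] += w * hidden[src][k]
        # Residual-style update: own features plus aggregated predecessors.
        hidden.append([v + a for v, a in zip(vec, agg)])
    return hidden


if __name__ == "__main__":
    vecs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    links = [(0, 1, "QAP"), (0, 2, "Elaboration"), (1, 2, "Comment")]
    for row in encode_dag(vecs, links):
        print(row)
```

Because each utterance only receives information from earlier utterances it is linked to, the relation type controls how strongly a predecessor contributes. This is the kind of structural constraint that a purely sequential encoder does not express directly.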

Acknowledgement

Our work is supported by the National Natural Science Foundation of China under Grant 61976154.

Author information

Authors and Affiliations

College of Intelligence and Computing, Tianjin University, Tianjin, China
Huijie Wang, Ruifang He, Jing Xu & Bo Wang
Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin, China
Huijie Wang, Ruifang He & Jing Xu
Tianjin Branch of National Computer Network and Information Security Management Center, Tianjin, China
Yungang Jia

Corresponding author

Correspondence to Ruifang He.

Editor information

Editors and Affiliations

Emory University, Atlanta, GA, USA
Fei Liu
Microsoft Research Asia, Beijing, China
Nan Duan
Soochow University, Suzhou, China
Qingting Xu
Soochow University, Suzhou, China
Yu Hong

Cite this paper

Wang, H., He, R., Jia, Y., Xu, J., Wang, B. (2023). Discourse Relation-Aware Multi-turn Dialogue Response Generation. In: Liu, F., Duan, N., Xu, Q., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2023. Lecture Notes in Computer Science, vol 14302. Springer, Cham. https://doi.org/10.1007/978-3-031-44693-1_66

DOI: https://doi.org/10.1007/978-3-031-44693-1_66
Published: 08 October 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44692-4
Online ISBN: 978-3-031-44693-1
eBook Packages: Computer Science, Computer Science (R0)
