Skip to main content
Log in

Global-chronological graph interactive networks for multi-domain dialogue state tracking

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

Dialogue state tracking (DST) provides necessary information for the policy learning module to decide the next action according to the historical state and the utterance related details of the dialogue, so robust dialogue state tracking is a core part of high-quality task-oriented dialogue systems. The tasks addressed by dialogue state tracking are becoming increasingly complex, and due to the multi-domain switching in the dialogue relevant contexts, the model needs to have the ability to generate appropriate responses to accurately fit the domains, slots and slot values. However, efficient encoding of state information and fundamentally simulating the logical relationship between domains and slots to dynamically extract variation features of the multiple domains are still challenges for DST. In this paper, we propose a novel framework global-chronological graph interactive networks (GCGIN), which encodes both global and chronological information of the dialogue from dual perspectives, and leverages coupling and co-affine modules to interact with the above two different granularities information. Our model adopts the mask mechanism to improve anti-noise ability, and constructs the dynamic schema graph structure to dynamically model domain-slot and domain-value relations via dialogue chronological order to achieve cross-domain features learning and improve model scalability. Extensive experimental results on three benchmark datasets demonstrate that our proposed model outperforms existing mainstream methods, and comprehensive analysis validates the effectiveness of each component.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Data availability

MultiWOZ 2.1 [22]: https://github.com/budzianowski/multiwoz/tree/master/data MultiWOZ2.2 [23]: https://github.com/budzianowski/multiwoz SGD [24]: https://github.com/google-research-datasets/dstc8-schema-guided-dialogue.

References

  1. Henderson M, Thomson B, Williams JD (2014) The second dialog state tracking challenge. In: Proceedings of the 15th annual meeting of the special interest group on discourse and dialogue (SIGDIAL), pp 263–272

  2. Ni J, Young T, Pandelea V, Xue F, Adiga V, Cambria E (2021) Recent advances in deep learning based dialogue systems: a systematic survey. arXiv preprint arXiv:2105.04387

  3. Mrkšić N, Séaghdha DÓ, Wen T-H, Thomson B, Young S (2017) Neural belief tracker: Data-driven dialogue state tracking. In: Proceedings of the 55th annual meeting of the Association for Computational Linguistics, vol 1 (Long Papers), pp 1777–1788

  4. Chen L, Lv B, Wang C, Zhu S, Tan B, Yu K (2020) Schema-guided multi-domain dialogue state tracking with graph attention neural networks. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 7521–7528

  5. Ye F, Manotumruksa J, Zhang Q, Li S, Yilmaz E (2021) Slot self-attentive dialogue state tracking. In: Proceedings of the web conference 2021, pp 1598–1608

  6. Hosseini-Asl E, McCann B, Wu C-S, Yavuz S, Socher R (2020) A simple language model for task-oriented dialogue. Adv Neural Inf Process Syst 33:20179–20191

    Google Scholar 

  7. Heck M, van Niekerk C, Lubis N, Geishauser C, Lin H-C, Moresi M, Gasic M (2020) Trippy: a triple copy strategy for value independent neural dialog state tracking. In: Proceedings of the 21th annual meeting of the special interest group on discourse and dialogue, pp 35–44

  8. Kim S, Yang S, Kim G, Lee S-W (2020) Efficient dialogue state tracking by selectively overwriting memory. In: Proceedings of the 58th annual meeting of the Association for Computational Linguistics, pp 567–582

  9. Ramadan O, Budzianowski P, Gasic M (2018) Large-scale multi-domain belief tracking with knowledge sharing. In: Proceedings of the 56th Annual meeting of the Association for Computational Linguistics, vol 2 (Short Papers), pp 432–437

  10. Chen J, Zhang R, Mao Y, Xu J (2020) Parallel interactive networks for multi-domain dialogue state generation. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 1921–1931

  11. Feng Y, Wang Y, Li H (2021) A sequence-to-sequence approach to dialogue state tracking. In: Proceedings of the 59th annual meeting of the Association for Computational Linguistics and the 11th international joint conference on natural language processing, vol 1 (Long Papers), pp 1714–1725

  12. Ren L, Ni J, McAuley J (2019) Scalable and accurate dialogue state tracking via hierarchical sequence generation. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 1876–1885

  13. Chen Z, Chen L, Xu Z, Zhao Y, Zhu S, Yu K (2020) Credit: coarse-to-fine sequence generation for dialogue state tracking. arXiv preprint arXiv:2009.10435

  14. Tian X, Huang L, Lin Y, Bao S, He H, Yang Y, Wu H, Wang F, Sun S (2021) Amendable generation for dialogue state tracking. In: Proceedings of the 3rd workshop on natural language processing for conversational AI, pp 80–92

  15. Lee C-H, Cheng H, Ostendorf M (2021) Dialogue state tracking with a language model using schema-driven prompting. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 4937–4949

  16. Guo J, Shuang K, Li J, Wang Z (2021) Dual slot selector via local reliability verification for dialogue state tracking. In: Proceedings of the 59th annual meeting of the Association for Computational Linguistics and the 11th international joint conference on natural language processing, vol 1 (Long Papers), pp 139–151

  17. Zhang J, Hashimoto K, Wu C-S, Wang Y, Philip SY, Socher R, Xiong C (2020) Find or classify? dual strategy for slot-value predictions on multi-domain dialog state tracking. In: Proceedings of the ninth joint conference on lexical and computational semantics, pp 154–167

  18. Wu C-S, Madotto A, Hosseini-Asl E, Xiong C, Socher R, Fung P (2019) Transferable multi-domain state generator for task-oriented dialogue systems. In: Proceedings of the 57th annual meeting of the Association for Computational Linguistics, pp 808–819

  19. Huang Y, Feng J, Hu M, Wu X, Du X, Ma S (2020) Meta-reinforced multi-domain state generator for dialogue systems. In: Proceedings of the 58th annual meeting of the Association for Computational Linguistics, pp 7109–7118

  20. Zhao J, Mahdieh M, Zhang Y, Cao Y, Wu Y (2021) Effective sequence-to-sequence dialogue state tracking. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 7486–7493

  21. Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y (2018) Graph attention networks. In: International conference on learning representations

  22. Eric M, Goel R, Paul S, Sethi A, Agarwal S, Gao S, Kumar A, Goyal A, Ku P, Hakkani-Tur D (2020) Multiwoz 2.1: a consolidated multi-domain dialogue dataset with state corrections and state tracking baselines. In: Proceedings of the 12th language resources and evaluation conference, pp 422–428

  23. Zang X, Rastogi A, Sunkara S, Gupta R, Zhang J, Chen J (2020) Multiwoz 2.2: a dialogue dataset with additional annotation corrections and state tracking baselines. In: Proceedings of the 2nd workshop on natural language processing for conversational AI, pp 109–117

  24. Rastogi A, Zang X, Sunkara S, Gupta R, Khaitan P (2020) Towards scalable multi-domain conversational agents: the schema-guided dialogue dataset. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 8689–8696

  25. Henderson M, Thomson B, Young S (2014) Word-based dialog state tracking with recurrent neural networks. In: Proceedings of the 15th annual meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL), pp 292–299

  26. Williams JD, Raux A, Henderson M (2016) The dialog state tracking challenge series: a review. Dialogue Discourse 7(3):4–33

    Article  Google Scholar 

  27. Lee H, Lee J, Kim T-Y(2019) Sumbt: slot-utterance matching for universal and scalable belief tracking. In: Proceedings of the 57th annual meeting of the Association for Computational Linguistics, pp 5478–5483

  28. Xu P, Hu Q (2018) An end-to-end approach for handling unknown slot values in dialogue state tracking. In: Proceedings of the 56th annual meeting of the Association for Computational Linguistics, vol 1 (Long Papers), pp 1448–1457

  29. Gao S, Sethi A, Agarwal S, Chung T, Hakkani-Tur D (2019) Dialog state tracking: a neural reading comprehension approach. In: Proceedings of the 20th annual SIGdial Meeting on Discourse and Dialogue, pp 264–273

  30. Ham D, Lee J-G, Jang Y, Kim K-E (2020) End-to-end neural pipeline for goal-oriented dialogue systems using gpt-2. In: Proceedings of the 58th annual meeting of the Association for Computational Linguistics, pp 583–592

  31. Madotto A, Lin Z, Zhou Z, Moon S, Crook PA, Liu B, Yu Z, Cho E, Fung P, Wang Z (2021) Continual learning in task-oriented dialogue systems. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 7452–7467

  32. Aksu T, Liu Z, Chen NF, Kan M-Y (2021) N-shot learning for augmenting task-oriented dialogue state tracking. arXiv preprint arXiv:2103.00293

  33. Yang P, Huang H-Y, Mao X-L (2021) Comprehensive study: how the context information of different granularity affects dialogue state tracking? In: Proceedings of the 59th annual meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, vol 1 (Long Papers), pp 2481–2491

  34. Cho H, Sankar C, Lin C, Sadagopan KR, Shayandeh S, Celikyilmaz A, May J, Beirami A (2021) Checkdst: measuring real-world generalization of dialogue state tracking performance. arXiv preprint arXiv:2112.08321

  35. Dai Y, Li H, Li Y, Sun J, Huang F, Si L, Zhu X (2021) Preview, attend and review: Schema-aware curriculum learning for multi-domain dialogue state tracking. In: Proceedings of the 59th annual meeting of the Association for Computational Linguistics and the 11th international joint conference on natural language processing, vol 2 (Short Papers), pp 879–885

  36. Devlin J, Chang M-W, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805

  37. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Process Syst 30

  38. Romera-Paredes B, Torr P (2015) An embarrassingly simple approach to zero-shot learning. In: International conference on machine learning. PMLR, pp 2152–2161

  39. Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980

  40. Noroozi V, Zhang Y, Bakhturina E, Kornuta T (2020) A fast and robust bert-based dialogue state tracker for schema-guided dialogue dataset. arXiv preprint arXiv:2008.12335

  41. Duan W, Xuan J, Qiao M, Lu J (2022) Learning from the dark: boosting graph convolutional neural networks with diverse negative samples

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qichen Zhang.

Ethics declarations

Conflict of interest

No conflict of interest exits in the submission of this manuscript, and manuscript is approved by all authors for publication. I would like to declare on behalf of my co-authors that the work described was original research that has not been published previously, and not under consideration for publication elsewhere, in whole or in part. All the authors listed have approved the manuscript that is enclosed.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Q., Wang, S. & Li, J. Global-chronological graph interactive networks for multi-domain dialogue state tracking. Int. J. Mach. Learn. & Cyber. 14, 2607–2620 (2023). https://doi.org/10.1007/s13042-023-01785-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-023-01785-x

Keywords

Navigation