Abstract
Dialogue state tracking (DST) provides necessary information for the policy learning module to decide the next action according to the historical state and the utterance related details of the dialogue, so robust dialogue state tracking is a core part of high-quality task-oriented dialogue systems. The tasks addressed by dialogue state tracking are becoming increasingly complex, and due to the multi-domain switching in the dialogue relevant contexts, the model needs to have the ability to generate appropriate responses to accurately fit the domains, slots and slot values. However, efficient encoding of state information and fundamentally simulating the logical relationship between domains and slots to dynamically extract variation features of the multiple domains are still challenges for DST. In this paper, we propose a novel framework global-chronological graph interactive networks (GCGIN), which encodes both global and chronological information of the dialogue from dual perspectives, and leverages coupling and co-affine modules to interact with the above two different granularities information. Our model adopts the mask mechanism to improve anti-noise ability, and constructs the dynamic schema graph structure to dynamically model domain-slot and domain-value relations via dialogue chronological order to achieve cross-domain features learning and improve model scalability. Extensive experimental results on three benchmark datasets demonstrate that our proposed model outperforms existing mainstream methods, and comprehensive analysis validates the effectiveness of each component.




Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
MultiWOZ 2.1 [22]: https://github.com/budzianowski/multiwoz/tree/master/data MultiWOZ2.2 [23]: https://github.com/budzianowski/multiwoz SGD [24]: https://github.com/google-research-datasets/dstc8-schema-guided-dialogue.
References
Henderson M, Thomson B, Williams JD (2014) The second dialog state tracking challenge. In: Proceedings of the 15th annual meeting of the special interest group on discourse and dialogue (SIGDIAL), pp 263–272
Ni J, Young T, Pandelea V, Xue F, Adiga V, Cambria E (2021) Recent advances in deep learning based dialogue systems: a systematic survey. arXiv preprint arXiv:2105.04387
Mrkšić N, Séaghdha DÓ, Wen T-H, Thomson B, Young S (2017) Neural belief tracker: Data-driven dialogue state tracking. In: Proceedings of the 55th annual meeting of the Association for Computational Linguistics, vol 1 (Long Papers), pp 1777–1788
Chen L, Lv B, Wang C, Zhu S, Tan B, Yu K (2020) Schema-guided multi-domain dialogue state tracking with graph attention neural networks. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 7521–7528
Ye F, Manotumruksa J, Zhang Q, Li S, Yilmaz E (2021) Slot self-attentive dialogue state tracking. In: Proceedings of the web conference 2021, pp 1598–1608
Hosseini-Asl E, McCann B, Wu C-S, Yavuz S, Socher R (2020) A simple language model for task-oriented dialogue. Adv Neural Inf Process Syst 33:20179–20191
Heck M, van Niekerk C, Lubis N, Geishauser C, Lin H-C, Moresi M, Gasic M (2020) Trippy: a triple copy strategy for value independent neural dialog state tracking. In: Proceedings of the 21th annual meeting of the special interest group on discourse and dialogue, pp 35–44
Kim S, Yang S, Kim G, Lee S-W (2020) Efficient dialogue state tracking by selectively overwriting memory. In: Proceedings of the 58th annual meeting of the Association for Computational Linguistics, pp 567–582
Ramadan O, Budzianowski P, Gasic M (2018) Large-scale multi-domain belief tracking with knowledge sharing. In: Proceedings of the 56th Annual meeting of the Association for Computational Linguistics, vol 2 (Short Papers), pp 432–437
Chen J, Zhang R, Mao Y, Xu J (2020) Parallel interactive networks for multi-domain dialogue state generation. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 1921–1931
Feng Y, Wang Y, Li H (2021) A sequence-to-sequence approach to dialogue state tracking. In: Proceedings of the 59th annual meeting of the Association for Computational Linguistics and the 11th international joint conference on natural language processing, vol 1 (Long Papers), pp 1714–1725
Ren L, Ni J, McAuley J (2019) Scalable and accurate dialogue state tracking via hierarchical sequence generation. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 1876–1885
Chen Z, Chen L, Xu Z, Zhao Y, Zhu S, Yu K (2020) Credit: coarse-to-fine sequence generation for dialogue state tracking. arXiv preprint arXiv:2009.10435
Tian X, Huang L, Lin Y, Bao S, He H, Yang Y, Wu H, Wang F, Sun S (2021) Amendable generation for dialogue state tracking. In: Proceedings of the 3rd workshop on natural language processing for conversational AI, pp 80–92
Lee C-H, Cheng H, Ostendorf M (2021) Dialogue state tracking with a language model using schema-driven prompting. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 4937–4949
Guo J, Shuang K, Li J, Wang Z (2021) Dual slot selector via local reliability verification for dialogue state tracking. In: Proceedings of the 59th annual meeting of the Association for Computational Linguistics and the 11th international joint conference on natural language processing, vol 1 (Long Papers), pp 139–151
Zhang J, Hashimoto K, Wu C-S, Wang Y, Philip SY, Socher R, Xiong C (2020) Find or classify? dual strategy for slot-value predictions on multi-domain dialog state tracking. In: Proceedings of the ninth joint conference on lexical and computational semantics, pp 154–167
Wu C-S, Madotto A, Hosseini-Asl E, Xiong C, Socher R, Fung P (2019) Transferable multi-domain state generator for task-oriented dialogue systems. In: Proceedings of the 57th annual meeting of the Association for Computational Linguistics, pp 808–819
Huang Y, Feng J, Hu M, Wu X, Du X, Ma S (2020) Meta-reinforced multi-domain state generator for dialogue systems. In: Proceedings of the 58th annual meeting of the Association for Computational Linguistics, pp 7109–7118
Zhao J, Mahdieh M, Zhang Y, Cao Y, Wu Y (2021) Effective sequence-to-sequence dialogue state tracking. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 7486–7493
Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y (2018) Graph attention networks. In: International conference on learning representations
Eric M, Goel R, Paul S, Sethi A, Agarwal S, Gao S, Kumar A, Goyal A, Ku P, Hakkani-Tur D (2020) Multiwoz 2.1: a consolidated multi-domain dialogue dataset with state corrections and state tracking baselines. In: Proceedings of the 12th language resources and evaluation conference, pp 422–428
Zang X, Rastogi A, Sunkara S, Gupta R, Zhang J, Chen J (2020) Multiwoz 2.2: a dialogue dataset with additional annotation corrections and state tracking baselines. In: Proceedings of the 2nd workshop on natural language processing for conversational AI, pp 109–117
Rastogi A, Zang X, Sunkara S, Gupta R, Khaitan P (2020) Towards scalable multi-domain conversational agents: the schema-guided dialogue dataset. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 8689–8696
Henderson M, Thomson B, Young S (2014) Word-based dialog state tracking with recurrent neural networks. In: Proceedings of the 15th annual meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL), pp 292–299
Williams JD, Raux A, Henderson M (2016) The dialog state tracking challenge series: a review. Dialogue Discourse 7(3):4–33
Lee H, Lee J, Kim T-Y(2019) Sumbt: slot-utterance matching for universal and scalable belief tracking. In: Proceedings of the 57th annual meeting of the Association for Computational Linguistics, pp 5478–5483
Xu P, Hu Q (2018) An end-to-end approach for handling unknown slot values in dialogue state tracking. In: Proceedings of the 56th annual meeting of the Association for Computational Linguistics, vol 1 (Long Papers), pp 1448–1457
Gao S, Sethi A, Agarwal S, Chung T, Hakkani-Tur D (2019) Dialog state tracking: a neural reading comprehension approach. In: Proceedings of the 20th annual SIGdial Meeting on Discourse and Dialogue, pp 264–273
Ham D, Lee J-G, Jang Y, Kim K-E (2020) End-to-end neural pipeline for goal-oriented dialogue systems using gpt-2. In: Proceedings of the 58th annual meeting of the Association for Computational Linguistics, pp 583–592
Madotto A, Lin Z, Zhou Z, Moon S, Crook PA, Liu B, Yu Z, Cho E, Fung P, Wang Z (2021) Continual learning in task-oriented dialogue systems. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 7452–7467
Aksu T, Liu Z, Chen NF, Kan M-Y (2021) N-shot learning for augmenting task-oriented dialogue state tracking. arXiv preprint arXiv:2103.00293
Yang P, Huang H-Y, Mao X-L (2021) Comprehensive study: how the context information of different granularity affects dialogue state tracking? In: Proceedings of the 59th annual meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, vol 1 (Long Papers), pp 2481–2491
Cho H, Sankar C, Lin C, Sadagopan KR, Shayandeh S, Celikyilmaz A, May J, Beirami A (2021) Checkdst: measuring real-world generalization of dialogue state tracking performance. arXiv preprint arXiv:2112.08321
Dai Y, Li H, Li Y, Sun J, Huang F, Si L, Zhu X (2021) Preview, attend and review: Schema-aware curriculum learning for multi-domain dialogue state tracking. In: Proceedings of the 59th annual meeting of the Association for Computational Linguistics and the 11th international joint conference on natural language processing, vol 2 (Short Papers), pp 879–885
Devlin J, Chang M-W, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Process Syst 30
Romera-Paredes B, Torr P (2015) An embarrassingly simple approach to zero-shot learning. In: International conference on machine learning. PMLR, pp 2152–2161
Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980
Noroozi V, Zhang Y, Bakhturina E, Kornuta T (2020) A fast and robust bert-based dialogue state tracker for schema-guided dialogue dataset. arXiv preprint arXiv:2008.12335
Duan W, Xuan J, Qiao M, Lu J (2022) Learning from the dark: boosting graph convolutional neural networks with diverse negative samples
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
No conflict of interest exits in the submission of this manuscript, and manuscript is approved by all authors for publication. I would like to declare on behalf of my co-authors that the work described was original research that has not been published previously, and not under consideration for publication elsewhere, in whole or in part. All the authors listed have approved the manuscript that is enclosed.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, Q., Wang, S. & Li, J. Global-chronological graph interactive networks for multi-domain dialogue state tracking. Int. J. Mach. Learn. & Cyber. 14, 2607–2620 (2023). https://doi.org/10.1007/s13042-023-01785-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-023-01785-x