Abstract
User Simulators are major tools that enable offline training of task-oriented dialogue systems. To efficiently utilize semantic dialog data and generate natural language utterances, user simulators based on neural network architectures are proposed. However, existing neural user simulators still rely on hand-crafted rules, which is difficult to ensure the effectiveness of feature extraction. This paper proposes the Graph Neural Net-based User Simulator (GUS), which constructs semantic graphs from the corpus and uses them to build Graph Convolutional Network (GCN) to extract feature vectors. We tested our model on examined public dataset and also made conversation with real human directly to verify the effectiveness. Experimental results show GUS significantly outperforms several state-of-the-art user simulators.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kreyssig, F., Casanueva, I., Budzianowski, P., Gasic, M.: Neural user simulation for corpus-based policy optimisation for spoken dialogue systems. In: Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue (2018)
Gasic, M., Young, S.: Gaussian processes for POMDP-based dialogue manager optimization. IEEE/ACM Trans. Audio, Speech, Lang. Process. 22(1), 28–40 (2014)
Zhou, J., et al.: Graph Neural Networks: A Review of Methods and Applications. arXiv preprint arXiv:1812.08434v3 (2019)
Schatzmann, J., Georgila, K., Young, S.: Quantitative evaluation of user simulation techniques for spoken dialogue systems. In: SIGdial6, pp. 45–54 (2005)
Eckert, W., Levin, E., Pieraccini, R.: User modeling for spoken dialogue system evaluation. In: 1997 IEEE Workshop on Automatic Speech Recoginition and Understanding, pp. 80–87 (1997)
Scheffler, K., Young, S.: Probabilistic simulation of human-machine dialogues. In: Speech, and Signal Processing, Acoustics (2000)
Pietquin, O., Dutoit, T.: A probabilistic framework for dialog simulation and optimal strategy learning. IEEE Trans. Audio Speech Lang. Process. 14(2), 589–599 (2006)
Georgila, K., Henderson, J., Lemon, O.: Learning user simulations for information state update dialogue systems. In: Ninth European Conference on Speech Communication and Technology (2005)
Chandramohan, S., Geist, M., Lefevre, F., Pietquin, O.: User simulation in dialogue systems using inverse reinforcement learning. In: Proceedings of the Twelfth Annual Conference of the International Speech Communication Association (2011)
Schatzmann, J., Thomson, B., Weilhammer, K., Ye, H., Young, S.: Agenda-based user simulation for bootstrapping a pomdp dialogue system. In: Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume, Short Papers, pp. 149–152 (2007)
Mrksic, N., Seaghdha, D.O., Wen, T.-H., Thomson, B., Young, S.: Neural belief tracker: data-driven dialogue state tracking. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol. 1, pp. 1777–1788 (2017)
Williams, J.D.: Evaluating user simulations with the Cramer-von Mises divergence. Speech Commun. 50(10), 829–846 (2008)
Crook, P., Marin, A.: Sequence to sequence modeling for user simulation in dialog systems. In: Proceedings of the 18th Annual Conference of the International Speech Communication Association (2017)
Li, X., Chen, Y.-N.., Li, L., Gao, J., Celikyilmaz, A.: End-to-end task-completion neural dialogue systems. In: Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (2017)
Serras, M., Torres, M.I., Pozo, A.: Regularized neural user model for goal oriented spoken dialogue systems. In: International Workshop on Spoken Dialogue Systems (2017)
Liu, B., Lane, I.: Iterative policy learning in end-to-end trainable task-oriented neural dialog models. In: 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU, pp. 482–489 (2017)
Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: Proceedings of the International Conference on Learning Representations, ICLR (2016)
Marcheggiani, D., Perez-Beltranchini, L.: Deep graph convolutional encoders for structured data to text generation. In: The 11th International Conference on Natural Language Generation, INLG (2018)
Marcheggiani, D., Titov, I.: Encoding sentences with graph convolutional networks for semantic role labeling. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP, pp. 1506–1515 (2017)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: Proceedings of the ICLR (2015)
Papineni, K., Roukos, S., Ward, T., Zhu, W.: BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311–318 (2002)
Acknowledgement
We thank the anonymous reviewers for their insightful comments on this paper. This work was supported by the NSFC (No. 61402403), Alibaba-Zhejiang University Joint Institute of Frontier Technologies, Chinese Knowledge Center for Engineering Sciences and Technology, and the Fundamental Research Funds for the Central Universities.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Nie, X., Lin, Z., Huang, X., Zhang, Y. (2019). Graph Neural Net-Based User Simulator. In: Sun, M., Huang, X., Ji, H., Liu, Z., Liu, Y. (eds) Chinese Computational Linguistics. CCL 2019. Lecture Notes in Computer Science(), vol 11856. Springer, Cham. https://doi.org/10.1007/978-3-030-32381-3_51
Download citation
DOI: https://doi.org/10.1007/978-3-030-32381-3_51
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32380-6
Online ISBN: 978-3-030-32381-3
eBook Packages: Computer ScienceComputer Science (R0)