Regularized Neural User Model for Goal-Oriented Spoken Dialogue Systems

Serras, Manex; Torres, María Inés; del Pozo, Arantza

doi:10.1007/978-3-319-92108-2_24

Manex Serras³⁵,
María Inés Torres³⁶ &
Arantza del Pozo³⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 510))

1034 Accesses

Abstract

User simulation is widely used to generate artificial dialogues in order to train statistical spoken dialogue systems and perform evaluations. This paper presents a neural network approach for user modeling that exploits an encoder-decoder bidirectional architecture with a regularization layer for each dialogue act. In order to minimize the impact of data sparsity, the dialogue act space is compressed according to the user goal. Experiments on the Dialogue State Tracking Challenge 2 (DSTC2) dataset provide significant results at dialogue act and slot level predictions, outperforming previous neural user modeling approaches in terms of F1 score.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

End-to-End latent-variable task-oriented dialogue system with exact log-likelihood optimization

Article 07 June 2019

Understanding Dialogue for Human Communication

Notes

1.
http://camdial.org/~mh521/dstc/.

References

Asher N, Lascarides A (2001) Indirect speech acts. Synthese 128(1):183–228. https://doi.org/10.1023/A:1010340508140
Chandramohan S, Geist M, Lefevre F, Pietquin O (2011) User simulation in dialogue systems using inverse reinforcement learning. Interspeech 2011:1025–1028
Google Scholar
Cho K, van Merriënboer B, Bahdanau D, Bengio Y (2014) On the properties of neural machine translation: encoder–decoder approaches. Syntax Semant Struct Stat Transl, p 103
Google Scholar
Core MG, Allen J (1997) Coding dialogs with the damsl annotation scheme. In: AAAI fall symposium on communicative action in humans and machines, Boston, MA, vol 56
Google Scholar
Cuayáhuitl H, Renals S, Lemon O, Shimodaira H (2005) Human-computer dialogue simulation using hidden markov models. In: 2005 IEEE workshop on automatic speech recognition and understanding. IEEE, pp 290–295
Google Scholar
Eckert W, Levin E, Pieraccini R (1997) User modeling for spoken dialogue system evaluation. In: Proceedings of the IEEE workshop on automatic speech recognition and understanding, 1997. IEEE, pp 80–87
Google Scholar
Hancher M (1979) The classification of cooperative illocutionary acts. Lang Soc, pp 1–14
Google Scholar
Henderson M, Thomson B, Williams J (2013) Dialog state tracking challenge 2 and 3 handbook. http://camdial.org/mh521/dstc
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Article Google Scholar
Hurtado LF, Griol D, Sanchis E, Segarra E (2007) A statistical user simulation technique for the improvement of a spoken dialog system. Springer, Berlin, pp 743–752. https://doi.org/10.1007/978-3-540-76725-1_77
Kingma D, Ba J (2015) Adam: a method for stochastic optimization. In: Proceedings of the 3rd international conference on learning representations (ICLR), pp 1–13
Google Scholar
Layla EA, Jing H, Suleman K (2016) A sequence-to-sequence model for user simulation in spoken dialogue systems. In: Interspeech
Google Scholar
Levin E, Pieraccini R, Eckert W (2000) A stochastic model of human-machine interaction for learning dialog strategies. IEEE Trans Speech Audio Process 8(1):11–23
Article Google Scholar
Pietquin O (2005) A framework for unsupervised learning of dialogue strategies. Presses univ. de Louvain
Google Scholar
Pietquin O, Dutoit T (2006) A probabilistic framework for dialog simulation and optimal strategy learning. IEEE Trans Audio Speech Lang Process 14(2):589–599
Article Google Scholar
Quarteroni S, González M, Riccardi G, Varges S (2010) Combining user intention and error modeling for statistical dialog simulators. In: INTERSPEECH, pp 3022–3025
Google Scholar
Rieser V, Lemon O (2006) Cluster-based user simulations for learning dialogue strategies. In: Interspeech
Google Scholar
Schatzmann J, Thomson B, Weilhammer K, Ye H, Young S (2007) Agenda-based user simulation for bootstrapping a pomdp dialogue system. In: Human language technologies 2007: the conference of the north american chapter of the association for computational linguistics; companion volume, Short Papers. Association for Computational Linguistics, pp 149–152
Google Scholar
Schatzmann J, Weilhammer K, Stuttle M, Young S (2006) A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. Knowl Eng Rev 21(2):97–126
Article Google Scholar
Scheffler K, Young S (2000) Probabilistic simulation of human-machine dialogues. In: Proceedings of the 2000 IEEE international conference on acoustics, speech, and signal processing, 2000, ICASSP’00, vol 2. IEEE, pp II1217–II1220
Google Scholar

Download references

Acknowledgements

This work has been partially funded by the Spanish Minister of Science under grants TIN2014-54288-C4-4-R and TIN2017-85854-C4-3-R and by the EU H2020 EMPATHIC project grant number 769872.

Author information

Authors and Affiliations

Speech and Natural Language Technologies, Vicomtech Research centre, Donostia-San Sebastian, Spain
Manex Serras & Arantza del Pozo
Speech Interactive Research Group, Universidad del País Vasco UPV/EHU, Leioa, Spain
María Inés Torres

Authors

Manex Serras
View author publications
You can also search for this author in PubMed Google Scholar
María Inés Torres
View author publications
You can also search for this author in PubMed Google Scholar
Arantza del Pozo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to María Inés Torres .

Editor information

Editors and Affiliations

Language Technologies Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
Maxine Eskenazi
LIMSI-CNRS, Sorbonne University, Paris, France
Laurence Devillers
LIMSI-CNRS, Paris-Saclay University, Orsay, France
Joseph Mariani

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Serras, M., Torres, M.I., del Pozo, A. (2019). Regularized Neural User Model for Goal-Oriented Spoken Dialogue Systems. In: Eskenazi, M., Devillers, L., Mariani, J. (eds) Advanced Social Interaction with Agents . Lecture Notes in Electrical Engineering, vol 510. Springer, Cham. https://doi.org/10.1007/978-3-319-92108-2_24

Download citation

DOI: https://doi.org/10.1007/978-3-319-92108-2_24
Published: 02 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92107-5
Online ISBN: 978-3-319-92108-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Regularized Neural User Model for Goal-Oriented Spoken Dialogue Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

End-to-End latent-variable task-oriented dialogue system with exact log-likelihood optimization

Understanding Dialogue for Human Communication

Understanding Dialogue for Human Communication

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us