An Adaptive Dialogue System with Online Dialogue Policy Learning

Papangelis, Alexandros; Kouroupas, Nikolaos; Karkaletsis, Vangelis; Makedon, Fillia

doi:10.1007/978-3-642-30448-4_41

An Adaptive Dialogue System with Online Dialogue Policy Learning

Alexandros Papangelis^22,23,
Nikolaos Kouroupas²⁴,
Vangelis Karkaletsis²² &
…
Fillia Makedon²³

Conference paper

1665 Accesses
5 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7297))

Abstract

In this work we present an architecture for Adaptive Dialogue Systems and a novel system that serves as a Museum Guide. It employs several online Reinforcement Learning (RL) techniques to achieve adaptation to the environment as well as to different users. Not many systems have been proposed that apply online RL methods and this is one of the first to fully describe an Adaptive Dialogue System with online dialogue policy learning. We evaluate our system through user simulations and compare the several implemented algorithms on a simple scenario.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bohus, D., Rudnicky, A.I.: The RavenClaw dialog management framework: Architecture and systems. Computer Speech & Language 23(3), 332–361 (2009)
Article Google Scholar
Cuayáhuitl, H., Renals, S., Lemon, O., Shimodaira, H.: Evaluation of a hierarchical reinforcement learning spoken dialogue system. Comput. Speech Lang. 24, 395–429 (2010)
Article Google Scholar
Gašić, M., Jurčíček, F., Thomson, B., Yu, K., Young, S.: On-line policy optimisation of spoken dialogue systems via live interaction with human subjects. In: Automatic Speech Recognition and Understanding, Hawaii (2011)
Google Scholar
Jurčíček, F., Thomson, B., Keizer, S., Mairesse, F., Gašić, M., Yu, K., Young, S.: Natural Belief-Critic: A Reinforcement Algorithm for Parameter Estimation in Statistical Spoken Dialogue Systems. International Speech Communication Association 7, 1–26 (2010)
Google Scholar
Konstantopoulos, S.: An Embodied Dialogue System with Personality and Emotions. In: Proceedings of the 2010 Workshop on Companionable Dialogue Systems, ACL 2010, pp. 31–36 (2010)
Google Scholar
Peng, J., Williams, R.: Incremental multi-step Q-Learning. Machine Learning, 283–290 (1996)
Google Scholar
Pietquin, O., Geist, M., Chandramohan, S., Frezza-Buet, H.: Sample-Effcient Batch Reinforcement Learning for Dialogue Management Optimization. ACM Transactions on Speech and Language Processing 7(3), No. 7 (2011)
Google Scholar
Pietquin, O., Hastie, H.: A survey on metrics for the evaluation of user simulations. The Knowledge Engineering Review (2011) (to appear)
Google Scholar
Rieser, V., Lemon, O.: Natural Language Generation as Planning Under Uncertainty for Spoken Dialogue Systems. In: EACL 2009, pp. 683–691 (2009)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. The MIT Press, Cambridge (1998)
Google Scholar
Szepesvári, C.: Algorithms for Reinforcement Learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, vol. 4(1), pp. 1–103. Morgan & Claypool Publishers (2010)
Google Scholar
Watkins, C.J.C.H.: Learning from delayed rewards, PhD Thesis, University of Cambridge, England (1989)
Google Scholar
Wiering, M.A., Van Hasselt, H.: The QV family compared to other reinforcement learning algorithms. In: IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, pp. 101–108 (2009)
Google Scholar
Young, S., Gašić, M., Keizer, S., Mairesse, F., Schatzmann, J., Thomson, B., Yu, K.: The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management. Computer Speech & Language 24(2), 150–174 (2010)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Informatics and Telecommunications, National Centre for Scientific Research “Demokritos”, Greece
Alexandros Papangelis & Vangelis Karkaletsis
Department of Computer Science and Engineering, University of Texas at Arlington, USA
Alexandros Papangelis & Fillia Makedon
Department of Informatics, University of Piraeus, Greece
Nikolaos Kouroupas

Authors

Alexandros Papangelis
View author publications
You can also search for this author in PubMed Google Scholar
Nikolaos Kouroupas
View author publications
You can also search for this author in PubMed Google Scholar
Vangelis Karkaletsis
View author publications
You can also search for this author in PubMed Google Scholar
Fillia Makedon
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Biomedical Informatics, University of Central Greece, 2-4 Passiopoulou Street, 35100, Lamia, Greece
Ilias Maglogiannis
Department of Computer Science and Biomedical Informatics, University of Central Greece, 2-4 Papassiopoulou Street, 35100, Lamia, Greece
Vassilis Plagianakos
Department of Informatics, Aristotle University of Thessaloniki, 54124, Thessaloniki, Greece
Ioannis Vlahavas

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Papangelis, A., Kouroupas, N., Karkaletsis, V., Makedon, F. (2012). An Adaptive Dialogue System with Online Dialogue Policy Learning. In: Maglogiannis, I., Plagianakos, V., Vlahavas, I. (eds) Artificial Intelligence: Theories and Applications. SETN 2012. Lecture Notes in Computer Science(), vol 7297. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30448-4_41

Download citation

DOI: https://doi.org/10.1007/978-3-642-30448-4_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30447-7
Online ISBN: 978-3-642-30448-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics