Fast Reinforcement Learning of Dialogue Policies Using Stable Function Approximation

Denecke, Matthias; Dohsaka, Kohji; Nakano, Mikio

doi:10.1007/978-3-540-30211-7_1

Matthias Denecke²²,
Kohji Dohsaka²² &
Mikio Nakano²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3248))

Included in the following conference series:

International Conference on Natural Language Processing

1666 Accesses
4 Citations

Abstract

We propose a method to speed up reinforcement learning of policies for spoken dialogue systems. This is achieved by combining a coarse grained abstract representation of states and actions with learning only in frequently visited states. The value of unsampled states is approximated by a linear interpolation of known states. Experiments show that the proposed method effectively optimizes dialogue strategies for frequently visited dialogue states.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

SimpleDS: A Simple Deep Reinforcement Learning Dialogue System

Deep Reinforcement Learning for On-line Dialogue State Tracking

A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning

Article Open access 07 January 2023

References

Singh, S., Litman, D., Kearns, M., Walker, M.: Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System. Journal of Artificial Intelligence Research 16, 105–133 (2002)
Google Scholar
Gordon, G.J.: Stable function approximation in dynamic programming. In: Proceedings of the Twelfth International Conference on Machine Learning (1995)
Google Scholar
Levin, E., Pieraccini, R.: A Stochastic Model of Human Computer Interaction for Learning Dialog Strategies. In: Proceedings of Eurospeech, Rhodos, Greece (1997)
Google Scholar
Kaelbling, L.P., Littman, M.L., Moore, A.W.: Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)
Google Scholar
Walker, M., Fromer, J., Narayanan, S.: Learning optimal dialogue strategies: A case study of a spoken dialogue agent for email. In: Proceedings of ACL/COLING 1998 (1998)
Google Scholar
Williams, J.D., Young, S.: Using Wizard-of-Oz Simulations to Bootstrap Reinforcement Learning Based Dialog Management Systems. In: Proceedings of the 4th SigDial Workshop on Discourse and Dialogue (2003)
Google Scholar
Roy, N., Pineau, J., Thrun, S.: Spoken Dialog Management for Robots. In: Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics (2000)
Google Scholar
Scheffler, K., Young, S.J.: Corpus-based dialogue simulation for automatic strategy learning and evaluation. In: Proceedings NAACL Workshop on Adaptation in Dialogue Systems, pp. 64–70 (2001)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning. MIT Press, Cambridge (1998)
Google Scholar
Goddeau, D., Pineau, J.: Fast Reinforcement Learning of Dialog Strategies. In: IEEE Conference on Acoustics, Speech and Signal Processing (ICASSP), Istanbul, Turkey (2000)
Google Scholar
Denecke, M.: Informational Characterization of Dialogue States. In: Proceedings of the 6th International Conference on Speech and Language Processing, Beijing, China (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Communication Science Laboratories, Nippon Telegraph and Telephone Corporation, 3-1 Morinosato Wakamiya, Atsugi Kanagawa, 243-0198
Matthias Denecke, Kohji Dohsaka & Mikio Nakano

Authors

Matthias Denecke
View author publications
You can also search for this author in PubMed Google Scholar
Kohji Dohsaka
View author publications
You can also search for this author in PubMed Google Scholar
Mikio Nakano
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Behavior Design Corporation, IV Science-Based Industrial Park Hsinchu, 2F, No.5, Industry E. Rd, Taiwan
Keh-Yih Su
University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, JST CREST, Honcho 4-1-8, Kawaguchi-shi,, 332-0012, Saitama,
Jun’ichi Tsujii
Pohang University of Science and Technology (POSTECH), AITrc, Republic of Korea
Jong-Hyeok Lee
Language Information Sciences Research Centre, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
Oi Yee Kwong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Denecke, M., Dohsaka, K., Nakano, M. (2005). Fast Reinforcement Learning of Dialogue Policies Using Stable Function Approximation. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science(), vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_1

Download citation

DOI: https://doi.org/10.1007/978-3-540-30211-7_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24475-2
Online ISBN: 978-3-540-30211-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Fast Reinforcement Learning of Dialogue Policies Using Stable Function Approximation

Abstract

Access this chapter

Preview

Similar content being viewed by others

SimpleDS: A Simple Deep Reinforcement Learning Dialogue System

Deep Reinforcement Learning for On-line Dialogue State Tracking

A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Fast Reinforcement Learning of Dialogue Policies Using Stable Function Approximation

Abstract

Access this chapter

Preview

Similar content being viewed by others

SimpleDS: A Simple Deep Reinforcement Learning Dialogue System

Deep Reinforcement Learning for On-line Dialogue State Tracking

A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation