
Effective handling of dialogue state in the hidden information state POMDP-based dialogue manager

Authors:
Milica Gašić, University of Cambridge, Cambridge, UK
Steve Young, University of Cambridge, Cambridge, UK

ACM Transactions on Speech and Language Processing, Volume 7, Issue 3, Article No. 4, pp. 1–28. https://doi.org/10.1145/1966407.1966409

Published: 06 June 2011

Abstract

Effective dialogue management depends critically on the information encoded in the dialogue state. To deploy reinforcement learning for policy optimization, dialogue must be modeled as a Markov Decision Process. This requires that the dialogue state encode all relevant information obtained during the dialogue prior to that state, which can be achieved by combining the user goal, the dialogue history, and the last user action to form the dialogue state. In addition, to gain robustness to input errors, dialogue must be modeled as a Partially Observable Markov Decision Process (POMDP), and hence a distribution over all possible states must be maintained at every dialogue turn. This poses a potential computational limitation, since there can be a very large number of dialogue states. The Hidden Information State model provides a principled way of ensuring tractability in a POMDP-based dialogue model. The key feature of this model is the grouping of user goals into partitions that are built dynamically during the dialogue. In this article, we extend this model to incorporate the notion of complements. This allows a more complex user goal to be represented, and it enables an effective pruning technique that preserves overall system performance within limited computational resources more effectively than existing approaches.
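
To make the idea of partitions and complements concrete, the following is a minimal toy sketch and not the authors' algorithm. It assumes a hypothetical `Partition` type that records which slot values a group of user goals must have (`equals`) and must not have (`excludes`), a hypothetical `split` function that divides a partition's belief mass between a matching child and its complement using an assumed prior, and a generic Bayesian reweighting step `update`. All names, slot values, and probabilities are illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Partition:
    """Hypothetical group of user goals sharing the same slot constraints."""
    equals: tuple = ()    # ((slot, value), ...) every goal in the group has these
    excludes: tuple = ()  # ((slot, value), ...) complement: no goal has these


def split(belief, slot, value, prior):
    """Split each partition that does not yet constrain (slot, value).

    `prior` is an assumed probability that a goal in the parent partition
    has slot == value; the parent's mass is shared accordingly.
    """
    new_belief = {}
    for part, mass in belief.items():
        already = any(s == slot for s, _ in part.equals) or (slot, value) in part.excludes
        if already:
            new_belief[part] = new_belief.get(part, 0.0) + mass
            continue
        match = Partition(part.equals + ((slot, value),), part.excludes)
        rest = Partition(part.equals, part.excludes + ((slot, value),))
        new_belief[match] = mass * prior
        new_belief[rest] = mass * (1.0 - prior)
    return new_belief


def update(belief, likelihood):
    """Generic Bayes step: weight each partition by the likelihood, renormalise."""
    unnorm = {p: m * likelihood(p) for p, m in belief.items()}
    total = sum(unnorm.values())
    return {p: m / total for p, m in unnorm.items()}


# One root partition holds all user goals; the user then mentions food=chinese.
belief = {Partition(): 1.0}
belief = split(belief, "food", "chinese", prior=0.25)
# Assumed observation model: the input supports food=chinese with weight 0.8.
belief = update(belief, lambda p: 0.8 if ("food", "chinese") in p.equals else 0.2)
for part, mass in belief.items():
    print(part, round(mass, 3))
```

In this toy, the number of partitions grows only when the user's input distinguishes between goals, so the belief is kept over a handful of groups rather than over every possible goal. The complement partition keeps the remaining probability mass in a single entry.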

Index Terms

1. Computing methodologies
  1. Machine learning
2. Human-centered computing
  1. Human computer interaction (HCI)

Published in

ACM Transactions on Speech and Language Processing Volume 7, Issue 3
May 2011
155 pages
ISSN:1550-4875
EISSN:1550-4883
DOI:10.1145/1966407

Copyright © 2011 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 6 June 2011
- Revised: 1 November 2010
- Accepted: 1 November 2010
- Received: 1 July 2010
Author Tags
POMDP
Spoken dialogue systems
dialogue belief monitoring
dialogue modelling
dialogue state representation
reinforcement learning
Qualifiers
- research-article
- Research
- Refereed