skip to main content
research-article

Effective handling of dialogue state in the hidden information state POMDP-based dialogue manager

Published:06 June 2011Publication History
Skip Abstract Section

Abstract

Effective dialogue management is critically dependent on the information that is encoded in the dialogue state. In order to deploy reinforcement learning for policy optimization, dialogue must be modeled as a Markov Decision Process. This requires that the dialogue state must encode all relevent information obtained during the dialogue prior to that state. This can be achieved by combining the user goal, the dialogue history, and the last user action to form the dialogue state. In addition, to gain robustness to input errors, dialogue must be modeled as a Partially Observable Markov Decision Process (POMDP) and hence, a distribution over all possible states must be maintained at every dialogue turn. This poses a potential computational limitation since there can be a very large number of dialogue states. The Hidden Information State model provides a principled way of ensuring tractability in a POMDP-based dialogue model. The key feature of this model is the grouping of user goals into partitions that are dynamically built during the dialogue. In this article, we extend this model further to incorporate the notion of complements. This allows for a more complex user goal to be represented, and it enables an effective pruning technique to be implemented that preserves the overall system performance within a limited computational resource more effectively than existing approaches.

References

  1. Kim, K., Lee, C., Jung, S., and Lee, G. G. 2008. A frame-based probabilistic framework for spoken dialog management using dialog examples. In Proceedings of the 9th SIGdial Workshop on Discourse and Dialogue. Association for Computational Linguistics, Morristown, NJ, 120--127. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Levin, E., Pieraccini, R., and Eckert, W. 1998. Using Markov decision processes for learning dialogue strategies. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing.Google ScholarGoogle Scholar
  3. Sutton, R. and Barto, A. 1998. Reinforcement Learning: An Introduction. Adaptive Computation and Machine Learning. MIT Press, Cambridge, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Thomson, B. 2009. Statistical methods for spoken dialogue management. Ph.D. thesis, University of Cambridge.Google ScholarGoogle Scholar
  5. Thomson, B. and Young, S. 2010. Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems. Comput. Speech Lang. 24, 562--568. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Thomson, B., Yu, K., Gašić, M., Keizer, S., Mairesse, F., Schatzmann, J., and Young, S. 2008. Evaluating semantic-level confidence scores with multiple hypotheses. In Proceedings of Interspeech.Google ScholarGoogle Scholar
  7. Williams, J. 2010. Incremental partition recombiantion for efficient tracking of multiple dialogue states. In Proceedings of the International Conference on Acoustics Speech and Signal Processing.Google ScholarGoogle Scholar
  8. Williams, J., Poupart, P., and Young, S. 2005. Factored partially observable Markov decision processes for dialogue management. In Proceedings of the 4th Workshop on Knowledge and Reasoning in Practical Dialogue Systems.Google ScholarGoogle Scholar
  9. Young, S., Gašić, M., Keizer, S., Mairesse, F., Schatzmann, J., Thomson, B., and Yu, K. 2010. The Hidden Information State Model: A practical framework for POMDP-based spoken dialogue management. Comput. Speech Lang. 24, 2, 150--174. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Effective handling of dialogue state in the hidden information state POMDP-based dialogue manager

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Speech and Language Processing
        ACM Transactions on Speech and Language Processing   Volume 7, Issue 3
        May 2011
        155 pages
        ISSN:1550-4875
        EISSN:1550-4883
        DOI:10.1145/1966407
        Issue’s Table of Contents

        Copyright © 2011 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 6 June 2011
        • Revised: 1 November 2010
        • Accepted: 1 November 2010
        • Received: 1 July 2010
        Published in tslp Volume 7, Issue 3

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader