Policy optimization of dialogue management in spoken dialogue system for out-of-domain utterances

Abstract:

This paper addresses the policy optimization of a dialogue management (DM) scheme based on partially observable Markov decision processes (POMDP), designed for processing out-of-domain (OOD) utterances in a spoken dialogue system. First, a POMDP-based DM model for OOD utterances is proposed, together with details of its principal elements. Then, joint state transition exploration and dialogue policy optimization are performed in batch. The value iteration method from the reinforcement learning framework is employed to optimize the dialogue policy. The approach is tested through interaction with users in a Chinese restricted-domain dialogue system that acts as a mobile phone recommendation assistant. Evaluation results show that a usable policy can be learnt in just a few hundred dialogues, and that the optimized policy converges to a good dialogue reward.
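
As a rough illustration of the value iteration step mentioned in the abstract, the sketch below runs value iteration on a tiny, fully observable MDP. It is not the paper's POMDP model: the three states, two actions, transition probabilities, rewards, and discount factor are all hypothetical, chosen only to show how a policy can be extracted once the value function has converged.

```python
# Minimal value iteration sketch on a toy, fully observable MDP.
# Assumption: all states, actions, probabilities and rewards are invented
# for illustration; the paper works with a POMDP over belief states.

def q_value(P, R, V, s, a, gamma):
    """Expected return of taking action a in state s, then following V."""
    return R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a])

def value_iteration(P, R, gamma=0.95, tol=1e-8, max_iter=10_000):
    """P[s][a] is a list of (probability, next_state); R[s][a] is a reward."""
    V = [0.0] * len(P)
    for _ in range(max_iter):
        delta = 0.0
        for s in range(len(P)):
            best = max(q_value(P, R, V, s, a, gamma) for a in range(len(P[s])))
            delta = max(delta, abs(best - V[s]))
            V[s] = best  # in-place (Gauss-Seidel style) update
        if delta < tol:
            break
    # Greedy policy with respect to the converged value function.
    policy = [
        max(range(len(P[s])), key=lambda a: q_value(P, R, V, s, a, gamma))
        for s in range(len(P))
    ]
    return V, policy

# Hypothetical states: 0 = in-domain request, 1 = OOD utterance, 2 = finished.
# Hypothetical actions: 0 = recommend a phone, 1 = redirect / ask to clarify.
P = [
    [[(0.8, 2), (0.2, 1)], [(1.0, 0)]],            # from state 0
    [[(0.9, 1), (0.1, 0)], [(0.7, 0), (0.3, 1)]],  # from state 1
    [[(1.0, 2)], [(1.0, 2)]],                      # state 2 is absorbing
]
R = [
    [10.0, -1.0],
    [-5.0, -1.0],
    [0.0, 0.0],
]

V, policy = value_iteration(P, R)
print("values:", [round(v, 3) for v in V])
print("policy:", policy)
```

In this toy setting the greedy policy recommends when the request is in-domain and redirects when the utterance is out-of-domain, which mirrors the intuition behind handling OOD utterances with a learned policy.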

Published in: 2016 International Conference on Asian Language Processing (IALP)

Date of Conference: 21-23 November 2016

Date Added to IEEE Xplore: 13 March 2017

DOI: 10.1109/IALP.2016.7875923

Conference Location: Tainan, Taiwan