
Agent self-assessment: Determining policy quality without execution


Abstract:

With the development of data-efficient reinforcement learning (RL) methods, a promising data-driven solution for the optimal control of complex technical systems has become available. Before applying RL to a technical system, one usually has to evaluate a policy to ensure that it operates the system safely and within the required performance bounds. In benchmark applications the system dynamics can be used directly to measure policy quality; in real applications, however, this may be too expensive or even impossible. Being unable to evaluate a policy without access to the actual system hinders the application of RL to autonomous controllers. As a first step toward agent self-assessment, this paper deals with discrete Markov decision processes (MDPs). We propose to use the value function along with its uncertainty to assess a policy's quality and show that, for an MDP estimated from observations, the value function alone can be misleading. We address this problem by determining the value function's uncertainty through uncertainty propagation and evaluate the approach on a number of benchmark applications.
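
The approach described in the abstract can be made concrete with a small sketch. The following Python code is an illustrative assumption, not the paper's exact formulation: it performs iterative policy evaluation on an MDP estimated from observations and propagates a first-order, diagonal (cross-covariances neglected) uncertainty estimate through each Bellman backup, in the spirit of the uncertainty propagation the abstract describes. All names and shapes (`evaluate_policy_with_uncertainty`, the `(S, A, S)` arrays) are hypothetical.

```python
import numpy as np

def evaluate_policy_with_uncertainty(P, sigma_P, R, sigma_R, policy,
                                     gamma=0.95, n_iter=1000):
    """Policy evaluation on an estimated discrete MDP that also propagates
    a diagonal uncertainty estimate alongside the value function.

    P, sigma_P : (S, A, S) transition probability estimates and their std. devs.
    R, sigma_R : (S, A, S) reward estimates and their std. devs.
    policy     : (S,) deterministic policy, policy[s] = chosen action.
    Returns V and sigma_V, both of shape (S,).
    """
    S = P.shape[0]
    s_idx = np.arange(S)
    Pp = P[s_idx, policy]                  # (S, S) transitions under the policy
    Rp = R[s_idx, policy]                  # (S, S) rewards under the policy
    var_P = sigma_P[s_idx, policy] ** 2
    var_R = sigma_R[s_idx, policy] ** 2

    V = np.zeros(S)
    var_V = np.zeros(S)
    for _ in range(n_iter):
        target = Rp + gamma * V[None, :]   # r(s, s') + gamma * V(s')
        # First-order propagation: each input's variance is weighted by the
        # squared partial derivative of the Bellman backup w.r.t. that input.
        var_V = ((gamma * Pp) ** 2 * var_V[None, :]).sum(axis=1) \
              + (target ** 2 * var_P).sum(axis=1) \
              + (Pp ** 2 * var_R).sum(axis=1)
        V = (Pp * target).sum(axis=1)      # standard Bellman backup for V
    return V, np.sqrt(var_V)

if __name__ == "__main__":
    # Hypothetical usage: estimate a small random MDP from 20 samples per
    # state-action pair; sigma_P follows from the multinomial distribution.
    rng = np.random.default_rng(0)
    S, A, n = 5, 2, 20
    counts = rng.multinomial(n, np.ones(S) / S, size=(S, A))   # (S, A, S)
    P = counts / n
    sigma_P = np.sqrt(P * (1 - P) / n)
    R = rng.normal(size=(S, A, S))
    sigma_R = np.full((S, A, S), 0.1)
    policy = np.zeros(S, dtype=int)
    V, sigma_V = evaluate_policy_with_uncertainty(P, sigma_P, R, sigma_R, policy)
    print(V - 2.0 * sigma_V)   # pessimistic quality score, e.g. V - 2 * sigma_V
```

A pessimistic score such as V - ξ·σ(V) then lets one compare or reject policies without executing them on the real system, which is one plausible reading of the quality measure the abstract proposes.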
Date of Conference: 11-15 April 2011
Date Added to IEEE Xplore: 28 July 2011
Conference Location: Paris, France
