
Offline Reinforcement Learning With Reverse Diffusion Guide Policy


Abstract:

Offline reinforcement learning (ORL) learns a policy from a static dataset without further interaction with the environment, which holds significant promise for industrial control systems characterized by inefficient online interaction and inherent safety concerns. To mitigate the extrapolation error induced by distribution shift, it is essential for ORL to constrain the learned policy to actions within the support set of the behavior policy. Existing methods often fail to represent the behavior policy properly and tend to prefer actions with higher densities within the support set, resulting in a suboptimal learned policy. This article proposes a novel ORL method that represents the behavior policy with a diffusion model and trains a reverse diffusion guide policy to instruct the pretrained diffusion model in generating actions. The diffusion model exhibits stable training and strong distributional expressiveness, and the reverse diffusion guide policy can effectively explore the entire support set to help generate the optimal action. For low-quality datasets, a trainable perturbation can further be added to the generated action to help the learned policy escape the performance limitation of the behavior policy. Experimental results on the D4RL Gym-MuJoCo benchmark demonstrate the effectiveness of the proposed method, which surpasses several state-of-the-art ORL methods.
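To make the abstract's pipeline concrete, below is a minimal sketch, based only on the description above, of guided reverse-diffusion action sampling with an optional perturbation. Everything in it is an assumption for illustration: the class and function names (EpsilonNet, Guide, guided_sample), the network architectures, the timestep schedule, and the guide/perturbation scales are hypothetical and are not the paper's implementation.

```python
import torch
import torch.nn as nn

class EpsilonNet(nn.Module):
    """Stand-in for a pretrained behavior-policy diffusion model eps(a_t, s, t)."""
    def __init__(self, state_dim, action_dim, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim + 1, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, action_dim),
        )

    def forward(self, a_t, s, t):
        t_emb = t.float().unsqueeze(-1) / 100.0  # crude timestep embedding
        return self.net(torch.cat([a_t, s, t_emb], dim=-1))

class Guide(nn.Module):
    """Trainable guide that nudges each reverse step toward high-value actions
    inside the behavior-policy support (assumed form)."""
    def __init__(self, state_dim, action_dim, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim + 1, hidden), nn.ReLU(),
            nn.Linear(hidden, action_dim), nn.Tanh(),
        )

    def forward(self, a_t, s, t):
        t_emb = t.float().unsqueeze(-1) / 100.0
        return self.net(torch.cat([a_t, s, t_emb], dim=-1))

@torch.no_grad()
def guided_sample(eps_net, guide, s, action_dim, T=20, guide_scale=0.1,
                  perturb=None, perturb_scale=0.05):
    """Reverse (denoising) process: start from Gaussian noise, apply standard
    DDPM posterior-mean updates, steer each step with the guide, and optionally
    add a trainable perturbation to the final action."""
    batch = s.shape[0]
    betas = torch.linspace(1e-4, 2e-2, T)
    alphas = 1.0 - betas
    alpha_bars = torch.cumprod(alphas, dim=0)

    a = torch.randn(batch, action_dim)  # a_T ~ N(0, I)
    for i in reversed(range(T)):
        t = torch.full((batch,), i, dtype=torch.long)
        eps = eps_net(a, s, t)
        # standard DDPM posterior mean for step i
        mean = (a - betas[i] / torch.sqrt(1.0 - alpha_bars[i]) * eps) / torch.sqrt(alphas[i])
        # steer the reverse step with the guide policy
        mean = mean + guide_scale * guide(a, s, t)
        noise = torch.randn_like(a) if i > 0 else torch.zeros_like(a)
        a = mean + torch.sqrt(betas[i]) * noise

    if perturb is not None:
        # helps escape the behavior policy's performance ceiling on low-quality data
        a = a + perturb_scale * perturb(torch.cat([s, a], dim=-1))
    return a.clamp(-1.0, 1.0)

# Illustrative usage with made-up dimensions (e.g., a MuJoCo-like task)
state_dim, action_dim = 17, 6
eps_net, guide = EpsilonNet(state_dim, action_dim), Guide(state_dim, action_dim)
perturb = nn.Sequential(nn.Linear(state_dim + action_dim, 256), nn.ReLU(),
                        nn.Linear(256, action_dim), nn.Tanh())
actions = guided_sample(eps_net, guide, torch.randn(4, state_dim), action_dim, perturb=perturb)
```

In this sketch the guide shifts the denoising mean at every reverse step, so samples remain close to the behavior-policy support while drifting toward higher-value actions, and the perturbation network touches only the final denoised action; how the actual method trains the guide and perturbation is not specified in the abstract.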
Published in: IEEE Transactions on Industrial Informatics ( Volume: 20, Issue: 10, October 2024)
Page(s): 11785 - 11793
Date of Publication: 27 June 2024


