Conferences >2019 IEEE 58th Conference on ...

Stochastic Subgradient Methods for Dynamic Programming in Continuous State and Action Spaces

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

In this paper, we propose a numerical method for dynamic programming in continuous state and action spaces. We first approximate the Bellman operator by using a convex op...Show More

Metadata

Abstract:

In this paper, we propose a numerical method for dynamic programming in continuous state and action spaces. We first approximate the Bellman operator by using a convex optimization problem, which has many constraints. This convex program is then solved using stochastic subgradient descent. To avoid the full projection onto the high-dimensional feasible set, we develop a novel algorithm that samples, in a coordinated fashion, a mini-batch for a subgradient and another for projection. We show several salient properties of this algorithm, including convergence, and a reduction in the feasibility error and in the variance of the stochastic subgradient.

Published in: 2019 IEEE 58th Conference on Decision and Control (CDC)

Date of Conference: 11-13 December 2019

Date Added to IEEE Xplore: 12 March 2020

ISBN Information:

ISSN Information:

DOI: 10.1109/CDC40024.2019.9028854

Conference Location: Nice, France

Contents

References is not available for this document.

Stochastic Subgradient Methods for Dynamic Programming in Continuous State and Action Spaces

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Stochastic Subgradient Methods for Dynamic Programming in Continuous State and Action Spaces

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?