Conferences >2017 IEEE 56th Annual Confere...

Dynamic programming for risk-aware sequential optimization

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

We consider the problem of minimizing a risk measure of the total cost of a Markov decision process (MDP), under the risk-aware MDPs paradigm. This model accounts for the...Show More

Metadata

Abstract:

We consider the problem of minimizing a risk measure of the total cost of a Markov decision process (MDP), under the risk-aware MDPs paradigm. This model accounts for the variation/spread/dispersion of the random cost in contrast to classical MDPs which are risk-neutral and emphasize expected cost. In this paper, we extend previous work on risk-aware MDPs by considering a wider class of risk measures which are amenable to dynamic programming. We develop solution methods for this class using grid search and convex approximation schemes, and show that the proposed methods produce the optimal policy. We conclude with numerical experiments which demonstrate the versatility and effectiveness of our approach.

Published in: 2017 IEEE 56th Annual Conference on Decision and Control (CDC)

Date of Conference: 12-15 December 2017

Date Added to IEEE Xplore: 22 January 2018

ISBN Information:

DOI: 10.1109/CDC.2017.8264389

Conference Location: Melbourne, VIC, Australia