Dynamic programming for risk-aware sequential optimization | IEEE Conference Publication | IEEE Xplore

Dynamic programming for risk-aware sequential optimization


Abstract:

We consider the problem of minimizing a risk measure of the total cost of a Markov decision process (MDP), under the risk-aware MDPs paradigm. This model accounts for the...Show More

Abstract:

We consider the problem of minimizing a risk measure of the total cost of a Markov decision process (MDP), under the risk-aware MDPs paradigm. This model accounts for the variation/spread/dispersion of the random cost in contrast to classical MDPs which are risk-neutral and emphasize expected cost. In this paper, we extend previous work on risk-aware MDPs by considering a wider class of risk measures which are amenable to dynamic programming. We develop solution methods for this class using grid search and convex approximation schemes, and show that the proposed methods produce the optimal policy. We conclude with numerical experiments which demonstrate the versatility and effectiveness of our approach.
Date of Conference: 12-15 December 2017
Date Added to IEEE Xplore: 22 January 2018
ISBN Information:
Conference Location: Melbourne, VIC, Australia

Contact IEEE to Subscribe

References

References is not available for this document.