Skip to main content
Log in

Conditions for the uniqueness of optimal policies of discounted Markov decision processes

  • Published:
Mathematical Methods of Operations Research Aims and scope Submit manuscript

Abstract.

This paper presents three conditions. Each of them guarantees the uniqueness of optimal policies of discounted Markov decision processes. The conditions presented here impose hypotheses specifically on the state space X, the action space A, the admissible action sets A(x),xX, the transition probability Q, and on the cost function c. Two of these conditions require mainly convexity assumptions, but the third one does not need this kind of assumptions. However, it needs certain stochastic order relations in Q, and the cost function c to reach its minimum with respect to the actions, just in one action. We illustrate the conditions with several examples including, in particular, discrete models, the linear regulator problem, and also a model of an inventory control system.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Raúl Montes-de-Oca.

Additional information

Manuscript received: May 2003 / Final version received: January 2004

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cruz-Suárez, D., Montes-de-Oca, R. & Salem-Silva, F. Conditions for the uniqueness of optimal policies of discounted Markov decision processes. Math Meth Oper Res 60, 415–436 (2004). https://doi.org/10.1007/s001860400372

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s001860400372

Keywords

Mathematics Subject Classification 2000:

Navigation