Markov Decision Processes on Borel Spaces with Total Cost and Random Horizon

Cruz-Suárez, Hugo; Ilhuicatzi-Roldán, Rocio; Montes-de-Oca, Raúl

doi:10.1007/s10957-012-0262-8

Markov Decision Processes on Borel Spaces with Total Cost and Random Horizon

Published: 24 January 2013

Volume 162, pages 329–346, (2014)
Cite this article

Journal of Optimization Theory and Applications Aims and scope Submit manuscript

Hugo Cruz-Suárez¹,
Rocio Ilhuicatzi-Roldán¹ &
Raúl Montes-de-Oca²

492 Accesses
6 Citations
Explore all metrics

Abstract

This paper deals with Markov Decision Processes (MDPs) on Borel spaces with possibly unbounded costs. The criterion to be optimized is the expected total cost with a random horizon of infinite support. In this paper, it is observed that this performance criterion is equivalent to the expected total discounted cost with an infinite horizon and a varying-time discount factor. Then, the optimal value function and the optimal policy are characterized through some suitable versions of the Dynamic Programming Equation. Moreover, it is proved that the optimal value function of the optimal control problem with a random horizon can be bounded from above by the optimal value function of a discounted optimal control problem with a fixed discount factor. In this case, the discount factor is defined in an adequate way by the parameters introduced for the study of the optimal control problem with a random horizon. To illustrate the theory developed, a version of the Linear-Quadratic model with a random horizon and a Logarithm Consumption-Investment model are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Constrained Markov decision processes in Borel spaces: from discounted to average optimality

Article 20 June 2016

Constrained Markov Decision Processes with Non-constant Discount Factor

Article Open access 30 May 2024

Risk-sensitive continuous-time Markov decision processes with unbounded rates and Borel spaces

Article 19 October 2019

References

Puterman, M.L.: Markov Decision Process: Discrete Stochastic Dynamic Programming. Wiley, New York (1994)
Book MATH Google Scholar
Baüerle, N., Rieder, U.: Markov Decision Processes with Applications to Finance. Springer, New York (2010)
Google Scholar
Kozlowski, E.: The linear–quadratic stochastic optimal control problem with random horizon at the finite number of infinitesimal events. Ann. UMCS Inform. 1, 103–115 (2010)
MathSciNet Google Scholar
Levhari, D., Mirman, L.J.: Savings and consumption with uncertain horizon. J. Polit. Econ. 85, 265–281 (1977)
Article Google Scholar
Bather, J.: Decision Theory: An Introduction to Dynamic Programming and Sequential Decision. Wiley, New York (2000)
Google Scholar
Chatterjee, D., Cinquemani, E., Chaloulos, G., Lygeros, J.: Stochastic control up to a hitting time: optimality and Rolling-Horizon implementation (2009). arXiv:0806.3008
Iida, T., Mori, M.: Markov decision processes with random horizon. J. Oper. Res. Soc. Jpn. 39, 592–603 (1996)
MATH MathSciNet Google Scholar
Hernández-Lerma, O., Lasserre, J.B.: Discrete-Time Markov Control Processes: Basic Optimality Criteria. Springer, New York (1996)
Book Google Scholar
Guo, X., Hernandez-del-Valle, A., Hernández-Lerma, O.: Nonstationary discrete-time deterministic and stochastic control systems with infinite horizon. Int. J. Control 83, 1751–1757 (2010)
Article MATH Google Scholar
Hinderer, K.: Foundation of Non-Stationary Dynamic Programming with Discrete Time Parameter. Springer, New York (1970)
Book Google Scholar
Bertsekas, D.P., Shreve, S.E.: Stochastic Optimal Control: the Discrete Time Case. Academic Press, Massachusetts (1978)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Facultad de Ciencias Físico-Matemáticas, Benemérita Universidad Autónoma de Puebla, Av. San Claudio y 18 Sur, Puebla, Mexico
Hugo Cruz-Suárez & Rocio Ilhuicatzi-Roldán
Departamento de Matemáticas, Universidad Autónoma Metropolitana-Iztapalapa, Av. San Rafael Atlixco 186, Col., Vicentina, Mexico D.F., 09340, Mexico
Raúl Montes-de-Oca

Authors

Hugo Cruz-Suárez
View author publications
You can also search for this author inPubMed Google Scholar
Rocio Ilhuicatzi-Roldán
View author publications
You can also search for this author inPubMed Google Scholar
Raúl Montes-de-Oca
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Hugo Cruz-Suárez.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cruz-Suárez, H., Ilhuicatzi-Roldán, R. & Montes-de-Oca, R. Markov Decision Processes on Borel Spaces with Total Cost and Random Horizon. J Optim Theory Appl 162, 329–346 (2014). https://doi.org/10.1007/s10957-012-0262-8

Download citation

Received: 27 June 2012
Accepted: 25 December 2012
Published: 24 January 2013
Issue Date: July 2014
DOI: https://doi.org/10.1007/s10957-012-0262-8

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Markov Decision Processes on Borel Spaces with Total Cost and Random Horizon

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Constrained Markov decision processes in Borel spaces: from discounted to average optimality

Constrained Markov Decision Processes with Non-constant Discount Factor

Risk-sensitive continuous-time Markov decision processes with unbounded rates and Borel spaces

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now