Abstract
This work concerns semi-Markov chains evolving on a finite state space. The system development generates a cost when a transition is announced, as well as a holding cost which is incurred continuously during each sojourn time. It is assumed that these costs are paid by an observer with positive and constant risk-sensitivity, and the overall performance of the system is measured by the corresponding (long-run) risk-sensitive average cost criterion. In this framework, conditions are provided under which the average index does not depend on the initial state and is characterized in terms of a single Poisson equation.
Similar content being viewed by others
References
Alanis-Duran A, Cavazos-Cadena R (2012) An optimality system for finite average Markov decision chains under risk-aversion. Kybernetika 48:83–104
Bauerle N, Rieder U (2011) Markov Decision Processes with Applications to Finance. Springer, New York
Bauerle N, Rieder U (2013) More risk-sensitive Markov decision processes. Math Oper Res 39:105–120
Baykal-Gursoy M (2010) Semi-Markov decision processes. Wiley Encyclopedia of Operations Research and Management Sciences, Wiley, New York
Cavazos-Cadena R (2009) Solutions of the average cost optimality equation for finite Markov decision chains: risk-sensitive and risk-neutral criteria. Math Method Oper Res 70:541–566
Cavazos-Cadena R, Fernandez-Gaucherand E (1999) Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions. Math Method Oper Res 49:299–324
Cavazos-Cadena R, Fernandez-Gaucherand E (2002) Risk-sensitive control in communicating average Markov decision chains. In: Dror M, L’Ecuyer P, Szidarovsky F (eds) Modelling Uncertainty: An examination of Stochastic Theory, Methods and Applications. Kluwer, Boston, pp 525–544
Cavazos-Cadena R, Hernandez-Hernandez D (2004) A characterization of the average cost in finite controlled Markov chains. Ann Appl Probab 15:175–212
Di Masi GB, Stettner L (1999) Risk-sensitive control of discrete time Markov processes with infinite horizon. SIAM J Control Optim 38:61–78
Di Masi GB, Stettner L (2000) Infinite horizon risk sensitive control of discrete time Markov processes with small risk. SIAM J Control Optim 40:15–20
Di Masi GB, Stettner L (2007) Infinite horizon risk sensitive control of discrete time Markov processes under minorization property. SIAM J Control Optim 46:231–252
Hernandez-Hernandez D, Marcus SI (1996) Risk sensitive control of Markov processes in countable state space. Syst Control Lett 29:147–155
Hernandez-Hernandez D, Marcus SI (1999) Existence of risk sensitive optimal stationary polices for controlled Markov processes. Appl Math Opt 40:273–285
Howard RA, Matheson JE (1972) Risk-sensitive Markov decision processes. Manage Sci 18:356–369
Hu Q, Yue W (2003) Optimal replacement of a system according to a semi-Markov decision process in a semi-Markov environment. Optim Method Softw 18:181–196
Jaskiewicz A (2007) Average optimality for risk sensitive control with general state space. Ann Appl Probab 17:654–675
Jianyong L, Xiaobo Z (2004) On average reward semi-Markov decision processes with a general multichain structure. Math Oper Res 29:339–352
Luque-Vasquez F, Hernandez-Lerma O (1999) Semi-Markov control models with average costs. Applicationes Mathematicae 26:315–331
Pinedo M (2008) Scheduling: Theory, Algorithms and Systems. Springer, New York
Schweitzer PJ (1971) Iterative solutions of the functional equations of undiscounted Markov renewal programming. J Math Anal Appl 34:495–501
Sennott LI (1999) Stochastic Dynamic Programming and the Control of Queueing Systems. Wiley, New York
Stidham Jr S, Weber RR (1993) A survey of Markov decision models for control of networks of queues. Queueing Syst 13:291–314
Tijms HC (2003) A first course in stochastic models. Wiley, New York
Acknowledgments
The author is deeply grateful to the reviewers and the Associate Editor for their careful reading of the original manuscript, and for their helpful suggestions to improve the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Dedicated to Professor Onésimo Hernández-Lerma, on the occasion of his seventieth birthday.
This work was partially supported by the PSF Organization under Grant No. 300-01-15, and by PRODEP under Grant No. 17332-CA-23.
Rights and permissions
About this article
Cite this article
Cavazos-Cadena, R. A poisson equation for the risk-sensitive average cost in semi-markov chains. Discrete Event Dyn Syst 26, 633–656 (2016). https://doi.org/10.1007/s10626-015-0224-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10626-015-0224-z