Abstract
Semi-Markov decision processes with vector rewards are studied under the discounted pay-off criterion, with the usual notion of optimality replaced by Pareto optimality. We show that a pure stationary Pareto-optimal strategy exists, and that there is an algorithm that constructs an approximation of the Pareto curve, namely an \(\epsilon \)-approximate Pareto curve, in polynomial time, as in a multi-objective linear programming problem. Semi-Markov decision processes with multiple objectives find applications in settings such as dynamic goal programming, where the decision-maker has more than one objective to optimize simultaneously. We also investigate the Pareto-realizability problem and the NP-completeness of the pure stationary Pareto-realizability problem.
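To make the two notions in the abstract concrete, the following is a minimal sketch, not the chapter's algorithm. It assumes a finite set of nonnegative pay-off vectors to be maximized and uses the common multiplicative notion of an \(\epsilon \)-approximate Pareto curve: every achievable vector \(v\) is covered by some vector \(u\) in the approximating set with \(v_i \le (1+\epsilon )u_i\) in each coordinate. The function names and the sample pay-offs are hypothetical.

```python
from typing import List, Sequence

Vector = Sequence[float]


def dominates(u: Vector, v: Vector) -> bool:
    """True if u Pareto-dominates v (maximization): u >= v in every
    coordinate and u > v in at least one."""
    return all(a >= b for a, b in zip(u, v)) and any(a > b for a, b in zip(u, v))


def pareto_front(points: List[Vector]) -> List[Vector]:
    """Keep only the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def is_eps_cover(candidate: List[Vector], points: List[Vector], eps: float) -> bool:
    """Check the multiplicative eps-approximation condition (assumed here):
    each v in points has some u in candidate with v_i <= (1 + eps) * u_i
    for all i. Assumes nonnegative pay-offs."""
    return all(
        any(all(vi <= (1 + eps) * ui for vi, ui in zip(v, u)) for u in candidate)
        for v in points
    )


# Hypothetical two-objective pay-off vectors, invented for illustration.
payoffs = [(4.0, 1.0), (3.0, 3.0), (1.0, 4.0), (2.0, 2.0), (2.9, 2.9)]
front = pareto_front(payoffs)
```

In this toy instance the exact front trivially covers every pay-off with \(\epsilon = 0\), while the single vector `(3.0, 3.0)` covers all of them only once \(\epsilon \) is large enough, which is the sense in which a small set can stand in for the whole curve.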
Supported by the Department of Science and Technology, Govt. of India, INSPIRE Fellowship Scheme.
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Guha Bakshi, K. (2022). Semi-Markov Decision Processes with Vector Pay-Offs. In: Giri, D., Raymond Choo, KK., Ponnusamy, S., Meng, W., Akleylek, S., Prasad Maity, S. (eds) Proceedings of the Seventh International Conference on Mathematics and Computing. Advances in Intelligent Systems and Computing, vol 1412. Springer, Singapore. https://doi.org/10.1007/978-981-16-6890-6_76
DOI: https://doi.org/10.1007/978-981-16-6890-6_76
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-6889-0
Online ISBN: 978-981-16-6890-6
eBook Packages: Intelligent Technologies and Robotics (R0)