Semi-Markov Decision Processes with Vector Pay-Offs

Guha Bakshi, Kushal

doi:10.1007/978-981-16-6890-6_76

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1412)

Abstract

Semi-Markov decision processes with vector rewards are studied under the discounted pay-off criterion, with the usual notion of optimality replaced by Pareto optimality. We show that a pure stationary Pareto-optimal strategy exists, and that there is an algorithm that constructs an approximate version of the Pareto curve, i.e., an \(\epsilon \)-approximate Pareto curve, in polynomial time, as in a multi-objective linear programming problem. Semi-Markov decision processes with multiple objectives find applications in settings such as dynamic goal programming, where the decision-maker has more than one objective to optimize simultaneously. We also investigate the Pareto-realizability problem and the NP-completeness of the pure stationary Pareto-realizability problem.
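The two notions the abstract relies on, Pareto optimality and an \(\epsilon \)-approximate Pareto curve, can be illustrated on a finite set of value vectors. The sketch below is not the chapter's algorithm: it assumes non-negative rewards that are to be maximized, it uses one common multiplicative definition of \(\epsilon \)-approximation (every achievable vector is matched, up to a factor \(1+\epsilon \) in each coordinate, by some vector in the candidate set), and the function names and sample vectors are hypothetical.

```python
from typing import List, Sequence

Vector = Sequence[float]


def dominates(u: Vector, v: Vector) -> bool:
    """True if u is at least as good as v in every objective and strictly better in one."""
    return all(a >= b for a, b in zip(u, v)) and any(a > b for a, b in zip(u, v))


def pareto_front(points: List[Vector]) -> List[Vector]:
    """Keep only the vectors that no other vector in the list dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def is_eps_cover(candidate: List[Vector], points: List[Vector], eps: float) -> bool:
    """Check the assumed epsilon-approximation condition (maximization, rewards >= 0):
    for every v in points there is some u in candidate with (1 + eps) * u_i >= v_i for all i."""
    return all(
        any(all((1.0 + eps) * c >= x for c, x in zip(u, v)) for u in candidate)
        for v in points
    )


# Hypothetical discounted value vectors of five strategies, two objectives each.
values = [(4.0, 1.0), (3.0, 3.0), (1.0, 4.0), (2.0, 2.0), (3.9, 1.0)]
front = pareto_front(values)
```

Here `front` keeps the three mutually non-dominated vectors, and `is_eps_cover` can be used to check whether a smaller or perturbed set still approximates every achievable vector within the chosen \(\epsilon \).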

Supported by the Department of Science and Technology, Govt. of India, INSPIRE Fellowship Scheme.

Author information

Authors and Affiliations

Jadavpur University, 188, Raja S.C. Mallick Rd, Kolkata, 700032, India
Kushal Guha Bakshi

Editor information

Editors and Affiliations

Department of Information Technology, Maulana Abul Kalam Azad University of Technology, Haringhata, India
Debasis Giri
The University of Texas at San Antonio, San Antonio, TX, USA
Kim-Kwang Raymond Choo
Department of Mathematics, Indian Institute of Technology Madras, Chennai, India
Saminathan Ponnusamy
Technical University of Denmark, Kongens Lyngby, Denmark
Weizhi Meng
Department of Computer Engineering, Ondokuz Mayis University, Atakum, Turkey
Sedat Akleylek
Department of Information Technology, Indian Institute of Engineering Science and Technology, West Bengal, India
Santi Prasad Maity

Cite this paper

Guha Bakshi, K. (2022). Semi-Markov Decision Processes with Vector Pay-Offs. In: Giri, D., Raymond Choo, KK., Ponnusamy, S., Meng, W., Akleylek, S., Prasad Maity, S. (eds) Proceedings of the Seventh International Conference on Mathematics and Computing. Advances in Intelligent Systems and Computing, vol 1412. Springer, Singapore. https://doi.org/10.1007/978-981-16-6890-6_76

DOI: https://doi.org/10.1007/978-981-16-6890-6_76
Published: 06 March 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-6889-0
Online ISBN: 978-981-16-6890-6
eBook Packages: Intelligent Technologies and Robotics (R0)
