Summary
A method of successive approximations for discounted Markovian decision problems is described by MacQueen [1966]. This paper presents a set of methods that includes MacQueen's improved version of the standard dynamic programming iterative scheme. Furthermore, because a somewhat different approach is used, the physical meaning of some aspects of the successive approximation methods will probably become more transparent. Some numerical results are given.
Summary (translated from the German)
For discounted Markov decision processes, MacQueen [1966] described a method of successive approximation. The present paper introduces a set of methods that contains MacQueen's improved version of the iterative scheme of classical dynamic programming. In addition, the different approach chosen here is used to try to make the practical meaning of some aspects of the method of successive approximation more transparent. Some numerical examples are presented.
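As an illustration of the kind of scheme the summary refers to, the following is a minimal sketch of standard successive approximation (value iteration) for a discounted, reward-maximizing Markov decision problem, combined with the well-known MacQueen-type lower and upper bounds on the optimal value. It is not the paper's own set of methods. The function name, the array layout `P[a][i][j]` and `r[a][i]`, and the two-state example data are all assumptions made for this sketch.

```python
def value_iteration_with_bounds(P, r, beta, tol=1e-8, max_iter=10_000):
    """Successive approximation with MacQueen-type bounds (illustrative sketch).

    P[a][i][j]: probability of moving from state i to j under action a (assumed layout).
    r[a][i]:    one-step reward in state i under action a (assumed layout).
    beta:       discount factor, 0 <= beta < 1.
    Returns (lower, upper, policy): componentwise bounds on the optimal value
    and the maximizing action per state from the last iteration.
    """
    n_actions, n_states = len(P), len(P[0])
    v = [0.0] * n_states
    scale = beta / (1.0 - beta)
    for _ in range(max_iter):
        v_new, policy = [], []
        for i in range(n_states):
            # One-step lookahead value of each action in state i.
            q = [r[a][i] + beta * sum(P[a][i][j] * v[j] for j in range(n_states))
                 for a in range(n_actions)]
            best = max(range(n_actions), key=q.__getitem__)
            policy.append(best)
            v_new.append(q[best])
        # Extrapolate with the smallest and largest change between iterates.
        diffs = [x - y for x, y in zip(v_new, v)]
        lower = [x + scale * min(diffs) for x in v_new]
        upper = [x + scale * max(diffs) for x in v_new]
        v = v_new
        if scale * (max(diffs) - min(diffs)) < tol:
            break
    return lower, upper, policy


# Hypothetical two-state, two-action example (data invented for illustration).
P = [[[0.9, 0.1], [0.4, 0.6]],
     [[0.2, 0.8], [0.7, 0.3]]]
r = [[1.0, 0.0],
     [0.5, 2.0]]
beta = 0.9
lower, upper, policy = value_iteration_with_bounds(P, r, beta)
```

The stopping test uses the gap between the two bounds rather than the change in the iterates themselves, so the iteration can end as soon as the optimal value is bracketed tightly enough.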
References
Blackwell, D.: Discrete dynamic programming. Ann. Math. Stat. 33, 719–729, 1962.
Blackwell, D.: Discounted dynamic programming. Ann. Math. Stat. 36, 226–234, 1965.
Denardo, E.: Contraction mappings in the theory underlying dynamic programming. SIAM Review 9, 165–177, 1967.
van Doorn, E.: Successieve approximatiemethoden voor Markov beslissingsprocessen met verdiskontering. Memorandum COSOR 73-02, Eindhoven University of Technology, Department of Mathematics, Eindhoven.
de Ghellinck, G., and G. Eppen: Linear programming solutions for separable Markovian decision problems. Management Science 13, 371–394, 1967.
Grinold, R.: Elimination of suboptimal actions in Markov decision problems. Operat. Res. 21, 848–851, 1973.
Hastings, N.: The repair limit replacement method. Operat. Res. 19, 337–349, 1969.
Howard, R.: Dynamic programming and Markov processes. Cambridge 1960.
MacQueen, J.: A modified dynamic programming method for Markovian decision problems. J. Math. An. Appl. 14, 38–43, 1966.
MacQueen, J.: A test for suboptimal actions in Markovian decision problems. Operat. Res. 15, 559–561, 1967.
Mine, H., and S. Osaki: Markovian decision processes. New York 1970.
Porteus, E.: Some bounds for discounted sequential decision processes. Man. Sci. 18, 7–11, 1971.
Wessels, J., and J. van Nunen: Discounted semi-Markov decision processes: linear programming and policy iteration. Statistica Neerlandica 29, 1–7, 1975.
Cite this article
van Nunen, J.A.E.E. A set of successive approximation methods for discounted Markovian decision problems. Zeitschrift für Operations Research 20, 203–208 (1976). https://doi.org/10.1007/BF01920264