Abstract
We present a class of countable state space Markovian decision models that can be investigated by means of an associated finite-state, finite-action reduced model which we call the skeleton. In particular, we obtain a turnpike theorem for the original model (Theorem 2 in Section 5) from a known turnpike theorem for the reduced finite model. For illustration, we present in detail an application of this approach to an inventory model (re-establishing a known turnpike result) and sketch analogous results for a cash-balance model and a growth model.
Zusammenfassung
Wir führen eine Klasse von Markovschen Entscheidungsmodellen mit abzählbarem Zustandsraum ein, die mittels eines verbundenen, reduzierten Modells mit endlichem Zustands- und Aktionsraum, welches wir das Skelett nennen, untersucht werden können. Insbesondere erhalten wir ein Turnpike Theorem für das ursprüngliche Modell (Theorem 2 im Abschnitt 5) von einem bekannten Turnpike Theorem für das reduzierte endliche Modell. Zur Erläuterung stellen wir im einzelnen eine Anwendung dieses Ansatzes für ein Lagerhaltungsmodell (Wiederherleitung eines bekannten Turnpike Ergebnisses) dar, und wir skizzieren analoge Ergebnisse für ein Kassenhaltungsmodell und ein Wachstumsmodell.
Similar content being viewed by others
References
Anthonisse, J.M., andH. Tijms: Exponential convergence of products of stochastic matrices. J. Math. Anal. Appl.59, 1977, 360–364.
Constantinides, G.M., andS.F. Richard: Existence of optimal simple policies for discounted-cost inventory and cash management in continuous time. Opns. Res.26, 1978, 620–636.
Hinderer, K., andG. Hübner: An improvement of J.F. Shapiro's turnpike theorem for the horizon of finite stage discrete dynamic programs. Transaction of the Seventh Prague Conference on Math. Statistics, Stochastic Processes and Information Science, vol. A, Prague 1974, 245–257.
Hordijk, A., andH. Tijms: Convergence results and approximations for optimal (s, S) policies. Man. Sci.20, 1974, 1432–1438.
—: On a conjecture of Iglehart. Man. Sci.21, 1975, 1342–1345.
Isaacson, D.L., andR.W. Madsen: Markov Chains. New York 1976.
Kadelka, D.: On existence of optimal control limit policies in a model of optimal growth. Methods of Opns. Res.44, 1981, 341–346.
Kaplan, R.S.: Stochastic growth models. Man. Sci.18, 1972, 249–264.
Neave, E.H.: The stochastic cash balance problem with fixed costs for increases and decreases. Man. Sci.16, 1970, 472–490.
Scarf, H.: The optimality of (S, s) policies in the dynamic inventory model. Mathematical Models in the Social Sciences. Ed. by K.J. Arrow, S. Karlin, and P. Suppes. Stanford 1960.
Schäl, M.: On the optimality of (s, S)-policies in dynamic inventory model with finite horizon. SIAM J. Appl. Math.30, 1976, 528–537.
Shapiro, J.F.: Turnpike planning horizons for a Markovian decision model. Man. Sci.14, 1968, 292–300.
Seneta, E.: Coefficients of ergodicity: structure and applications. Adv. Appl. Prob.5, 1979, 576–590.
Veinott, A.F., Jr.: On the optimality of (s, S) inventory policies: new conditions and a new proof. SIAM J. Appl. Math.14, 1966, 1067–1083.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Wrobel, A. On Markovian decision models with a finite skeleton. Zeitschrift für Operations Research 28, 17–27 (1984). https://doi.org/10.1007/BF01919083
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF01919083