A Potential Reduction Algorithm for Ergodic Two-Person Zero-Sum Limiting Average Payoff Stochastic Games

Boros, Endre; Elbassioni, Khaled; Gurvich, Vladimir; Makino, Kazuhisa

doi:10.1007/978-3-319-12691-3_52

A Potential Reduction Algorithm for Ergodic Two-Person Zero-Sum Limiting Average Payoff Stochastic Games

Endre Boros¹⁷,
Khaled Elbassioni¹⁸,
Vladimir Gurvich¹⁷ &
…
Kazuhisa Makino¹⁹

Conference paper
First Online: 13 November 2014

1429 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8881))

Abstract

We suggest a new algorithm for two-person zero-sum undiscounted stochastic games focusing on stationary strategies. Given a positive real \(\epsilon \), let us call a stochastic game \(\epsilon \)-ergodic, if its values from any two initial positions differ by at most \(\epsilon \). The proposed new algorithm outputs for every \(\epsilon >0\) in finite time either a pair of stationary strategies for the two players guaranteeing that the values from any initial positions are within an \(\epsilon \)-range, or identifies two initial positions \(u\) and \(v\) and corresponding stationary strategies for the players proving that the game values starting from \(u\) and \(v\) are at least \(\epsilon /24\) apart. In particular, the above result shows that if a stochastic game is \(\epsilon \)-ergodic, then there are stationary strategies for the players proving \(24\epsilon \)-ergodicity. This result strengthens and provides a constructive version of an existential result by Vrieze (1980) claiming that if a stochastic game is \(0\)-ergodic, then there are \(\epsilon \)-optimal stationary strategies for every \(\epsilon > 0\). The suggested algorithm extends the approach recently introduced for stochastic games with perfect information, and is based on the classical potential transformation technique that changes the range of local values at all positions without changing the normal form of the game.

Part of this research was done at the Mathematisches Forschungsinstitut Oberwolfach during a stay within the Research in Pairs Program from March 7 to March 20, 2010.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Boros, E., Elbassioni, K., Gurvich, V., Makino, K.: A pumping algorithm for ergodic stochastic mean payoff games with perfect information. In: Eisenbrand, F., Shepherd, F.B. (eds.) IPCO 2010. LNCS, vol. 6080, pp. 341–354. Springer, Heidelberg (2010)
Chapter Google Scholar
Boros, E., Elbassioni, K., Gurvich, V., Makino, K.: On canonical forms for zero-sum stochastic mean payoff games. Dyn. Games Appl. 3(2), 128–161 (2013)
Article MATH MathSciNet Google Scholar
Blackwell, D., Ferguson, T.S.: The big match. Ann. Math. Statist. 39(1), 159–163 (1968)
Article MATH MathSciNet Google Scholar
Basu, S., Pollack, R., Roy, M.: On the combinatorial and algebraic complexity of quantifier elimination. J. ACM 43(6), 1002–1045 (1996). Preliminary version in FOCS 1994
Article MATH MathSciNet Google Scholar
Chatterjee, K., Ibsen-Jensen, R.: The complexity of ergodic mean-payoff games. In: Esparza, J., Fraigniaud, P., Husfeldt, T., Koutsoupias, E. (eds.) ICALP 2014, Part II. LNCS, vol. 8573, pp. 122–133. Springer, Heidelberg (2014)
Chapter Google Scholar
Chatterjee, K., Majumdar, R., Henzinger, T.A.: Stochastic limit-average games are in exptime. Int. J. Game Theory 37, 219–234 (2008)
Article MATH MathSciNet Google Scholar
Federgruen, A.: Successive approximation methods in undiscounted stochastic games. Oper. Res. 1, 794–810 (1980)
Article MathSciNet Google Scholar
Gallai, T.: Maximum-minimum Sätze über Graphen. Acta Mathematica Academiae Scientiarum Hungaricae 9, 395–434 (1958)
Article MATH MathSciNet Google Scholar
Gillette, D.: Stochastic games with zero stop probabilities. In: Dresher, M., Tucker, A.W., Wolfe, P. (eds) Contribution to the Theory of Games III, volume 39 of Annals of Mathematics Studies, pp. 179–187. Princeton University Press (1957)
Google Scholar
Grigoriev, D., Vorobjov, N.: Solving systems of polynomial inequalities in subexponential time. J. Symb. Comput. 5(1/2), 37–64 (1988)
Article Google Scholar
Hoffman, A.J., Karp, R.M.: On nonterminating stochastic games. Manag. Sci. Ser. A 12(5), 359–370 (1966)
MATH MathSciNet Google Scholar
Hansen, K.A., Koucky, M., Lauritzen, N., Miltersen, P.B., Tsigaridas. E.P.: Exact algorithms for solving stochastic games: extended abstract. In: Proceedings of the 43rd Annual ACM Symposium on Theory of Computing, STOC ’11, pp. 205–214. ACM, New York (2011)
Google Scholar
Kemeny, J.G., Snell, J.L.: Finite Markov chains. Springer, New York (1963)
Google Scholar
Miltersen, P.B.: Discounted stochastic games poorly approximate undiscounted ones, manuscript. Technical report (2011)
Google Scholar
Mertens, J.F., Neyman, A.: Stochastic games. Int. J. Game Theory 10, 53–66 (1981)
Article MATH MathSciNet Google Scholar
Moulin, H.: Prolongement des jeux à deux joueurs de somme nulle. Bull. Soc. Math. France, Memoire, 45 (1976)
Google Scholar
Renegar, J.: On the computational complexity and geometry of the first-order theory of the reals. J. Symb. Comput. 13(3), 255–352 (1992)
Article MATH MathSciNet Google Scholar
Raghavan, T.E.S., Filar, J.A.: Algorithms for stochastic games: a survey. Math. Methods Oper. Res. 35(6), 437–472 (1991)
Article MATH MathSciNet Google Scholar
Shapley, L.S.: Stochastic games. Proc. Nat. Acad. Sci. USA 39, 1095–1100 (1953)
Article MATH MathSciNet Google Scholar
Vrieze, O.J.: Stochastic games with finite state and action spaces. Ph.D. thesis, Centrum voor Wiskunde en Informatica, Amsterdam, The Netherlands (1980)
Google Scholar

Download references

Author information

Authors and Affiliations

MSIS Department of RBS and RUTCOR, Rutgers University, 100 Rockafeller Road, Piscataway, NJ, 08854-8054, USA
Endre Boros & Vladimir Gurvich
Masdar Institute of Science and Technology, Abu Dhabi, UAE
Khaled Elbassioni
Research Institute for Mathematical Sciences (RIMS), Kyoto University, Kyoto, 606–8502, Japan
Kazuhisa Makino

Authors

Endre Boros
View author publications
You can also search for this author in PubMed Google Scholar
Khaled Elbassioni
View author publications
You can also search for this author in PubMed Google Scholar
Vladimir Gurvich
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhisa Makino
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Khaled Elbassioni .

Editor information

Editors and Affiliations

Zhejiang Normal University, Jinhua, Zhejiang, China
Zhao Zhang
University of Texas, Tyler, Texas, USA
Lidong Wu
University of Texas, Dallas, Texas, USA
Wen Xu
University of Texas, Dallas, USA
Ding-Zhu Du

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Boros, E., Elbassioni, K., Gurvich, V., Makino, K. (2014). A Potential Reduction Algorithm for Ergodic Two-Person Zero-Sum Limiting Average Payoff Stochastic Games. In: Zhang, Z., Wu, L., Xu, W., Du, DZ. (eds) Combinatorial Optimization and Applications. COCOA 2014. Lecture Notes in Computer Science(), vol 8881. Springer, Cham. https://doi.org/10.1007/978-3-319-12691-3_52

Download citation

DOI: https://doi.org/10.1007/978-3-319-12691-3_52
Published: 13 November 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12690-6
Online ISBN: 978-3-319-12691-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics