Abstract
The computation of subgame perfect equilibria in stationary strategies is an important but challenging problem in applications of stochastic games. In 2004, Herings and Peeters developed a homotopy method, the stochastic linear tracing procedure, to solve this problem. However, the starting point of their method must be computed explicitly. To remedy this issue, we formulate an arbitrary-starting linear tracing procedure in this paper. By introducing a homotopy variable ranging from two to zero, we develop an artificial penalty game whose solutions form a differentiable path after a well-chosen change of variables. The starting point of the path can be chosen arbitrarily, so no additional algorithm is needed to obtain it. By following the path, one readily attains the "starting point" of the stochastic tracing procedure devised by Herings and Peeters. Then, as the homotopy variable decreases from one to zero, the path essentially coincides with the stochastic tracing procedure. We prove that our method globally converges to a subgame perfect equilibrium in stationary strategies for the stochastic game of interest. Numerical results further illustrate the effectiveness and efficiency of the method.
Notes
Interested readers may refer to [16] for a detailed treatment of the value iteration and policy iteration algorithms for solving Markov decision problems.
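For concreteness, the following is a minimal value-iteration sketch for a finite Markov decision problem. The two-state, two-action instance is made up for illustration and is not from the paper.

```python
def value_iteration(P, r, delta, tol=1e-10):
    """P[a][s][sp]: transition probabilities, r[a][s]: rewards, delta in (0,1)."""
    n = len(r[0])
    v = [0.0] * n
    while True:
        # Bellman update: best one-step reward plus discounted continuation value
        v_new = [
            max(r[a][s] + delta * sum(P[a][s][sp] * v[sp] for sp in range(n))
                for a in range(len(r)))
            for s in range(n)
        ]
        if max(abs(v_new[s] - v[s]) for s in range(n)) < tol:
            return v_new
        v = v_new

# Hypothetical 2-state, 2-action problem (illustrative data only)
P = [[[0.9, 0.1], [0.2, 0.8]],   # transitions under action 0
     [[0.5, 0.5], [0.4, 0.6]]]   # transitions under action 1
r = [[1.0, 0.0], [0.5, 0.8]]     # rewards r[a][s]
v = value_iteration(P, r, delta=0.9)
```

Since the Bellman operator is a contraction with modulus \(\delta <1\), the iteration converges to the unique fixed point.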
\(\beta _0\) is chosen as \(10^{-12}\) or even smaller in our numerical experiments. We introduce \(\beta _0\) here just to guarantee the differentiability of \(\nu (t)\) and our homotopy, which will be discussed later.
The homotopy variable t in SLTP changes from zero to one, while the direction of t in our method is opposite.
Interested readers may refer to [25] for a detailed treatment of the transversality theorem.
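Generically, a procedure of this kind traces the zero set of a homotopy \(H(x,t)\) as \(t\) decreases from two to zero. The sketch below follows a convex homotopy between the trivial system \(x-x_0=0\) (solved by the arbitrarily chosen start \(x_0\)) and a toy target equation \(f(x)=0\); the map \(f\), the stepping scheme, and the Newton corrector are illustrative assumptions, not the paper's actual penalty-game homotopy.

```python
def f(x):                  # toy target equation f(x) = 0 (assumed for illustration)
    return x**3 + 2.0 * x - 4.0

def df(x):
    return 3.0 * x**2 + 2.0

def trace(x0, steps=200, newton_iters=20):
    """Follow H(x,t) = (1 - t/2) f(x) + (t/2)(x - x0) = 0 as t goes 2 -> 0."""
    x = x0
    for k in range(steps + 1):
        t = 2.0 * (1.0 - k / steps)          # homotopy variable decreases from 2 to 0
        s = t / 2.0
        for _ in range(newton_iters):        # Newton corrector in x at fixed t
            h = (1.0 - s) * f(x) + s * (x - x0)
            dh = (1.0 - s) * df(x) + s       # > 0 for all x >= 0, so Newton is well defined
            x -= h / dh
    return x

root = trace(x0=5.0)       # the start is arbitrary, echoing the arbitrary-starting idea
```

At \(t=2\) the system reduces to \(x=x_0\), and at \(t=0\) it reduces to \(f(x)=0\), so the traced path connects the arbitrary start to a solution of the target system.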
References
Chen, Y., Dang, C.: A differentiable homotopy method to compute perfect equilibria. Math. Program. (2019). https://doi.org/10.1007/s10107-019-01422-y
Chen, Y., Dang, C.: An extension of quantal response equilibrium and determination of perfect equilibrium. Games Econ. Behav. (2018). https://doi.org/10.1016/j.geb.2017.12.023
Herings, P.J.J., Peeters, R.: A globally convergent algorithm to compute all Nash equilibria for \(n\)-person games. Ann. Oper. Res. 137(1), 349–368 (2005)
Shapley, L.S.: Stochastic games. Proc. Natl. Acad. Sci. USA 39(10), 1095–1100 (1953)
Adlakha, S., Johari, R., Weintraub, G.Y.: Equilibria of dynamic games with many players: existence, approximation, and market structure. J. Econ. Theory 156, 269–316 (2015)
Maskin, E., Tirole, J.: Markov perfect equilibrium. I. Observable actions. J. Econ. Theory 100(2), 191–219 (2001)
Fink, A.M.: Equilibrium in a stochastic \(n\)-person game. J. Sci. Hiroshima Univ. Ser. A-I Math. 28(1), 89–93 (1964)
Sobel, M.J.: Noncooperative stochastic games. Ann. Math. Stat. 42(6), 1930–1935 (1971)
Takahashi, M.: Equilibrium points of stochastic non-cooperative \(n\)-person games. J. Sci. Hiroshima Univ. Ser. A-I Math. 28(1), 95–99 (1964)
Eaves, B.C.: Homotopies for computation of fixed points. Math. Program. 3(1), 1–22 (1972)
Scarf, H.: The approximation of fixed points of a continuous mapping. SIAM J. Appl. Math. 15(5), 1328–1343 (1967)
Dreves, A.: How to select a solution in generalized Nash equilibrium problems. J. Optim. Theory Appl. 178, 973–997 (2018). https://doi.org/10.1007/s10957-018-1327-0
Dang, C.: Simplicial methods for approximating fixed point with applications in combinatorial optimizations. In: Handbook of Combinatorial Optimization, pp. 3015–3056 (2013)
Zhan, Y., Dang, C.: A smooth path-following algorithm for market equilibrium under a class of piecewise-smooth concave utilities. Comput. Optim. Appl. 71(2), 381–402 (2018)
Herings, P.J.J., Peeters, R.J.: Stationary equilibria in stochastic games: structure, selection, and computation. J. Econ. Theory 118, 32–60 (2004)
Puterman, M.L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York (1994)
Fudenberg, D., Tirole, J.: Game Theory, vol. 393. MIT Press, Cambridge (1991)
Parthasarathy, T., Raghavan, T.E.S.: An orderfield property for stochastic games when one player controls transition probabilities. J. Optim. Theory Appl. 33(3), 375–392 (1981)
Corbae, D., Stinchcombe, M.B., Zeman, J.: An Introduction to Mathematical Analysis for Economic Theory and Econometrics. Princeton University Press, Princeton (2009)
Browder, F.E.: On continuity of fixed points under deformations of continuous mappings. Summa Brasiliensis Mathematicae 4, 183–191 (1960)
Herings, J.J.: Two simple proofs of the feasibility of the linear tracing procedure. Econ. Theory 15(2), 485–490 (2000)
Herings, P.J.J., Peeters, R.J.: A differentiable homotopy to compute Nash equilibria of \(n\)-person games. Econ. Theory 18(1), 159–185 (2001)
Lemke, C.: Pathways to Solutions, Fixed-points, and Equilibria. Prentice-Hall, Upper Saddle River (1983)
Schanuel, S.H., Simon, L.K., Zame, W.R.: The algebraic geometry of games and the tracing procedure. In: Game Equilibrium Models II, pp. 9–43. Springer, New York (1991)
Eaves, B.C., Schmedders, K.: General equilibrium models and homotopy methods. J. Econ. Dyn. Control 23(9–10), 1249–1279 (1999)
Allgower, E.L., Georg, K.: Introduction to Numerical Continuation Methods, vol. 45. SIAM, Philadelphia (2003)
Acknowledgements
This work was partially supported by the National Natural Science Foundation of China (Grant 61976184).
Communicated by Francesco Zirilli.
Appendices
A: Proof of Theorem 3.2
In this appendix, we intend to prove that zero is a regular value of \(q(y,\mu ,t;\alpha )\) when \(0< t\le 2\). This result is applied in the proof of Theorem 3.2.
First, we consider the case that \(0<t<2\). For simplicity, we rewrite q as
where
and
Let \(v:=(y,\mu ,t;\alpha )\in \mathbb R^m\times \varLambda \times (0,2) \times {\mathbb {R}}^{m}\). Then, the Jacobian matrix of q is an \((m+nd)\times (2m+nd+1)\) matrix, which can be written as
where \(\dfrac{\partial q^1}{\partial y}\in {\mathbb {R}}^{m}\times {\mathbb {R}}^m\), \(\dfrac{\partial q^1}{\partial \mu }\in {\mathbb {R}}^m \times {\mathbb {R}}^{nd}\), and \(\dfrac{\partial q^1}{\partial t} \in {\mathbb {R}}^m \times {\mathbb {R}}^1\). Moreover, \(\dfrac{\partial q^1}{\partial \alpha }=-t(2-t)\mathbf{I}_m\), where \(\mathbf{I}_m\) is the \(m\times m\) identity matrix. Thus, when \(0<t<2\), \(\dfrac{\partial q^1}{\partial \alpha }\) is a nonsingular diagonal matrix. Let \(r_{\omega j}^i:=\max \{0,y_{\omega j}^i\}\) and let \(r_{\omega }^i\) denote the vector \((r_{\omega 1}^i, r_{\omega 2}^i,\ldots ,r_{\omega m_{\omega }^i}^i)\). Then, \(\dfrac{\partial q^2}{\partial y}=2\,\mathrm {diag}(r_{\omega }^i)\) is an \((nd\times m)\) matrix. Clearly, \(\dfrac{\partial q^2}{\partial y}\) has full row rank, because for every pair \((i,\omega )\) there is at least one positive \(r_{\omega j}^i\) \((j\in M_{\omega }^i)\), and any two rows of the matrix are linearly independent. It then follows from elementary row operations that the Jacobian matrix \(J\) has full row rank.
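The structure of this argument can be checked numerically on a toy instance: whatever the left block looks like (even if rank-deficient), appending the nonsingular diagonal block \(-t(2-t)\mathbf{I}_m\) forces full row rank for \(0<t<2\), while at \(t=2\) the block vanishes, which is why that case needs the separate treatment below. The matrices here are illustrative, not the paper's Jacobian.

```python
def rank(M, tol=1e-10):
    """Row rank via Gaussian elimination with partial pivoting."""
    M = [row[:] for row in M]
    rows, cols = len(M), len(M[0])
    r = 0
    for c in range(cols):
        piv = max(range(r, rows), key=lambda i: abs(M[i][c]))
        if abs(M[piv][c]) < tol:
            continue                      # no pivot in this column
        M[r], M[piv] = M[piv], M[r]
        for i in range(r + 1, rows):
            factor = M[i][c] / M[r][c]
            M[i] = [M[i][j] - factor * M[r][j] for j in range(cols)]
        r += 1
        if r == rows:
            break
    return r

m, t = 3, 0.7
A = [[1.0, 2.0, 3.0], [2.0, 4.0, 6.0], [0.0, 0.0, 0.0]]  # rank-deficient on purpose
# Append the diagonal block -t(2-t)I: the combined matrix [A | -t(2-t)I]
J = [A[i] + [-t * (2.0 - t) if j == i else 0.0 for j in range(m)] for i in range(m)]
assert rank(J) == m                       # full row rank for 0 < t < 2
```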
Second, we discuss the case \(t=2\), where the Jacobian matrix is denoted by \(J_1\). Specifically,
where
is a nonsingular diagonal matrix, and \(J_{12}=\mathrm {diag}(e_{m_{\omega }^i})\in {\mathbb {R}}^{m\times nd}\), where \(e_{m_{\omega }^i}=(1,\ldots ,1)^{\top }\) is an \(m_{\omega }^i\)-dimensional column vector and \(\sum \limits _{i\in N}\sum \limits _{\omega \in \varOmega }m_{\omega }^i=m\). Hence, \(J_{12}\) has full column rank. We have already shown that \(\dfrac{\partial q^2}{\partial y}\) has full row rank. Therefore, \(J_1\) has full rank. This completes the proof. \(\square \)
B: The Compactness of \(\varPhi \)
In this appendix, we prove the compactness of \(\varPhi \). This result is used in Corollary 3.1. Let \(S_0\) be the set of all \((x,\lambda ,\mu ,t)\) satisfying the system (9). We first show that \(S_0\) is bounded. Because \(x_{\omega j}^i\) is a probability with \(0\le x_{\omega j}^i\le 1\), it is bounded for all \(j\in M_{\omega }^i\), \(\omega \in \varOmega \), and \(i\in N\). Let \(u^{i+}:=(u^{i+}_{\omega })_{\omega \in \varOmega }\) and \(\pi ^+:=(\pi ^+({\bar{\omega }}|\omega ))_{{\bar{\omega }},\omega \in \varOmega }\) with
Rewriting Eq. (7) in a vector form, we get that
Because \((1-(1-\zeta (t))\delta \pi ^+)>0\), we obtain the boundedness of \(\mu \):
Moreover, from the first group of equations of the system (9), the boundedness of \(\lambda _{\omega j}^i\) follows. Let \(X:=\{x\in {\mathbb {R}}^m_+\;:\; 0\le x\le 1\}\) and \(\varDelta :=\{\lambda \in {\mathbb {R}}^m_+\;:\; \lambda _0\le \lambda \le \lambda _1 \}\), where \(\lambda _0\) and \(\lambda _1\) are the lower and upper bounds of \(\lambda \), respectively. Similarly, define \(\varLambda :=\{\mu \in \mathbb R^{nd}\;:\;\mu _0\le \mu \le \mu _1\}\), where \(\mu _0\) and \(\mu _1\) are the lower and upper bounds of \(\mu \), respectively. Hence, \(S_0\) is contained in the nonempty, convex, and compact set \(X\times \varDelta \times \varLambda \times [0,2]\). The compactness of \(\varPhi \) then follows immediately. \(\square \)
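The boundedness of \(\mu \) is in essence the standard Neumann-series bound \(\Vert \mu \Vert _\infty \le \Vert u^+\Vert _\infty /(1-\delta )\) for a discount factor \(\delta \in (0,1)\) and a row-stochastic transition matrix \(\pi \). A minimal numerical illustration, with made-up two-state data and the \(\zeta (t)\) factor dropped for simplicity:

```python
def solve_neumann(pi, u, delta, iters=2000):
    """Fixed-point iteration mu <- u + delta * pi @ mu (Neumann series)."""
    n = len(u)
    mu = [0.0] * n
    for _ in range(iters):
        mu = [u[s] + delta * sum(pi[s][sp] * mu[sp] for sp in range(n))
              for s in range(n)]
    return mu

# Illustrative 2-state data: pi row-stochastic, u the positive payoff bound
pi = [[0.6, 0.4], [0.3, 0.7]]
u = [1.0, 0.5]
delta = 0.9
mu = solve_neumann(pi, u, delta)
bound = max(abs(x) for x in u) / (1.0 - delta)   # a priori bound, = 10.0 here
```

Since the iteration map is a contraction with modulus \(\delta \), the fixed point exists, is unique, and respects the a priori bound, which is exactly what delivers the boundedness of \(\mu \) above.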
Cite this article
Li, P., Dang, C. An Arbitrary Starting Tracing Procedure for Computing Subgame Perfect Equilibria. J Optim Theory Appl 186, 667–687 (2020). https://doi.org/10.1007/s10957-020-01703-z
Keywords
- Noncooperative stochastic games
- Subgame perfect equilibrium
- Linear tracing procedure
- Arbitrary starting