Abstract
In this note, we consider the non-cooperative linear feedback Nash quadratic differential game with an infinite planning horizon. The performance function is assumed to be indefinite and the underlying system affine. We derive both necessary and sufficient conditions under which this game has a Nash equilibrium. As a special case, we derive existence conditions for the multi-player zero-sum game.
Notes
\(\bar{N}:=\{1,\ldots,N\}\).
σ(H) denotes the spectrum of matrix H; \(\mathbb {C}^{-} := \{ \lambda \in \mathbb {C}\mid \operatorname{Re}(\lambda)<0\}\); \(\mathbb {C}^{+} := \{ \lambda \in \mathbb {C}\mid \operatorname{Re}(\lambda)>0\}\).
References
Dockner, E., Jørgensen, S., van Long, N., Sorger, G.: Differential Games in Economics and Management Science. Cambridge University Press, Cambridge (2000)
Jørgensen, S., Zaccour, G.: Differential Games in Marketing. Kluwer, Deventer (2003)
Plasmans, J., Engwerda, J., van Aarle, B., Di Bartolomeo, B., Michalak, T.: Dynamic modeling of monetary and fiscal cooperation among nations. In: Series: Dynamic Modeling and Econometrics in Economics and Finance, vol. 8. Springer, Berlin (2006)
Grass, D., Caulkins, J.P., Feichtinger, G., Tragler, G., Behrens, D.A.: Optimal Control of Nonlinear Processes: With Applications in Drugs. Springer, Berlin (2008)
Gu, D.: A differential game approach to formation control. IEEE Trans. Control System Technology 16, 85–93 (2006)
Mukaidani, H.: Soft-constrained stochastic Nash games for weakly coupled large-scale systems. Automatica 45, 1758–1764 (2009)
Lin, L., Wang, A., Zhou, X., Miao, X.: Noncooperative differential game based efficiency-aware traffic assignment for multipath routing in CRAHN. Wireless Personal Communications (2010). doi:10.1007/s11277-010-0063-z
Limebeer, D.J.N., Anderson, B.D.O., Hendel, B.: A Nash game approach to mixed H 2/H ∞ control. IEEE Trans. Autom. Control 39, 69–82 (1994)
Başar, T., Bernhard, P.: H ∞-Optimal Control and Related Minimax Design Problems. Birkhäuser, Boston (1995)
Kun, G.: Stabilizability, controllability, and optimal strategies of linear and nonlinear dynamical games. Ph.D. Thesis, RWTH-Aachen, Germany (2001)
van den Broek, W.A., Engwerda, J.C., Schumacher, J.M.: Robust equilibria in indefinite linear-quadratic differential games. J. Optim. Theory Appl. 119(3), 565–595 (2003)
Starr, A.W., Ho, Y.C.: Nonzero-sum differential games. J. Optim. Theory Appl. 3, 184–206 (1969). 1969
Lukes, D.L.: Equilibrium feedback control in linear games with quadratic costs. SIAM J. Control Optim. 9(2), 234–252 (1971)
Papavassilopoulos, G.P., Medanic, J., Cruz, J.: On the existence of Nash strategies and solutions to coupled Riccati equations in linear-quadratic games. J. Optim. Theory Appl. 28, 49–75 (1979)
Papavassilopoulos, G.P., Olsder, G.J.: On the linear-quadratic, closed-loop, no-memory Nash game. J. Optim. Theory Appl. 42(4), 551–560 (1984)
Freiling, G., Jank, G., Abou-Kandil, H.: On global existence of solutions to coupled matrix Riccati equations in closed-loop Nash games. IEEE Trans. Autom. Control 41, 264–269 (1996)
Weeren, A.J.T.M., Schumacher, J.M., Engwerda, J.C.: Asymptotic analysis of linear feedback Nash equilibria in nonzero-sum linear-quadratic differential games. J. Optim. Theory Appl. 101, 693–723 (1999)
Cruz, J.B., Chen, C.I.: Series Nash solution of two person nonzero-sum linear-quadratic games. J. Optim. Theory Appl. 7, 240–257 (1971)
Jodar, L., Abou-Kandil, H.: Kronecker products and coupled matrix Riccati differential equations. Linear Algebra Appl. 121, 39–51 (1989)
Başar, T., Olsder, G.J.: Dynamic Noncooperative Game Theory. SIAM, Philadelphia (1999)
Engwerda, J.C.: LQ Dynamic Optimization and Differential Games. Wiley, New York (2005)
Engwerda, J.C., Weeren, A.J.T.M.: A result on output feedback linear quadratic control. Automatica 44(1), 265–271 (2008)
Engwerda, J.C., Salmah Wijayanti, I.E.: The (multi-player) linear quadratic state feedback control problem for index one descriptor systems. J. Franklin Inst. 348(10), 2923–2941 (2011)
Wang, H.-S., Yung, C.-F., Chang, F-R.: H ∞ Control for Nonlinear Descriptor Systems. Springer, Berlin (2006)
Engwerda, J.C.: Solving the scalar feedback Nash algebraic Riccati equations: an eigenvector approach. IEEE Trans. Autom. Control 48, 847–853 (2003)
Engwerda, J.C.: Algorithms for computing Nash equilibria in deterministic LQ games. Computational Management Science 4, 113–140 (2007)
Hackbusch, W.: Integral Equations: Theory and Numerical Treatment. Birkhäuser, Berlin (1995)
Acknowledgements
The authors like to thank the referee for his comments on an earlier draft of the paper and the editor for his comments that helped to improve the readability of the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by José B. Cruz.
Appendix A
Appendix A
Lemma A.1
Let S:=BR −1 B T and c(.,x 0),p(⋅)∈L 2. Consider the minimization of the linear-quadratic cost function
subject to the state dynamics
and \(u \in \mathcal{U}_{s}(x_{0})\). Then,
-
(a)
with c(⋅)=p(⋅)=0, (9)–(10) has a solution for all x 0∈ℝn if and only if the algebraic Riccati equation
(11)has a symmetric stabilizing solution K(⋅) (i.e. A−SK is a stable matrix).
-
(b)
for every x 0, problem (9)–(10) has a solution iff. Equation (11) has a stabilizing solution. Moreover if (11) has a stabilizing solution then problem (9)–(10) has the unique solution u ∗(t)=−R −1 B T(Kx ∗(t)+h(t)). Here h(t) is given by \(h(t)=\int_{t}^{\infty}e^{-(A-SK)^{T}(t-s)}(Kc(s)+p(s))\,ds\), and x ∗(t) satisfies \(\dot{x}^{*}(t)=(A-SK)x^{*}(t)-Sh(t)+c(t), x^{*}(0)=x_{0}\).
Proof
Similar to the proof of [21, Theorem 5.16]. □
Proof of Theorem 3.1
⇒ part. Let u ∗ be a FNE. Then, for all t 0≥0, \(\lim_{t_{f}\rightarrow \infty} J_{i}(t_{0}, t_{f}, x_{0},u^{*}) \leq \lim_{t_{f}\rightarrow \infty} J_{i}(t_{0},t_{f},x_{0},u_{-i}^{*}(\alpha)) \) for every x 0 and input α such that \(u_{-i}^{*}(\alpha) \in \mathcal{U}_{s}\). Let t 0 be fixed. Consequently, with \(\bar{M}_{i}:=M_{i}-M_{i}E_{i+1}R_{ii}^{-1}E_{i+1}^{T}M_{i}\), for every x 0∈ℝn the minimization of
subject to the state equation
has a solution. Straightforward calculations show that \(\bar{M}_{i}\) is in the kernel of \(E_{i+1}^{T}\). Therefore, with
the above minimization problem can be rewritten as the minimization of
subject to the (nonhomogeneous) state equation
Let \(S_{i}:=B_{i}R_{ii}^{-1}B_{i}^{T}\) and \(\bar{A}_{i}:=A+B_{-i}F_{-i}^{*}-B_{i}R_{ii}^{-1}E_{i+1}^{T}M_{i}[I\ F_{-i}^{*^{T}}]^{T}-S_{i}K_{i}\). Then, by Lemma A.1, it follows that (16) below has a stabilizing solution:
According Lemma A.1 the minimization problem (14)–(15) has the unique solution
where
and K i is the stabilizing solution of (16). So, by (13), \(\tilde{u}_{i}(t)\) below solves problem (12).
Since the optimal control for this problem is uniquely determined and, by definition, the equilibrium control \(u_{i}^{*}=F_{i}^{*}x(t)+g_{i}^{*}(t)\) solves the optimization problem, it follows that
Consequently, ∀i, \(\bar{A}_{i}=A+BF^{*}=:A_{\mathrm{cl}}\). Furthermore, by (19),
or
Similarly, by (19), \(R_{ii}g_{i}^{*}+E_{i+1}^{T}M_{i}[0\ g_{-i}^{*^{T}}(t)]^{T}=-B_{i}^{T}h_{i}(t)\). Therefore
Furthermore, notice from (19) that \(-R_{ii}^{-1}E_{i+1}^{T}M_{i}[I\ F_{-i}^{*^{T}}]^{T}=F_{i}^{*}+R_{ii}^{-1}B_{i}^{T}K_{i}\). So,
On the other hand, by (19), \(-R_{ii}^{-1}B_{i}^{T}K_{i}=F_{i}^{*}+R_{ii}^{-1}E_{i+1}^{T}M_{i}[I\ F_{-i}^{*^{T}}]^{T}\). Substitution of this into \(K_{i}S_{i}K_{i}=(R_{ii}^{-1}B_{i}^{T}K_{i})^{T}R_{ii}R_{ii}^{-1}B_{i}^{T}K_{i}\) yields then the result, together with (20), that (16) can be rewritten as (4). Next, reconsider h i (t). Substitution of n i (s) and p i (s) into (17) shows that
Pre-multiplication of (19) by M i E i+1 shows that
Using this, h i (t) from (21) can be rewritten as
Since \(g_{-i}^{*}=I_{N,-i}g^{*}=-I_{N,-i}G^{-1}\tilde{B}^{T}h(t)\), (5) results. As σ(A cl)⊂ℂ− and c(⋅)∈L 2 it follows from, e.g., [27, Theorem 2.1.1] that (5) has a unique solution.
⇐ part. Let K be a stabilizing solution of (4) and define, for i≠1, \(u_{i}^{*}:=(F^{*}_{i},g_{i}^{*})\) by (3)–(5). Next, without loss of generality, consider for a fixed t 0 the minimization by player one of the cost functional
subject to the system \(\dot{x}(t)=(A+B_{-1}F_{-1}^{*})x(t)+B_{1}u_{1}(t)+B_{-1}g_{-1}^{*}(t)+c(t), x(t_{0})=x_{0}\). By the “⇒ part” the problem can be rewritten as the minimization of (14) subject to (15). From (3) it follows (see, e.g., (19) again) that (4) can be rewritten as (16). Taking i=1 in (16) shows that the ARE (22), below, has a stabilizing solution K=K 1:
But this implies, by Lemma A.1, that the minimization of (14) subject to (15) has a solution. By the “⇒ part” its solution is (17). So, using (3), the optimal control for player one is
So, in particular at time t=t 0, \((F_{1}^{*},g_{1}^{*}(t_{0}))\) is the optimal response of player one in case all other players i use the control strategy \((F_{i}^{*},g_{i}^{*}(t_{0}))\). Since the closed-loop system is \(\dot{x}(t)=A_{\mathrm{cl}}x(t)+Bg(t)+c(t)\), with v(s):=(2x T(s)[I F T]+[0 g T(s)])M i [0 g T(s)]T, J i can be rewritten as
□
Proof of Corollary 3.1
Since σ(H)⊂ℂ+, h(t) in (7) is well-defined. Differentiation of (7) shows that h(t) satisfies (6). Using (6) the right-hand side of (5) can be rewritten as
So, h(t) satisfies (5). Since (5) has a unique solution, this concludes the proof. □
Rights and permissions
About this article
Cite this article
Engwerda, J.C., Salmah Necessary and Sufficient Conditions for Feedback Nash Equilibria for the Affine-Quadratic Differential Game. J Optim Theory Appl 157, 552–563 (2013). https://doi.org/10.1007/s10957-012-0188-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10957-012-0188-1