Necessary and Sufficient Conditions for Feedback Nash Equilibria for the Affine-Quadratic Differential Game

Engwerda, J. C.; Salmah

doi:10.1007/s10957-012-0188-1

Necessary and Sufficient Conditions for Feedback Nash Equilibria for the Affine-Quadratic Differential Game

Published: 03 October 2012

Volume 157, pages 552–563, (2013)
Cite this article

Journal of Optimization Theory and Applications Aims and scope Submit manuscript

J. C. Engwerda¹ &
Salmah²

714 Accesses
13 Citations
Explore all metrics

Abstract

In this note, we consider the non-cooperative linear feedback Nash quadratic differential game with an infinite planning horizon. The performance function is assumed to be indefinite and the underlying system affine. We derive both necessary and sufficient conditions under which this game has a Nash equilibrium. As a special case, we derive existence conditions for the multi-player zero-sum game.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Notes

$\bar{N}:=\{1,\ldots,N\}$.
σ(H) denotes the spectrum of matrix H; $\mathbb {C}^{-} := \{ \lambda \in \mathbb {C}\mid \operatorname{Re}(\lambda)<0\}$; $\mathbb {C}^{+} := \{ \lambda \in \mathbb {C}\mid \operatorname{Re}(\lambda)>0\}$.

References

Dockner, E., Jørgensen, S., van Long, N., Sorger, G.: Differential Games in Economics and Management Science. Cambridge University Press, Cambridge (2000)
Book MATH Google Scholar
Jørgensen, S., Zaccour, G.: Differential Games in Marketing. Kluwer, Deventer (2003)
Google Scholar
Plasmans, J., Engwerda, J., van Aarle, B., Di Bartolomeo, B., Michalak, T.: Dynamic modeling of monetary and fiscal cooperation among nations. In: Series: Dynamic Modeling and Econometrics in Economics and Finance, vol. 8. Springer, Berlin (2006)
Google Scholar
Grass, D., Caulkins, J.P., Feichtinger, G., Tragler, G., Behrens, D.A.: Optimal Control of Nonlinear Processes: With Applications in Drugs. Springer, Berlin (2008)
Book MATH Google Scholar
Gu, D.: A differential game approach to formation control. IEEE Trans. Control System Technology 16, 85–93 (2006)
Article Google Scholar
Mukaidani, H.: Soft-constrained stochastic Nash games for weakly coupled large-scale systems. Automatica 45, 1758–1764 (2009)
Article MathSciNet MATH Google Scholar
Lin, L., Wang, A., Zhou, X., Miao, X.: Noncooperative differential game based efficiency-aware traffic assignment for multipath routing in CRAHN. Wireless Personal Communications (2010). doi:10.1007/s11277-010-0063-z
Google Scholar
Limebeer, D.J.N., Anderson, B.D.O., Hendel, B.: A Nash game approach to mixed H ₂/H _∞ control. IEEE Trans. Autom. Control 39, 69–82 (1994)
Article MathSciNet MATH Google Scholar
Başar, T., Bernhard, P.: H _∞-Optimal Control and Related Minimax Design Problems. Birkhäuser, Boston (1995)
MATH Google Scholar
Kun, G.: Stabilizability, controllability, and optimal strategies of linear and nonlinear dynamical games. Ph.D. Thesis, RWTH-Aachen, Germany (2001)
van den Broek, W.A., Engwerda, J.C., Schumacher, J.M.: Robust equilibria in indefinite linear-quadratic differential games. J. Optim. Theory Appl. 119(3), 565–595 (2003)
Article MathSciNet MATH Google Scholar
Starr, A.W., Ho, Y.C.: Nonzero-sum differential games. J. Optim. Theory Appl. 3, 184–206 (1969). 1969
Article MathSciNet MATH Google Scholar
Lukes, D.L.: Equilibrium feedback control in linear games with quadratic costs. SIAM J. Control Optim. 9(2), 234–252 (1971)
Article MathSciNet MATH Google Scholar
Papavassilopoulos, G.P., Medanic, J., Cruz, J.: On the existence of Nash strategies and solutions to coupled Riccati equations in linear-quadratic games. J. Optim. Theory Appl. 28, 49–75 (1979)
Article MathSciNet MATH Google Scholar
Papavassilopoulos, G.P., Olsder, G.J.: On the linear-quadratic, closed-loop, no-memory Nash game. J. Optim. Theory Appl. 42(4), 551–560 (1984)
Article MathSciNet MATH Google Scholar
Freiling, G., Jank, G., Abou-Kandil, H.: On global existence of solutions to coupled matrix Riccati equations in closed-loop Nash games. IEEE Trans. Autom. Control 41, 264–269 (1996)
Article MathSciNet MATH Google Scholar
Weeren, A.J.T.M., Schumacher, J.M., Engwerda, J.C.: Asymptotic analysis of linear feedback Nash equilibria in nonzero-sum linear-quadratic differential games. J. Optim. Theory Appl. 101, 693–723 (1999)
Article MathSciNet MATH Google Scholar
Cruz, J.B., Chen, C.I.: Series Nash solution of two person nonzero-sum linear-quadratic games. J. Optim. Theory Appl. 7, 240–257 (1971)
Article MathSciNet MATH Google Scholar
Jodar, L., Abou-Kandil, H.: Kronecker products and coupled matrix Riccati differential equations. Linear Algebra Appl. 121, 39–51 (1989)
Article MathSciNet MATH Google Scholar
Başar, T., Olsder, G.J.: Dynamic Noncooperative Game Theory. SIAM, Philadelphia (1999)
MATH Google Scholar
Engwerda, J.C.: LQ Dynamic Optimization and Differential Games. Wiley, New York (2005)
Google Scholar
Engwerda, J.C., Weeren, A.J.T.M.: A result on output feedback linear quadratic control. Automatica 44(1), 265–271 (2008)
Article MathSciNet MATH Google Scholar
Engwerda, J.C., Salmah Wijayanti, I.E.: The (multi-player) linear quadratic state feedback control problem for index one descriptor systems. J. Franklin Inst. 348(10), 2923–2941 (2011)
Article MathSciNet MATH Google Scholar
Wang, H.-S., Yung, C.-F., Chang, F-R.: H _∞ Control for Nonlinear Descriptor Systems. Springer, Berlin (2006)
MATH Google Scholar
Engwerda, J.C.: Solving the scalar feedback Nash algebraic Riccati equations: an eigenvector approach. IEEE Trans. Autom. Control 48, 847–853 (2003)
Article MathSciNet MATH Google Scholar
Engwerda, J.C.: Algorithms for computing Nash equilibria in deterministic LQ games. Computational Management Science 4, 113–140 (2007)
Article MathSciNet MATH Google Scholar
Hackbusch, W.: Integral Equations: Theory and Numerical Treatment. Birkhäuser, Berlin (1995)
MATH Google Scholar

Download references

Acknowledgements

The authors like to thank the referee for his comments on an earlier draft of the paper and the editor for his comments that helped to improve the readability of the paper.

Author information

Authors and Affiliations

Department of Econometrics and O.R., Tilburg University, P.O. Box 90153, 5000 LE, Tilburg, The Netherlands
J. C. Engwerda
Dept. of Mathematics, Gadjah Mada University, 55281, Yogyakarta, Indonesia
Salmah

Authors

J. C. Engwerda
View author publications
You can also search for this author in PubMed Google Scholar
Salmah
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to J. C. Engwerda.

Additional information

Communicated by José B. Cruz.

Appendix A

Lemma A.1

Let S:=BR ⁻¹ B ^T and c(.,x ₀),p(⋅)∈L ₂. Consider the minimization of the linear-quadratic cost function

(9)

subject to the state dynamics

$$ \dot{x}(t) = A x(t) + B u(t) + c(t,x_0), \quad x(0)=x_0, $$

(10)

and $u \in \mathcal{U}_{s}(x_{0})$. Then,

(a)
with c(⋅)=p(⋅)=0, (9)–(10) has a solution for all x ₀∈ℝⁿ if and only if the algebraic Riccati equation
(11)
has a symmetric stabilizing solution K(⋅) (i.e. A−SK is a stable matrix).
(b)
for every x ₀, problem (9)–(10) has a solution iff. Equation (11) has a stabilizing solution. Moreover if (11) has a stabilizing solution then problem (9)–(10) has the unique solution u ^∗(t)=−R ⁻¹ B ^T(Kx ^∗(t)+h(t)). Here h(t) is given by $h(t)=\int_{t}^{\infty}e^{-(A-SK)^{T}(t-s)}(Kc(s)+p(s))\,ds$, and x ^∗(t) satisfies $\dot{x}^{*}(t)=(A-SK)x^{*}(t)-Sh(t)+c(t), x^{*}(0)=x_{0}$.

Proof

Similar to the proof of [21, Theorem 5.16]. □

Proof of Theorem 3.1

⇒ part. Let u ^∗ be a FNE. Then, for all t ₀≥0, $\lim_{t_{f}\rightarrow \infty} J_{i}(t_{0}, t_{f}, x_{0},u^{*}) \leq \lim_{t_{f}\rightarrow \infty} J_{i}(t_{0},t_{f},x_{0},u_{-i}^{*}(\alpha)) $ for every x ₀ and input α such that $u_{-i}^{*}(\alpha) \in \mathcal{U}_{s}$. Let t ₀ be fixed. Consequently, with $\bar{M}_{i}:=M_{i}-M_{i}E_{i+1}R_{ii}^{-1}E_{i+1}^{T}M_{i}$, for every x ₀∈ℝⁿ the minimization of

(12)

subject to the state equation

has a solution. Straightforward calculations show that $\bar{M}_{i}$ is in the kernel of $E_{i+1}^{T}$. Therefore, with

(13)

the above minimization problem can be rewritten as the minimization of

(14)

subject to the (nonhomogeneous) state equation

(15)

Let $S_{i}:=B_{i}R_{ii}^{-1}B_{i}^{T}$ and $\bar{A}_{i}:=A+B_{-i}F_{-i}^{*}-B_{i}R_{ii}^{-1}E_{i+1}^{T}M_{i}[I\ F_{-i}^{*^{T}}]^{T}-S_{i}K_{i}$. Then, by Lemma A.1, it follows that (16) below has a stabilizing solution:

(16)

According Lemma A.1 the minimization problem (14)–(15) has the unique solution

(17)

where

and K _i is the stabilizing solution of (16). So, by (13), $\tilde{u}_{i}(t)$ below solves problem (12).

(18)

Since the optimal control for this problem is uniquely determined and, by definition, the equilibrium control $u_{i}^{*}=F_{i}^{*}x(t)+g_{i}^{*}(t)$ solves the optimization problem, it follows that

(19)

Consequently, ∀i, $\bar{A}_{i}=A+BF^{*}=:A_{\mathrm{cl}}$. Furthermore, by (19),

or

Similarly, by (19), $R_{ii}g_{i}^{*}+E_{i+1}^{T}M_{i}[0\ g_{-i}^{*^{T}}(t)]^{T}=-B_{i}^{T}h_{i}(t)$. Therefore

Furthermore, notice from (19) that $-R_{ii}^{-1}E_{i+1}^{T}M_{i}[I\ F_{-i}^{*^{T}}]^{T}=F_{i}^{*}+R_{ii}^{-1}B_{i}^{T}K_{i}$. So,

$$ \bigl[A+B_{-i}F_{-i}^{*}-B_iR_{ii}^{-1}E_{i+1}^{T}M_i \bigl[I\ F_{-i}^{*^T} \bigr]^T \bigr]^TK_i = A_\mathrm{cl}^TK_i+K_iS_iK_i. $$

(20)

On the other hand, by (19), $-R_{ii}^{-1}B_{i}^{T}K_{i}=F_{i}^{*}+R_{ii}^{-1}E_{i+1}^{T}M_{i}[I\ F_{-i}^{*^{T}}]^{T}$. Substitution of this into $K_{i}S_{i}K_{i}=(R_{ii}^{-1}B_{i}^{T}K_{i})^{T}R_{ii}R_{ii}^{-1}B_{i}^{T}K_{i}$ yields then the result, together with (20), that (16) can be rewritten as (4). Next, reconsider h _i(t). Substitution of n _i(s) and p _i(s) into (17) shows that

(21)

Pre-multiplication of (19) by M _i E _i+1 shows that

Using this, h _i(t) from (21) can be rewritten as

Since $g_{-i}^{*}=I_{N,-i}g^{*}=-I_{N,-i}G^{-1}\tilde{B}^{T}h(t)$, (5) results. As σ(A _cl)⊂ℂ⁻ and c(⋅)∈L ² it follows from, e.g., [27, Theorem 2.1.1] that (5) has a unique solution.

⇐ part. Let K be a stabilizing solution of (4) and define, for i≠1, $u_{i}^{*}:=(F^{*}_{i},g_{i}^{*})$ by (3)–(5). Next, without loss of generality, consider for a fixed t ₀ the minimization by player one of the cost functional

subject to the system $\dot{x}(t)=(A+B_{-1}F_{-1}^{*})x(t)+B_{1}u_{1}(t)+B_{-1}g_{-1}^{*}(t)+c(t), x(t_{0})=x_{0}$. By the “⇒ part” the problem can be rewritten as the minimization of (14) subject to (15). From (3) it follows (see, e.g., (19) again) that (4) can be rewritten as (16). Taking i=1 in (16) shows that the ARE (22), below, has a stabilizing solution K=K ₁:

(22)

But this implies, by Lemma A.1, that the minimization of (14) subject to (15) has a solution. By the “⇒ part” its solution is (17). So, using (3), the optimal control for player one is

So, in particular at time t=t ₀, $(F_{1}^{*},g_{1}^{*}(t_{0}))$ is the optimal response of player one in case all other players i use the control strategy $(F_{i}^{*},g_{i}^{*}(t_{0}))$. Since the closed-loop system is $\dot{x}(t)=A_{\mathrm{cl}}x(t)+Bg(t)+c(t)$, with v(s):=(2x ^T(s)[I F ^T]+[0 g ^T(s)])M _i[0 g ^T(s)]^T, J _i can be rewritten as

□

Proof of Corollary 3.1

Since σ(H)⊂ℂ⁺, h(t) in (7) is well-defined. Differentiation of (7) shows that h(t) satisfies (6). Using (6) the right-hand side of (5) can be rewritten as

So, h(t) satisfies (5). Since (5) has a unique solution, this concludes the proof. □

Rights and permissions

Reprints and permissions

About this article

Cite this article

Engwerda, J.C., Salmah Necessary and Sufficient Conditions for Feedback Nash Equilibria for the Affine-Quadratic Differential Game. J Optim Theory Appl 157, 552–563 (2013). https://doi.org/10.1007/s10957-012-0188-1

Download citation

Received: 06 December 2011
Accepted: 20 September 2012
Published: 03 October 2012
Issue Date: May 2013
DOI: https://doi.org/10.1007/s10957-012-0188-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Necessary and Sufficient Conditions for Feedback Nash Equilibria for the Affine-Quadratic Differential Game

Abstract

Access this article

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix A

Appendix A

Lemma A.1

Proof

Proof of Theorem 3.1

Proof of Corollary 3.1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation