Abstract
A linear quadratic differential game on an infinite-time horizon is studied in the case when the controls of the minimizing player are subject to constraints. A sufficient condition for a saddle point equilibrium is provided based on the conversion of the infinite-time horizon game to a game on a finite-time horizon. The method is applied to a simple monetary policy model as an illustrative example.


Similar content being viewed by others
Notes
Differential games with orthogonal instantaneous reaction functions are such that the first derivatives of the cost functions and dynamic equations with respect to the controls of each player do not depend on the controls of the other player (Rubio [16], Def. 2.4).
Werning’s original model does not include the control in the loss function, but other authors do, motivating this choice by the preference of central banks to adjust interest rates gradually [19].
While a natural rate of 5 percent may be high for advanced countries, this is a plausible value for a less developed economy. Also, recent experience suggests that the zero lower bound is not as strict as previously perceived.
References
Başar T, Bernhard P (1995) \(H^{\infty }\)- optimal control and related minimax design problems: a dynamic game approach, Birkhäuser
Başar T, Olsder GJ (1999) Dynamic noncooperative game theory, SIAM
Borzi A, Campana FC (2021) On the SQH method for solving differential nash games, J Dyn Control Syst
Cochrane J (2013) The New Keynsian Liquidity Trap, NBER Working Paper 19476, Cambridge, MA
Dennis J, Leitemo K, Soderstrom U (2009) Methods for robust control. J Econ Dyn Control 33:1604–1616
Dockner E, Jorgensen S, Van Long N, Sorger G (2000) Differential games in economics and management science. Cambridge University Press, Cambridge
Engwerda J (2005) LQ dynamic optimization and differential games. John Wiley and Sons Ltd, New Jersey
Engwerda J (2022) Min-max robust control in LQ-differential games. Dyn Games Appl. https://doi.org/10.1007/s13235-021-00421-z
Giannoni M (2007) Robust optimal monetary policy in a forward-looking model with parameter and shock uncertainty. J Appl Econom 22:179–213
Goebel R, Subbotin M (2007) Continuous time linear quadratic regulator with control constraints via convex duality. IEEE Trans Autom Control 52(5):886–892
Hansen L, Sargent T (2008) Robustness. Princeton University Press, Princeton
Ivanov GE (1997) Saddle point for differential games with strongly convex-concave integrand. Math Notes 62(5):607–622
Krylov IA, Chernousko FL (1962) On the method of successive approximations for solution of optimal control problems. J Comp Math Math Phys 2(6):1132–1139 ((in Russian))
Lyubushin AA (1982) Modifications of the method of successive approximations for solving optimal control problems. J Comp Math Math Phys 22(1):29–34 ((in Russian))
Onatski A, Williams N (2003) Modeling model uncertainty. J Eur Econ Assoc 1:1087–1122
Rubio S (2006) On coincidence of feedback nash equilibria and stackelberg equilibria in economic applications of differential games. J Optim Theory Appl 128(1):203–221
Werning I (2012) Managing a liquidity trap: monetary and fiscal policy, MIT Working Paper, Cambridge
Williams RJ (1980) Sufficient conditions for nash equilibria in N-person games over reflexive banach spaces. J Optim Theory Appl 30(3):383–394
Woodford M (2003) Optimal interest-rate smoothing. Rev Econ Stud 70:861–886
Acknowledgements
The authors are grateful to the editor and two anonymous referees whose valuable comments and suggestions, including the question posed in Remark 1, significantly helped to improve the paper.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
No datasets were generated or analyzed during the current study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
M.I. Krastanov: This work has been partially supported by Sofia University “St. Kliment Ohridski” under contract No. 80-10-180/27.05.2022 and by the Bulgarian Ministry of Science and Higher Education National Fund for Science Research under contract KP-06-H22-4/04.12.2018.
B.K. Stefanov: This work has been partially supported by the Center of Excellence in Informatics and ICT, Grant No. BG05M2OP001-1.001-0003 (financed by the Science and Education for Smart Growth Operational Program (2014–2020) and co-financed by the European Union through the European structural and investment funds).
Appendix
Appendix
Proof
For simplicity, we present the proof for \(t_0= 0\). The case \(t_0>0\) can be considered in the same way. We set \(\Vert z\Vert _\Theta ^2:= z^\top \Theta z \), where \(z^\top \) is the transpose of the vector z and \(\Theta \) is an arbitrary symmetric positive definite square matrix of suitable dimension. Because \(\int _{0}^\infty \Vert w(t)\Vert ^2 dt < \infty \) and the matrices
are asymptotically stable, we have that \(x_{u,w}(t)\rightarrow 0\), \(x_{{\bar{u}},w}(t)\rightarrow 0\), \(x_{u, {\bar{w}}}(t) \rightarrow 0 \) and \(x_{{\bar{u}}, {\bar{w}}}(t) \rightarrow 0\) as \(t \rightarrow \infty \). We consider the case of open-loop controls and present the proof in full detail. (The cases, where one or two closed-loop controls are used, are simpler and can be studied in the same way.) Let \(u \in \mathcal U\) and \(w\in \mathcal W\) be arbitrary admissible open-loop controls, and let \(x_{u,w}(T)\), \(T >0\), be the corresponding trajectory. Then for each \(\tau \in (0, T)\), we have that
Because the eigenvalues of the matrix A have negative real parts, there exist a constant \(c>0\) and a real number \(\alpha >0\) such that \(\left\| e^{At}\right\| \le c e^{-\alpha t}\) for each \(t\ge 0\). Hence,

Applying the H\(\ddot{\text{ o }}\)lder inequality, one can check that

and then to obtain the following two estimates:
and
Hence,
Similarly, the following estimates holds true:
Since \(\tau \) is an arbitrary element of (0, T), we obtain that
and hence
Because
for each \(\varepsilon > 0\), there exists \(\tau _\varepsilon \) such that
Moreover, there exists \(T_\varepsilon >\tau _\varepsilon \) such that for each \(T>T_\varepsilon \)
From here, it follows that
and hence
Then,
i.e.,
The definition of \({\bar{u}}\) and \({\bar{w}}\) (according to (4)) implies that
Then,
Also,
and
Hence, we have obtained that
for each control function pair \(u\in \mathcal U\) and \(w \in \mathcal W\), i.e., \(({\bar{u}}, {\bar{w}})\) is a saddle point (Nash equilibrium) for the considered differential game. \(\square \)
Proof
Since U is a convex neighborhood of the origin in \({\mathbb {R}}^k\), there exists \(\varepsilon >0\) such that the closed ball \(\varepsilon \bar{\textbf{ B }}_k \subset {\mathbb {R}}^k \) is a subset of U. We can choose the positive real \(\delta _0>0\) to be so small that the norm of the vector \(-R^{-1}B^\top Px\) is less than \(\varepsilon \) for each \(x \in \delta _0 \bar{\textbf{ B }}_n \), and hence \(-R^{-1}B^\top Px\in U\). Thus, i) holds true for each \(0<\delta \le \delta _0\).
Let
Because the matrix P is positive definite, the real \(\bar{\alpha }\) is positive. We choose an arbitrary \(\alpha \in (0, \bar{\alpha })\) and set
Clearly, the set \(\Omega \) is a compact neighborhood of the origin contained in the interior of the ball \(\delta _0\bar{\textbf{ B}}_n \subset {\mathbb {R}}^n\).
Let x be an arbitrary point from \(\Omega \). The definition of the set \(\Omega \) implies that \(x \in \delta _0\bar{\textbf{ B }}_n \) and \(x^\top P x \le \alpha \). Then, the relation \(x \in \delta _0\bar{\textbf{ B}}_n \) implies that \(-R^{-1}B^\top Px \in U\).
Let us fix an arbitrary point y from \(\Omega \) and a real \(\tau \ge t_0\). Then, the trajectory \({\bar{x}}_{{\bar{u}}, \bar{w}}(\cdot ,y,\tau )\) corresponding to the controls \({\bar{u}}\) and \(\bar{w}\) and starting from y at the moment of time \(\tau \) is the solution of the following Cauchy problem:
where \({\bar{A}}=A -BR^{-1}B^\top P + ES^{-1}E^\top P\). Because P is a symmetric positive definite solution of the matrix algebraic Riccati equation
one can check that
Indeed, we have that
Because \({\bar{Q}}\) is a symmetric positive definite matrix, the last equality implies that the function \(V_L:{\mathbb {R}}^n \rightarrow {\mathbb {R}}\) defined as \(V_L(x) = x^\top Px\) is a Lyapunov function for the system
and because the function \(V _L({\bar{x}}(\cdot )) \) is decreasing, we obtain that
for each \(t \ge t_0\), i.e., \({\bar{x}}_{{\bar{u}},{\bar{w}} }(t,y,\tau ) \in \Omega \) for each \(t\ge t_0\). Thus, ii) also holds true.
The positive definiteness of matrices P and Q implies that the system (14) is asymptotically stable at the origin, and hence \({\bar{x}}_{{\bar{u}},{\bar{w}} }(t,y,\tau )\) tends to zero as \(t \rightarrow +\infty \). This completes the proof. \(\square \)
Proof
Let us assume that \(x_{{\hat{u}}, {\hat{w}}}( t)\) does not belong to \( \delta \bar{\textbf{ B }}_n\) for each \( t\ge t_0\), i.e., for each \( t\ge t_0\), we have that \(\Vert x_{{\hat{u}}, {\hat{w}}}( t)\Vert >\delta \). The positive definiteness of the matrices P and Q implies the existence of positive reals p and q such that \(x^\top P x \ge p \Vert x\Vert ^2\) and \(x^\top Q x \ge q \Vert x\Vert ^2\) for each \(x \in \mathbb R^n\). Then,
Then,
The last inequality is impossible because \(\int _{t_0}^{\infty } \Vert {\hat{w}} (t)\Vert ^2<\infty \). The obtained contradiction shows that there exists \(T\ge t_0\) such that \(x_{{\hat{u}}, {\hat{w}}}( T) \in \delta \bar{\textbf{ B }}_n\). According to Proposition 2, we obtain that \(x_{{\hat{u}}, {\hat{w}}}( t) \in \delta \bar{\textbf{ B }}_n\) for each \(t\ge T\) and \(x_{{\hat{u}}, {\hat{w}}}( t) \rightarrow 0\) as \(t\rightarrow \infty \). This completes the proof. \(\square \)
Proof
Because \(\Vert x_{{\hat{u}}, {\hat{w}}} (T)\Vert \le \delta \), Proposition 2 implies that
From here, we obtain that
Let u be an arbitrary element of \(\mathcal{U}_U\) defined on \([t_0, +\infty )\). We fix an arbitrary \(\varepsilon >0\) and choose \(\hat{w}_{u,\varepsilon } \in \mathcal W \) so that
Then,
(here we take into account (5))
If we let \(\varepsilon \) to tend to zero, then we obtain that
Because u is an arbitrary element of \(\mathcal{U}_U\), the last inequality implies that
This completes the proof. \(\square \)
Proof
The corresponding proof is similar to the proof of Proposition 4. \(\square \)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Krastanov, M.I., Rozenov, R. & Stefanov, B.K. On a Constrained Infinite-Time Horizon Linear Quadratic Game. Dyn Games Appl 13, 843–858 (2023). https://doi.org/10.1007/s13235-022-00484-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13235-022-00484-6