Abstract
In this paper, we present a numerical algorithm to compute high-order approximate solutions to Bellman’s dynamic programming equation that arises in the optimal stabilization of discrete-time nonlinear control systems. The method uses a patchy technique to build local Taylor polynomial approximations defined on small domains, which are then patched together to create a piecewise smooth approximation. The numerical domain is dynamically computed as the level sets of the value function are propagated in reverse time under the closed-loop dynamics. The patch domains are constructed such that their radial boundaries are contained in the level sets of the value function and their lateral boundaries are constructed as invariant sets of the closed-loop dynamics. To minimize the computational effort, an adaptive subdivision algorithm is used to determine the number of patches on each level set depending on the relative error in the dynamic programming equation. Numerical tests in 2D and 3D are given to illustrate the accuracy of the method.
Similar content being viewed by others
References
Laub, A.J.: A Schur method for solving algebraic Riccati equations. IEEE Trans. Autom. Control 24, 913–921 (1979)
Crandall, M.G., Lions, P.L.: Viscosity solutions of Hamilton–Jacobi equations. Trans. Am. Math. Soc. 277, 1–42 (1983)
Albrekht, E.G.: On the optimal stabilization of nonlinear systems. J. Appl. Math. Mech. 25, 1254–1266 (1961)
Leake, R.J., Liu, R.-W.: Construction of suboptimal control sequences. SIAM J. Control 5, 54–63 (1967)
Lukes, D.L.: Optimal regulation of nonlinear dynamical systems. SIAM J. Control 7, 75–100 (1969)
Capuzzo-Dolcetta, I., Ishii, H.: Approximate solutions of the Bellman equation of deterministic control theory. Appl. Math. Optim. 11, 161–181 (1984)
Falcone, M., Ferretti, R.: Discrete time high-order schemes for viscosity solutions of Hamilton–Jacobi–Bellman equations. Numer. Math. 67, 315–344 (1994)
Beard, R.W., Sardis, G.N., Wen, J.T.: Galerkin approximations of the generalized Hamilton–Jacobi–Bellman equation. Automatica 33, 2159–2177 (1997)
Mracek, C.P., Cloutier, J.R.: Control designs for the nonlinear benchmark problem via the state-dependent Riccati equation method. Int. J. Robust Nonlinear Control 8, 401–433 (1998)
Sethian, J.A.: Level Set Methods and Fast Marching Methods. Cambridge University Press, Cambridge (1999)
Markman, J., Katz, I.N.: An iterative algorithm for solving Hamilton–Jacobi type equations. SIAM J. Sci. Comput. 22, 312–329 (2000)
Navasca, C., Krener, A.J.: Patchy solution of the HJB PDE. In: Chiuso, A., Ferrante, A., Pinzoni, S. (eds.) Modeling, Estimation and Control. Lecture Notes in Control and Information Sciences, vol. 364, pp. 251–270 (2007)
Sakamoto, N., van der Schaft, A.J.: Analytical approximation methods for the stabilizing solution of the Hamilton–Jacobi equation. IEEE Trans. Autom. Control 53, 2335–2350 (2008)
Beeler, S.C., Tran, H.T., Banks, H.T.: Feedback control methodologies for nonlinear systems. J. Optim. Theory Appl. 107, 1–33 (2000)
Cacace, S., Cristiani, E., Falcone, M., Picarelli, A.: A patchy dynamic programming scheme for a class of Hamilton–Jacobi–Bellman equations. SIAM J. Sci. Comput. 34, 2625–2649 (2012)
Garrard, W.L., Jordan, J.M.: Design of nonlinear automatic flight control systems. Automatica 13, 497–505 (1977)
Yoshida, T., Loparo, K.A.: Quadratic regulatory theory for analytic non-linear systems with additive controls. Automatica 25, 531–544 (1989)
Spencer, B.F., Timlin, T.L., Sain, M.K., Dyke, S.J.: Series solution of a class of nonlinear optimal regulators. J. Optim. Theory Appl. 91, 321–345 (1996)
Ancona, F., Bressan, A.: Patchy vector fields and asymptotic stabilization. ESAIM Control Optim. Calc. Var. 4, 445–471 (1999)
Hunt, T.: A proof of the higher order accuracy of the patchy method for solving the Hamilton–Jacobi–Bellman equation. PhD thesis, University of California, Davis, CA (2011)
Nešić, D., Teel, A.R.: Backstepping on the Euler approximate model for stabilization of sampled-data nonlinear systems. In: Proc. 40th IEEE Conference on Decision and Control, pp. 1737–1742 (2001)
Neśić, D., Teel, A.R., Kokotović, P.V.: Sufficient conditions for the stabilization of sampled-data nonlinear systems via discrete-time approximations. Syst. Control Lett. 38, 259–270 (1999)
Grüne, L., Nešić, D.: Optimization-based stabilization of sampled-data nonlinear systems via their approximate discrete-time models. SIAM J. Control Optim. 42(42), 98–122 (2003)
Keerthi, S.S., Gilbert, E.G.: Optimal infinite-horizon feedback laws for a general class of constrained discrete-time systems: Stability and moving-horizon approximations. J. Optim. Theory Appl. 57, 265–293 (1988)
Bellman, R.: Introduction to the Mathematical Theory of Control Processes. Academic Press, New York (1971)
Navasca, C.: Local solutions of the dynamic programming equations and the Hamilton–Jacobi–Bellman PDEs. PhD Thesis, University of California, Davis (2002)
Lewis, F.: Optimal Control. Wiley-Interscience, New York (1986)
Baumgardner, J.R., Frederickson, P.O.: Icosahedral discretization of the two-sphere. SIAM J. Numer. Anal. 22, 1107–1115 (1985)
Guckenheimer, J., Holmes, P.: Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields. Springer, New York (1983)
Witkin, A., Heckbert, P.: Using particles to sample and control implicit surfaces. Comput. Graph. 28, 269–278 (1994)
Acknowledgements
The authors would like to thank the anonymous reviewers for their valuable comments and suggestions that improved the presentation of this paper. The first author acknowledges the support of the Natural Research Council Postdoctoral Associateship program and the Naval Postgraduate School.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Lars Grüne.
Rights and permissions
About this article
Cite this article
Aguilar, C.O., Krener, A.J. Numerical Solutions to the Bellman Equation of Optimal Control. J Optim Theory Appl 160, 527–552 (2014). https://doi.org/10.1007/s10957-013-0403-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10957-013-0403-8