Abstract
In this paper, a Hamilton–Jacobi–Bellman (HJB) equation–based optimal control algorithm for robust controller design is proposed for nonlinear systems. The HJB equation is formulated using a suitable nonquadratic term in the performance functional to tackle constraints on the control input. Utilizing the direct method of Lyapunov stability, the controller is shown to be optimal with respect to a cost functional, which includes penalty on the control effort and the maximum bound on system uncertainty. The bounded controller requires the knowledge of the upper bound of system uncertainty. In the proposed algorithm, neural network is used to approximate the solution of HJB equation using least squares method. Proposed algorithm has been applied on the nonlinear system with matched and unmatched type system uncertainties and uncertainties in the input matrix. Necessary theoretical and simulation results are presented to validate proposed algorithm.
Similar content being viewed by others
References
Lin F, Brand RD, Sun J (1992) Robust control of nonlinear systems: compensating for uncertainty. Int J Contr 56(6):1453–1459
Lin F (2000) An optimal control approach to robust control design. Int J Contr 73(3):177–186
Mclian TW, Beard RW (1998) Successive Galerkin approximations to the nonlinear optimal control of an underwater robotic vehicle. In: Proceeding of the IEEE International Conference on Robotics and Automation, (Leuven, Belgium), pp 762–767
Beard RW, Saridis GN, Wen JT (1997) Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation. Automatica 33(12):2159–2177
Tassa Y, Erez T (2007) Least squares solutions of the HJB equation with neural network value-function approximators. IEEE Trans Neural Netw 18(4):1031–1040
Lyshevski SE (1998) Optimal control of nonlinear continuous-time systems: design of bounded controllers via generalized nonquadratic functionals. In: Proceeding of the American Control Conference, Philadelphia, Pennsylvania, pp 205–209
Abu-Khalaf M, Huang J, Lewis FL (2006) Nonlinear H2/H∞ constrained feedback control: a practical design approach using neural networks. Springer, Berlin
Abu-Khalaf M, Lewis FL (2005) Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 41:779–791
Chen YH (1988) Design of robust controllers for uncertain dynamical systems. IEEE Trans Autom Contr 33(5):487–491
Barmish BR, Corless M, Leitmann G (1983) A new class of stabilizing controllers for uncertain dynamical systems. SIAM J Contr Optim 21(2):246–255
Barmish BR (1985) Necessary and sufficient conditions for quadratic stabilizability of an uncertain system. J Optim Theory Appl 46(4):399–408
Qu Z (1995) Robust control of nonlinear uncertain systems without generalized matching conditions. IEEE Trans Autom Contr 40(8):1453–1460
Gopal M (1993) Modern control system theory, 2nd edn. New Age International Publishers, New Delhi
Haykin S (1998) Neural networks: a comprehensive edition. Prentice-Hall of India
Cheng T, Lewis FL, Abu-Khalaf M (2007) A neural network solution for fixed final time optimal control of nonlinear systems. Automatica 43:482–490
Cheng T, Lewis FL, Abu-Khalaf M (2007) Fixed–final time constrained optimal control of nonlinear systems using neural network HJB approach. IEEE Trans Neural Netw 18(6):1725–1737
Lin Feng, Brandt RD (1998) An optimal control approach to robust control of robot manipulators. IEEE Trans Robot Autom 14(1):69–77
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1
The dynamics of robot manipulator is given by
where q consists of joint variables, τ is the generalized forces, M(q) the inertia matrix, \( V(q,\dot{q}) \) the centripetal vector, G(q) the gravity vector and \( F(\dot{q}) \) the friction vector. For simplicity, we denote \( N(q,\dot{q}) = V(q,\dot{q}) + F(\dot{q}) + G(q) \). There are uncertainties in M(q) and \( N(q,\dot{q}) \) due to, say, unknown load on the manipulator and unmodeled frictions.
The robot manipulator has two joint variables: two angles q 1 and q 2. The inertia matrix is
where
For simplicity, we denote \( N(q,\dot{q}) = V(q,\dot{q}) + F(\dot{q}) + G(q) \). There are uncertainties in M(q) and \( N(q,\dot{q}) \) due to, say, unknown load on the manipulator and unmodeled frictions.
The centripetal vector is
where
The friction vector
Without loss of generality, we assume that gravity vector G(q) = 0; otherwise, it is easy to calculate the additional torque to balance the gravity. The value of the parameters is as follows:
The load unknown, so we assume
where ε ∈ [0, 1].
With these values, we can calculate M(q) and \( N(q,\dot{q}) \) as follows:
Assuming ε = 1 for the simplicity, we can select the following M 0(q) and \( N_{0} (q,\dot{q}) \):
Appendix 2
In Sects. 5.1 and 5.2, we applied proposed algorithm on two different class of nonlinear systems. Now we would like to show that proposed algorithm will also work for the linear system:Consider a linear uncertain system
It is assumed that f is bounded by a known term, \( f_{\max } \), i.e., \( \left\| f \right\| \le f_{\max } \).
For the given open-loop system, our aim is to find a feedback control law u = Kx such that the closed-loop system is global asymptotically stable for all admissible uncertainties f. With the performance function \( \int_{0}^{\infty } {\left( {x^{T} \mu f_{\max }^{2} I_{n \times n} x + x^{T} Qx + u^{T} Ru} \right)} {\text{d}}t, \) where μ > 0 is design parameter, this problem can be solved using following HJB equation:
where \( V^{ * } (x) = x^{T} Px \). It gives optimal control law \( u^{ * } = Kx = - {\frac{1}{2 \, }}R^{ - 1} B^{T} V_{x}^{ * } \) and corresponding algebraic Reccati equation (ARE) is
u* becomes a robust control law if
It is similar to condition (11) proposed for nonlinear system.
We have considered following system:
where \( p \in \left[ { - 10,1} \right] \) is the uncertain term. We would like to design a robust control law u = Kx such that the closed-loop system is stable for all \( p \in \left[ { - 10,1} \right]. \)
Since f = p, it gives \( f_{\max }^{2} = \left\| p \right\|^{2} \le 100 \).
The corresponding ARE (I) is solved with μ = 2 and identity matrices Q and R. It gives solution of ARE as
Condition (II) is always satisfied as
Hence, we have shown that condition similar to (11) is always satisfied for the linear system.
Rights and permissions
About this article
Cite this article
Adhyaru, D.M., Kar, I.N. & Gopal, M. Bounded robust control of nonlinear systems using neural network–based HJB solution. Neural Comput & Applic 20, 91–103 (2011). https://doi.org/10.1007/s00521-010-0441-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-010-0441-1