Abstract
In this paper, a comparison is addressed between two methods, that is, the optimal tracking control methods of unknown nonlinear systems with and without constrained input. Firstly, the optimal tracking problem for a class of affine nonlinear system is formulated. The tracking cost functions are also defined, both for the two methods. The two methods are proved to be equivalent as the actuator bound is large enough. Integral reinforcement learning (IRL) algorithm is employed to solve the optimal tracking problem by using only system data. To facilitate the implementation of the IRL algorithm, the actor-critic neural network technique and the least squares method are employed in approximating the unknown weights iteratively. In the simulation, a detailed comparison is given to demonstrate the relationship between the two methods in the aspects of control input and tracking cost value.
Similar content being viewed by others
References
Modares H, Lewis F (2014) Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning. Automatica 50:1780–1792
Modares H, Lewis F, Naghibi-Sistani M-B (2014) Integral reinforcement learning and experience replay for adaptive optimal control of partially unknown constrained-input continuous-time systems. Automatica 50:193–202
Yang X, Liu D, Luo B, Li C (2016) Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning. Inf Sci 369:731–747
Cui X, Zhang H, Luo Y, Jiang H (2017) Adaptive dynamic programming for H tracking design of uncertain nonlinear systems with disturbances and input constraints. Int J Adapt Control Signal Process 31:1567–1583
Ding C, Li Y, Zhang L, Zhang J, Yang L, Wei W (2018) Fast-convergent fully connected deep learning model using constrained nodes input. Neural Process Lett pp 1–11. https://doi.org/10.1007/s11063-018-9872-y
Lewis FL, Liu D (2013) Reinforcement learning and approximate dynamic programming for feedback control, vol 17. Wiley, Hoboken
Xiao G, Zhang H, Luo Y, Jiang H (2016) Data-driven optimal tracking control for a class of affine non-linear continuous-time systems with completely unknown dynamics. IET Control Theory Appl 10:700–710
Liu C, Zhang H, Xiao G, Sun S (2019) Integral reinforcement learning based decentralized optimal tracking control of unknown nonlinear large-scale interconnected systems with constrained-input. Neurocomputing 323:1–11
Devasia S, Degang C, Paden B (1996) Nonlinear inversion-based output tracking. IEEE Trans Autom Control 41:930–942
Qin C, Zhang H, Luo Y (2013) Online optimal tracking control of continuous-time linear systems with unknown dynamics by using adaptive dynamic programming. Int J Control 87:1000–1009
Huang HC, Chiang CH (2016) An evolutionary radial basis function neural network with robust genetic-based immunecomputing for online tracking control of autonomous robots. Neural Process Lett 44:19–35
Zhang H, Cui L, Zhang X, Luo Y (2011) Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Trans Neural Netw 22:2226–2236
Lin Q, Wei Q, Liu D (2017) A novel optimal tracking control scheme for a class of discrete-time nonlinear systems using generalised policy iteration adaptive dynamic programming algorithm. Int J Syst Sci 48:525–534
Wei Q, Liu D, Xu Y (2016) Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach. Soft Comput 20:697–706
Song R, Wei Q, Xiao W (2016) ADP-based optimal sensor scheduling for target tracking in energy harvesting wireless sensor networks. Neural Comput Appl 27:1543–1551
Kiumarsi B, Lewis FL, Modares H, Karimpour A, Naghibi-Sistani MB (2014) Reinforcement -learning for optimal tracking control of linear discrete-time systems with unknown dynamics. Automatica 50:1167–1175
Han K, Feng J, Cui X (2017) Fault-tolerant optimised tracking control for unknown discrete-time linear systems using a combined reinforcement learning and residual compensation methodology. Int J Syst Sci 48:2811–2825
Qu Q, Zhang H, Feng T, Jiang H (2017) Decentralized adaptive tracking control scheme for nonlinear large-scale interconnected systems via adaptive dynamic programming. Neurocomputing 225:1–10
Na J, Herrmann G (2014) Online adaptive approximate optimal tracking control with simplified dual approximation structure for continuous-time unknown nonlinear systems. IEEE CAA J Autom Sin 1:412–422
Zhang H, Luo Y, Liu D (2009) Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints. IEEE Trans Neural Netw 20:1490–1503
Liu Y, Zhang H, Luo Y, Han J (2016) ADP based optimal tracking control for a class of linear discrete-time system with multiple delays. J Frankl Inst 353:2117–2136
Luo B, Wu H, Huang T, Liu D (2014) Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design. Automatica 50:3281–3290
Wang D, He H, Liu D (2017) Adaptive critic nonlinear robust control: a survey. IEEE Trans Cybern 47:3429–3451
Lv Y, Na J, Ren X (2017) Online H\(\infty \) control for completely unknown nonlinear systems via an identifier critic-based ADP structure. Int J Control 3:1–28
Wei Q, Song R, Yan P (2016) Data-driven zero-sum neuro-optimal control for a class of continuous-time unknown nonlinear systems with disturbance using ADP. IEEE Trans Neural Netw Learn Syst 27:444–458
Yang X, Liu D, Wei Q, Wang D (2016) Guaranteed cost neural tracking control for a class of uncertain nonlinear systems using adaptive dynamic programming. Neurocomputing 198:80–90
Zhang H, Zhang J, Yang G, Luo Y (2015) Leader-based optimal coordination control for the consensus problem of multiagent differential games via fuzzy adaptive dynamic programming. IEEE Trans Fuzzy Syst 23:152–163
Zhang H, Liu D, Luo Y, Wang D (2012) Adaptive dynamic programming for control: algorithms and stability. Springer, London
Wei Q, Zhang H, Dai J (2009) Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions. Neurocomputing 72:1839–1848
Wang D, Liu D (2018) Learning and guaranteed cost control with event-based adaptive critic implementation. IEEE Trans Neural Netw Learn Syst 29:6004–6014
Song R, Lewis F, Wei Q (2017) Off-policy integral reinforcement learning method to solve nonlinear continuous-time multiplayer nonzero-sum games. IEEE Trans Neural Netw Learn Syst 28:704–713
Zhang H, Cui L, Luo Y (2013) Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP. IEEE Trans Cybern 43:206–216
Ding C, Sun Y, Zhu Y (2017) A NN-based hybrid intelligent algorithm for a discrete nonlinear uncertain optimal control problem. Neural Process Lett 45:457–473
Wang D, He H, Liu D (2018) Intelligent optimal control with critic learning for a nonlinear overhead crane system. IEEE Trans Ind Inform 14:2932–2940
Faußser S, Schwenker F (2015) Neural network ensembles in reinforcement learning. Neural Process Lett 41:55–69
Pucheta J, Patiño H, Fullana R, Schugurensky C, Kuchen B (2006) A neuro-dynamic programming-based optimal controller for tomato seedling growth in greenhouse systems. Neural Process Lett 24:241–260
Modares H, Lewis F, Jiang Z (2015) H\(\infty \) tracking control of completely unknown continuous-time systems via off-policy reinforcement learning. IEEE Trans Neural Netw Learn Syst 26:2550–2562
Stewart J (2015) Single variable calculus: early transcendentals. Cengage Learning, Boston
Acknowledgements
This work was supported by the National Natural Science Foundation of China (6143300461627809, 61621004), and the National High Technology Research and Development Program of China (2012AA040104).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Liu, C., Zhang, H., Ren, H. et al. An Analysis of IRL-Based Optimal Tracking Control of Unknown Nonlinear Systems with Constrained Input. Neural Process Lett 50, 2681–2700 (2019). https://doi.org/10.1007/s11063-019-10029-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-019-10029-5