Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems

Fan, Zhong-Xin; Tang, Lintao; Li, Shihua; Liu, Rongjie

doi:10.1007/s00521-023-08993-0

Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems

Original Article
Published: 12 September 2023

Volume 35, pages 23987–23996, (2023)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Zhong-Xin Fan^1,2,
Lintao Tang³,
Shihua Li^1,2 &
…
Rongjie Liu ORCID: orcid.org/0000-0003-1653-8609³

351 Accesses
Explore all metrics

Abstract

This paper concludes a robust optimal tracking control law for a class of nonlinear systems. A characteristic of this paper is that the designed controller can guarantee both robustness and optimality under nonlinearity and mismatched disturbances. Optimal controllers for nonlinear systems are difficult to obtain, hence a reinforcement learning method is adopted with two neural networks (NNs) approximating the cost function and optimal controller, respectively. We designed weight update laws for critic NN and actor NN based on gradient descent and stability, respectively. In addition, matched and mismatched disturbances are estimated by fixed-time disturbance observers and an artful transformation based on backstepping method is employed to convert the system into a filtered error nonlinear system. Through a rigorous analysis using the Lyapunov method, we demonstrate states and estimation errors remain uniformly ultimately bounded. Finally, the effectiveness of the proposed method is verified through two illustrative examples.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Reinforcement Learning-Based Adaptive Output-Feedback Control for Discrete-Time Strict-Feedback Nonlinear Systems

Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning

Article 03 March 2023

Reinforcement Learning-Based Anti-disturbances Adaptive Control for Systems Subjected to Mismatched Disturbances and Input Uncertainties

Data availability

Data sharing not applicable as no new data were generated in this study.

References

Tang L, Gao Y, Liu YJ (2014) Adaptive near optimal neural control for a class of discrete-time chaotic system. Neural Comput Appl 25:1111–1117
Article Google Scholar
Na J, Lv Y, Zhang K, Zhao J (2020) Adaptive identifier-critic-based optimal tracking control for nonlinear systems with experimental validation. IEEE Trans Syst Man Cybern Syst 52(1):459–472
Article Google Scholar
Fan ZX, Li S, Liu R (2022) ADP-based optimal control for dystems with mismatched disturbances: a PMSM application. IEEE Trans Circ Syst II Express Briefs 70(6):2057–2061
Google Scholar
Fan ZX, Adhikary AC, Li S, Liu R (2020) Anti-disturbance inverse optimal control for systems with disturbances. Optim Control Appl Methods 44(3):1321–1340
Article MathSciNet Google Scholar
Chen J, Li K, Li K, Yu PS (2021) Dynamic bicycle dispatching of dockless public bicycle-sharing systems using multi-objective reinforcement learning. ACM Trans Cyber-Phys Syst 5(4):1–24
Article Google Scholar
Lewis FL, Vrabie DL, Syrmos VL (2012) Optimal control. Wiley
Book MATH Google Scholar
Werbos PJ (1992) Approximate dynamic programming for real-time control and neural modeling. handbook of intelligent control neural fuzzy and adaptive approaches, 1992
Wei Q, Zhu L, Song R, Zhang P, Liu D, Xiao J (2022) Model-free adaptive optimal control for unknown nonlinear multiplayer nonzero-sum game. IEEE Trans Neural Netw Learn Syst 33(2):879–892
Article MathSciNet Google Scholar
Gao W, Jiang ZP (2016) Adaptive dynamic programming and adaptive optimal output regulation of linear systems. IEEE Trans Autom Control 61(12):4164–4169
Article MathSciNet MATH Google Scholar
Gao W, Jiang ZP, Lewis FL, Wang Y (2018) Leader-to-formation stability of multiagent systems: an adaptive optimal control approach. IEEE Trans Autom Control 63(10):3581–3587
Article MathSciNet MATH Google Scholar
Krstic M, Tsiotras P (1999) Inverse optimal stabilization of a rigid spacecraft. IEEE Trans Autom Control 44(5):1042–1049
Article MathSciNet MATH Google Scholar
Fan ZX, Adhikary AC, Li S, Liu R (2022) Disturbance observer based inverse optimal control for a class of nonlinear systems. Neurocomputing 500:821–831
Article Google Scholar
Ming X, Balakrishnan SN (2005) A new method for suboptimal control of a class of non-linear systems. Optim Control Appl Methods 26(2):55–83
Article MathSciNet Google Scholar
Do TD, Choi HH, Jung WJ (2015) \(\theta\)-D approximation technique for nonlinear optimal speed control design of surface-mounted PMSM drives. IEEE/ASME Trans Mechatron 20(4):1822–1831
Article Google Scholar
Zhang H, Cui L, Zhang X, Luo Y (2011) Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Trans Neural Netw 22(12):2226–2236
Article Google Scholar
Qin C, Zhang H, Luo Y (2014) Optimal tracking control of a class of nonlinear discrete-time switched systems using adaptive dynamic programming. Neural Comput Appl 24:531–538
Article Google Scholar
Wang D, Liu D, Zhao D, Huang Y, Zhang D (2013) A neural-network-based iterative GDHP approach for solving a class of nonlinear optimal control problems with control constraints. Neural Comput Appl 22(2):219–227
Article Google Scholar
Vamvoudakis KG, Lewis FL (2010) Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 46(5):878–888
Article MathSciNet MATH Google Scholar
Yang W, Li K, Li K (2019) A pipeline computing method of SpTV for three-order tensors on CPU and GPU. ACM Trans Knowl Discov Data 13(6):1–27
Article MathSciNet Google Scholar
Zhong K, Yang Z, Xiao G, Li X, Yang W, Li K (2022) An efficient parallel reinforcement learning approach to cross-layer defense mechanism in industrial control systems. IEEE Trans Parallel Distrib Syst 3(11):2979–2990
Google Scholar
Liu C, Tang F, Hu Y, Li K, Tang Z, Li K (2021) Distributed task migration optimization in MEC by extending multi-agent deep reinforcement learning approach. IEEE Trans Parallel Distrib Syst 32(7):1603–1614
Article Google Scholar
Jiang Y, Jiang ZP (2012) Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics. Automatica 48(10):2699–2704
Article MathSciNet MATH Google Scholar
Bian T, Jiang Y, Jiang ZP (2014) Adaptive dynamic programming and optimal control of nonlinear nonaffine systems. Automatica 50(10):2624–2632
Article MathSciNet MATH Google Scholar
Wang D (2020) Robust policy learning control of nonlinear plants with case studies for a power system application. IEEE Trans Industr Inf 16(3):1733–1741
Article Google Scholar
Zhao J, Yang C, Gao W, Modares H, Chen X, Dai W (2023) Linear quadratic tracking control of unknown systems: a two-phase reinforcement learning method. Automatica 148:110761
Article MathSciNet MATH Google Scholar
Modares H, Lewis FL (2014) Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning. Automatica 50(7):1780–1792
Article MathSciNet MATH Google Scholar
Chen WH (2004) Disturbance observer based control for nonlinear systems. IEEE/ASME Trans Mechatron 9(4):706–710
Article Google Scholar
Yu B, Du H, Ding L, Wu D, Li H (2022) Neural network-based robust finite-time attitude stabilization for rigid spacecraft under angular velocity constraint. Neural Comput Appl 34:5107–5117
Article Google Scholar
Zhou K, Doyle J, Glover K (1995) Robust and optimal control. Prentice Hall, New Jersey
MATH Google Scholar
Utkin V (2003) Variable structure systems with sliding modes. IEEE Trans Autom Control 22(2):212–222
Article MathSciNet MATH Google Scholar
Levant A (2003) Higher-order sliding modes, differentiation and output-feedback control. Int J Control 76(9–10):924–941
Article MathSciNet MATH Google Scholar
Huang J (2004) Nonlinear output regulation- theory and applications. SIAM
Ohishi K, Nakao M, Ohnishi K et al (1987) Microprocessor-controlled DC motor for load-insensitive position servo system. IEEE Trans Industr Electron 34(1):44–49
Article Google Scholar
Han J (2009) From PID to active disturbance rejection control. IEEE Trans Industr Electron 56(3):900–906
Article Google Scholar
Li S, Yang J, Chen WH, Chen X (2014) Disturbance observer-based control: methods and applications. CRC Press, Inc., Boca Raton
Google Scholar
Li S, Yang J, Chen WH, Chen X (2012) Generalized extended state observer based control for systems with mismatched uncertainties. IEEE Trans Industr Electron 59(12):4792–4802
Article Google Scholar
Sun H, Guo L (2017) Neural network-based DOBC for a class of nonlinear systems with unmatched disturbances. IEEE Trans Neural Netw Learn Syst 28(2):482–489
Article MathSciNet Google Scholar
Cui B, Zhang L, Xia Y, Zhang J (2022) Continuous distributed fixed-time attitude controller design for multiple spacecraft systems with a directed graph. IEEE Trans Circ Syst II- Express Briefs 69(11):478–4482
Google Scholar
Li X, Ma L, Mei K, Ding S, Pan T (2023) Fixed-time adaptive fuzzy SOSM controller design with output constraint. Neural Comput Appl 35(13):9893–9905
Article Google Scholar
Liu W, Chen M, Shi P (2022) Fixed-time disturbance observer-based control for quadcopter suspension transportation system. IEEE Trans Circ Syst I- Regul Pap 69(11):4632–4642
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Automation, Southeast University, Sipailou, Nanjing, 210096, Jiangsu, China
Zhong-Xin Fan & Shihua Li
Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Southeast University, Sipailou, Nanjing, 210096, Jiangsu, China
Zhong-Xin Fan & Shihua Li
Department of Statistics, Florida State University, Tallahassee, FL, 32304, USA
Lintao Tang & Rongjie Liu

Authors

Zhong-Xin Fan
View author publications
You can also search for this author in PubMed Google Scholar
Lintao Tang
View author publications
You can also search for this author in PubMed Google Scholar
Shihua Li
View author publications
You can also search for this author in PubMed Google Scholar
Rongjie Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rongjie Liu.

Ethics declarations

Conflict of interest

All authors declare that there are no conflicts of interest in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Fan, ZX., Tang, L., Li, S. et al. Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems. Neural Comput & Applic 35, 23987–23996 (2023). https://doi.org/10.1007/s00521-023-08993-0

Download citation

Received: 29 May 2023
Accepted: 22 August 2023
Published: 12 September 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s00521-023-08993-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems

Abstract

Access this article

Similar content being viewed by others

Reinforcement Learning-Based Adaptive Output-Feedback Control for Discrete-Time Strict-Feedback Nonlinear Systems

Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning

Reinforcement Learning-Based Anti-disturbances Adaptive Control for Systems Subjected to Mismatched Disturbances and Input Uncertainties

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems

Abstract

Access this article

Similar content being viewed by others

Reinforcement Learning-Based Adaptive Output-Feedback Control for Discrete-Time Strict-Feedback Nonlinear Systems

Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning

Reinforcement Learning-Based Anti-disturbances Adaptive Control for Systems Subjected to Mismatched Disturbances and Input Uncertainties

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation