Abstract
This study introduces a neural network (NN) adaptive tracking controller-based reinforcement learning (RL) scheme for unknown nonlinear systems. First, an observer using feed-forward NN (FFNN) is performed to estimate the controlled system states. Second, an adaptive control based on actor-critic RL is developed, in which a quantum diagonal recurrent neural network (QDRNN) is proposed to represent the critic and actor parts. The critic QDRNN is applied to perform the “strategic” utility function, and it is minimized by the actor QDRNN. The proposed adaptive tracking NN control guarantees the faster convergence due to the developed updated algorithm for the controller parameters, which is derived using the Lyapunov function. Simulation and practical results indicate the robustness of the proposed observer-based adaptive control relative to other existing controllers.



























Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Wang CC, Yang GH (2018) Observer-based adaptive prescribed performance tracking control for nonlinear systems with unknown control direction and input saturation. Neurocomputing 284:17–26
Narendra KS (1996) Neural networks for control theory and practice. Proc IEEE 84(10):1385–1406
Peng J, Dubay R (2011) Identification and adaptive neural network control of a DC motor system with dead-zone characteristics. ISA Trans 50(4):588–598
Lakhal AN, Tlili AS, Braiek NB (2010) Neural network observer for nonlinear systems application to induction motors. Int J Control Autom 3(1):1–16
Chow TW, Fang Y (1998) A recurrent neural-network-based real-time learning control strategy applying to nonlinear systems with unknown dynamics. IEEE Trans Ind Electr 45(1):151–161
Kumar R, Srivastava S, Gupta JRP (2017) Diagonal recurrent neural network based adaptive control of nonlinear dynamical systems using Lyapunov stability criterion. ISA Trans 67:407–427
Kumar R, Srivastava S, Gupta JRP, Mohindru A (2018) Diagonal recurrent neural network based identification of nonlinear dynamical systems with Lyapunov stability based adaptive learning rates. Neurocomputing 287:102–117
Krener AJ, Respondek W (1985) Nonlinear observers with linearizable error dynamics. SIAM Journal on Control and Optimization 23(2):197–216
Oh S, Khalil HK (1997) Nonlinear output-feedback tracking using high-gain observer and variable structure control. Automatica 33(10):1845–1856
Yan XG, Edwards C (2007) Nonlinear robust fault reconstruction and estimation using a sliding mode observer. Automatica 43(9):1605–1614
Iqbal A (2019) Applications of an Extended Kalman Filter in nonlinear mechanics (Doctoral dissertation, Ph.D Thesis, University of Management and Technology. https://www.physlab.org/wpcontent/uploads/2019/06/Thesis-compressed.pdf).
Mu C, Zhang Y, Wang K (2018) Observer-based adaptive control of uncertain nonlinear systems via neural networks. IEEE Access 6:42675–42686
Yang Y, Xu C, Yue D, Xie X (2018) Output feedback tracking control of a class of continuous-time nonlinear systems via adaptive dynamic programming approach. Inf Sci 469:1–13
He P, Jagannathan S (2005) Reinforcement learning-based output feedback control of nonlinear systems with input constraints. IEEE Trans Syst Man Cybern Part B Cybern 35(1):150–154
Rios JD, Alanis AY, Arana-Daniel N, Lopez-Franco C (2017) Recurrent high order neural observer for discrete-time non-linear systems with unknown time-delay. Neural Process Lett 46(2):663–679
He P, Jagannathan S (2007) Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints. IEEE Trans Syst Man Cybern Part B Cybern 37(2):425–436
Lewis FL, Liu D (eds) (2013) Reinforcement learning and approximate dynamic programming for feedback control, vol 17. Wiley, New York
Yang X, Liu D, Wang D (2013) Observer-based adaptive output feedback control for nonaffine nonlinear discrete-time systems using reinforcement learning. In: International Conference on Neural Information Processing, Springer, Berlin, pp 631–638
Barto AG (1992) Reinforcement learning and adaptive critic methods. Handbook of intelligent control
Sharma M, Verma A (2013) Wavelet reduced order observer based adaptive tracking control for a class of uncertain nonlinear systems using reinforcement learning. Int J Control Autom Syst 11(3):496–502
Sharma M, Verma A (2013) Wavelet reduced order observer-based adaptive tracking control for a class of uncertain delayed non-linear systems subjected to actuator saturation using actor-critic architecture. Int J Autom Control 7(4):288–303
Kim EK, Mwasilu F, Choi HH, Jung JW (2014) An observer-based optimal voltage control scheme for three-phase UPS systems. IEEE Trans Ind Electr 62(4):2073–2081
Yang X, Liu D, Wang D (2013) Observer-based adaptive output feedback control for nonaffine nonlinear discrete-time systems using reinforcement learning. In: International Conference on Neural Information Processing, Springer, Berlin, pp 631–638
Mu D, Guan Z, Zhang H (2013) Learning algorithm and application of quantum neural networks with quantum weights. Int J Comput Theory Eng 5(5):788
Shang F (2015) Quantum-inspired neural network with quantum weights and real Weights. Open J Appl Sci 5(10):609
Panella, M., & Martinelli, G (2011) Neural networks with quantum architecture and quantum learning. Int J Circuit Theory Appl 39(1):61–77
Sarangapani J (2006) Neural network control of nonlinear discrete-time systems. CRC Press, Boca Raton
Liu D, Huang Y, Wang D, Wei Q (2013) Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming. Int J Control 86(9):1554–1566
Abdollahi F, Talebi HA, Patel RV (2006) A stable neural network-based observer with application to flexible-joint manipulators. IEEE Trans Neural Netw 17(1):118–129
Khater AA, El-Nagar AM, El-Bardini M, El-Rabaie NM (2018) Adaptive T-S fuzzy controller using reinforcement learning based on Lyapunov stability. J Franklin Instit 355(14):6390–6415
Khater AA, El-Nagar AM, El-Bardini M, El-Rabaie NM (2020) Online learning based on adaptive learning rate for a class of recurrent fuzzy neural network. Neural Comput Appl 32(12):8691–8710
Khater AA, El-Nagar AM, El-Bardini M, El-Rabaie N (2019) A novel structure of actor-critic learning based on an interval type-2 TSK fuzzy neural network. IEEE Trans Fuzzy Syst 28:3047–3061
Elkenawy A, El-Nagar AM, El-Bardini M, El-Rabaie NM (2020) Diagonal recurrent neural network observer-based adaptive control for unknown nonlinear systems. Trans Instit Meas Control 42(15):2833–2856
Esfandiari K, Abdollahi F, Talebi HA (2014) Observer-based adaptive neural network control for a class of uncertain nonlinear systems. In: 2014 22nd Iranian Conference on Electrical Engineering (ICEE). IEEE, pp 1354–1359
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
There is no conflict of interest between the authors to publish this manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Elkenawy, A., El-Nagar, A.M., El-Bardini, M. et al. Full-state neural network observer-based hybrid quantum diagonal recurrent neural network adaptive tracking control. Neural Comput & Applic 33, 9221–9240 (2021). https://doi.org/10.1007/s00521-020-05685-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-020-05685-x