
Neural Network-Based Adaptive Optimal Controller – A Continuous-Time Formulation

  • Conference paper

Part of the book series: Communications in Computer and Information Science (CCIS, volume 15)

Abstract

We present a new online adaptive control scheme for partially unknown nonlinear systems that converges to the optimal state-feedback control solution for nonlinear systems affine in the input. The main features of the algorithm map onto the characteristics of the reward-based decision-making process in the mammalian brain.

The optimal adaptive control algorithm is derived in a continuous-time framework. The optimal control solution is obtained directly, without system identification. The algorithm is an online policy iteration approach, built on an adaptive critic structure, that finds an approximate solution to the state-feedback, infinite-horizon optimal control problem.
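To make the structure of such an online policy iteration concrete, below is a minimal sketch in the spirit of the scheme described above, for a plant affine in the input with a quadratic control penalty and a critic that represents the value function with a small polynomial basis. Everything named here (f, g, Q, phi, grad_phi, the evaluation interval T, the sample grid) is an illustrative assumption rather than the authors' implementation. The sketch fits the critic weights to an integral Bellman relation measured over short trajectory segments, so the internal dynamics f appear only in the plant simulation that stands in for the real system, never in the learning equations; the policy improvement step uses the standard affine-systems form u = -1/2 R^{-1} g(x)^T dV/dx, which requires knowledge of the input gain g only.

```python
import numpy as np

# --- illustrative plant: scalar system affine in the input (assumption) ---
def f(x):                      # internal dynamics; used only to stand in for the
    return -x + x**3           # real plant in simulate(), never in the learning step

def g(x):                      # input gain, assumed known for policy improvement
    return np.array([1.0])

R = np.array([[1.0]])          # control weighting

def Q(x):                      # state penalty
    return float(x @ x)

# --- adaptive critic: V(x) ~ w' phi(x) with a polynomial basis (assumption) ---
def phi(x):
    return np.array([x[0]**2, x[0]**4])

def grad_phi(x):               # rows are gradients of each basis function w.r.t. x
    return np.array([[2.0 * x[0]], [4.0 * x[0]**3]])

def policy(x, w):
    # policy improvement: u = -1/2 R^{-1} g(x)' dV/dx with the current critic
    dV = grad_phi(x).T @ w     # value-function gradient, shape (n,)
    G = g(x).reshape(-1, 1)    # input matrix, shape (n, m)
    return -0.5 * np.linalg.solve(R, G.T @ dV)

def simulate(x0, w, T, dt):
    # roll the closed loop forward over one evaluation interval with forward
    # Euler, accumulating the running cost; this stands in for measuring the
    # real plant along its trajectory
    x, cost = x0.copy(), 0.0
    for _ in range(int(T / dt)):
        u = policy(x, w)
        cost += (Q(x) + float(u @ R @ u)) * dt
        x = x + (f(x) + g(x) * u) * dt
    return x, cost

def policy_iteration(x_samples, w0, T=0.1, dt=1e-3, iters=15):
    # Each sweep: policy evaluation fits the critic weights to the integral
    # Bellman relation  w' phi(x(t)) = cost over [t, t+T] + w' phi(x(t+T))
    # by least squares over the sampled initial states; policy improvement
    # is implicit, since policy() always uses the newest weights.
    w = w0.copy()
    for _ in range(iters):
        rows, targets = [], []
        for x0 in x_samples:
            xT, c = simulate(np.array([x0]), w, T, dt)
            rows.append(phi(np.array([x0])) - phi(xT))   # regression row
            targets.append(c)                            # measured reinforcement
        w, *_ = np.linalg.lstsq(np.array(rows), np.array(targets), rcond=None)
    return w

if __name__ == "__main__":
    w_final = policy_iteration(np.linspace(-1.0, 1.0, 25), w0=np.ones(2))
    print("critic weights after policy iteration:", w_final)
```

In an online setting the data for the evaluation step would come from measurements of the running closed-loop plant rather than from repeated simulations over a sample grid; the batch least-squares sweep is used here only because it is the shortest way to show the two alternating steps.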

This work was supported by NSF ECS-0501451, NSF ECCS-0801330 and ARO W91NF-05-1-0314.

Editor information

De-Shuang Huang, Donald C. Wunsch II, Daniel S. Levine, Kang-Hyun Jo

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Vrabie, D., Lewis, F., Levine, D. (2008). Neural Network-Based Adaptive Optimal Controller – A Continuous-Time Formulation. In: Huang, D.S., Wunsch, D.C., Levine, D.S., Jo, K.H. (eds.) Advanced Intelligent Computing Theories and Applications. With Aspects of Contemporary Intelligent Computing Techniques. ICIC 2008. Communications in Computer and Information Science, vol. 15. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85930-7_37

  • DOI: https://doi.org/10.1007/978-3-540-85930-7_37

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85929-1

  • Online ISBN: 978-3-540-85930-7

  • eBook Packages: Computer Science (R0)
