Optimal and Adaptive Control Design Using Recursive Least Square with a New Exponential Forgetting Factor

Park, On; Shin, Hyo-Sang; Lee, Hae-In; Antonios, Tsourdos

doi:10.1007/978-3-030-97672-9_11

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 429)

Included in the following conference series:

International Conference on Robot Intelligence Technology and Applications

Abstract

This paper presents a reinforcement learning application that uses a recursive least squares (RLS) algorithm with an exponential forgetting (EF) factor to solve the discrete-time linear quadratic regulator problem. A temporal-difference-learning-based RLS algorithm is implemented to find the kernel matrix of the action-value function (Q-function), which is approximated by a neural network. Building on EF RLS, a New Exponential Forgetting (New EF) factor algorithm is developed by adding a covariance term to the forgetting factor to prevent the estimator windup problem. Numerical simulations on a fixed-wing aircraft are performed to show the effectiveness of the New EF RLS.
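
As background for the abstract, the following is a minimal sketch of the textbook RLS update with a constant exponential forgetting factor. It is not the authors' New EF algorithm, whose covariance-dependent forgetting factor is not given on this page. The function names `ef_rls_step`, `fit_demo`, and `windup_demo`, the forgetting factor value, and the two-parameter test problem are all illustrative assumptions.

```python
import random


def ef_rls_step(theta, P, phi, y, lam=0.98):
    """One step of standard RLS with a constant forgetting factor lam (assumed value)."""
    n = len(theta)
    # P @ phi; P is symmetric, so phi^T P equals (P phi)^T.
    P_phi = [sum(P[i][j] * phi[j] for j in range(n)) for i in range(n)]
    denom = lam + sum(phi[i] * P_phi[i] for i in range(n))
    gain = [v / denom for v in P_phi]
    # Prediction error for the new sample.
    err = y - sum(phi[i] * theta[i] for i in range(n))
    theta = [theta[i] + gain[i] * err for i in range(n)]
    # Covariance update: P <- (P - gain * phi^T P) / lam.
    P = [[(P[i][j] - gain[i] * P_phi[j]) / lam for j in range(n)] for i in range(n)]
    return theta, P


def fit_demo(steps=200, seed=0):
    """Estimate a hypothetical 2-parameter linear model from noise-free, exciting data."""
    rng = random.Random(seed)
    true_theta = [2.0, -1.0]
    theta = [0.0, 0.0]
    P = [[1000.0, 0.0], [0.0, 1000.0]]
    for _ in range(steps):
        phi = [rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0)]
        y = sum(t * p for t, p in zip(true_theta, phi))
        theta, P = ef_rls_step(theta, P, phi, y)
    return theta


def windup_demo(steps=10, lam=0.5):
    """With no excitation (phi = 0), P is divided by lam every step and grows without bound."""
    theta = [0.0, 0.0]
    P = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(steps):
        theta, P = ef_rls_step(theta, P, [0.0, 0.0], 0.0, lam=lam)
    return P


if __name__ == "__main__":
    print("estimate:", fit_demo())
    print("P[0][0] after unexcited steps:", windup_demo()[0][0])
```

In `windup_demo` the regressor is zero, so the gain vanishes and the covariance is simply divided by `lam` at every step. This unbounded growth is the estimator windup that the abstract says the New EF factor is designed to prevent.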

Author information

Authors and Affiliations

The School of Aerospace Transport and Manufacturing (SATM), Cranfield University, Cranfield, MK43 0AL, UK
On Park, Hyo-Sang Shin, Hae-In Lee & Tsourdos Antonios

Corresponding author

Correspondence to Hyo-Sang Shin.

Editor information

Editors and Affiliations

Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Jinwhan Kim
Mechanical Engineering, Stevens Institute of Technology, Hoboken, NJ, USA
Brendan Englot
Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Hae-Won Park
Aerospace Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Han-Lim Choi
Civil and Environmental Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Hyun Myung
School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Junmo Kim
School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Jong-Hwan Kim

About this paper

Cite this paper

Park, O., Shin, HS., Lee, HI., Antonios, T. (2022). Optimal and Adaptive Control Design Using Recursive Least Square with a New Exponential Forgetting Factor. In: Kim, J., et al. Robot Intelligence Technology and Applications 6. RiTA 2021. Lecture Notes in Networks and Systems, vol 429. Springer, Cham. https://doi.org/10.1007/978-3-030-97672-9_11

DOI: https://doi.org/10.1007/978-3-030-97672-9_11
Published: 01 April 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-97671-2
Online ISBN: 978-3-030-97672-9
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)