Abstract
Reinforcement learning has achieved exceptional performance over the last decade, yet its application to robotics and control remains an area for deeper investigation because of the challenges these settings pose: high-dimensional continuous state and action spaces, complicated system dynamics, and physical constraints. In this paper, we present a pioneering experiment in applying an existing model-based RL framework, PILCO, to a time-optimal control problem. The algorithm first models the system dynamics with Gaussian processes, which reduces the effect of model bias. Policy evaluation is then carried out through iterated prediction with Gaussian posteriors and deterministic approximate inference, and analytic gradients are used for policy improvement. We document a simulation and a real-world experiment in which an autonomous car completes a rest-to-rest linear locomotion task. The simulation results demonstrate the time-optimality and data efficiency of the approach, and the experiment shows that learning under real-world conditions is feasible with our methodology.
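To make the pipeline above concrete, the sketch below illustrates a PILCO-style learning loop in Python. It is a minimal illustration under stated assumptions, not the authors' implementation: the 1-D double-integrator "car", the saturated linear controller, the quadratic cost, and all numeric constants are hypothetical, and two of PILCO's key ingredients are simplified for brevity, with Monte Carlo rollouts through the GP posterior standing in for deterministic moment matching, and finite differences standing in for analytic policy gradients.

# A minimal, illustrative sketch of a PILCO-style loop, NOT the authors' code.
# Assumptions: a hypothetical 1-D double-integrator "car", a saturated linear
# state-feedback controller, and invented constants. For brevity, PILCO's
# deterministic moment matching and analytic gradients are replaced here by
# Monte Carlo rollouts through the GP posterior and finite differences.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

rng = np.random.default_rng(0)
DT, HORIZON = 0.1, 30
TARGET = np.array([1.0, 0.0])   # rest-to-rest: reach position 1 at velocity 0

def true_dynamics(x, u):
    # The real plant (unknown to the learner): position/velocity integrator.
    return np.array([x[0] + DT * x[1], x[1] + DT * u])

def policy(x, theta):
    # Saturated linear state-feedback controller with parameters theta.
    return float(np.clip(theta @ (x - TARGET), -2.0, 2.0))

def fit_model(X, Y):
    # One GP per state dimension, as in PILCO's dynamics model.
    kernel = RBF(length_scale=[1.0, 1.0, 1.0]) + WhiteKernel(noise_level=1e-2)
    return [GaussianProcessRegressor(kernel=kernel, normalize_y=True)
            .fit(X, Y[:, d]) for d in range(Y.shape[1])]

def sample_step(gps, x, u):
    # Draw the next state from each GP's marginal posterior (a Monte Carlo
    # stand-in for PILCO's analytic propagation of Gaussian state beliefs).
    q = np.r_[x, u][None, :]
    out = np.empty(len(gps))
    for d, gp in enumerate(gps):
        m, s = gp.predict(q, return_std=True)
        out[d] = m[0] + s[0] * rng.standard_normal()
    return out

def expected_cost(gps, theta, n_rollouts=8):
    # Policy evaluation on the learned model, never on the real system.
    total = 0.0
    for _ in range(n_rollouts):
        x = np.zeros(2)
        for _ in range(HORIZON):
            x = sample_step(gps, x, policy(x, theta))
            total += np.sum((x - TARGET) ** 2)
    return total / n_rollouts

def improve_policy(gps, theta, lr=0.05, iters=15, eps=1e-2):
    # Finite-difference policy search (the paper uses analytic gradients).
    for _ in range(iters):
        grad = np.zeros_like(theta)
        for i in range(theta.size):
            e = np.zeros_like(theta)
            e[i] = eps
            grad[i] = (expected_cost(gps, theta + e)
                       - expected_cost(gps, theta - e)) / (2 * eps)
        theta = theta - lr * grad / (np.linalg.norm(grad) + 1e-8)
    return theta

# Main loop: act on the real system, refit the GP model, improve the policy.
theta, X, Y = 0.1 * rng.standard_normal(2), [], []
for episode in range(5):
    x = np.zeros(2)
    for _ in range(HORIZON):
        u = policy(x, theta) + 0.1 * rng.standard_normal()  # mild exploration
        x_next = true_dynamics(x, u)
        X.append(np.r_[x, u])
        Y.append(x_next)
        x = x_next
    gps = fit_model(np.array(X), np.array(Y))
    theta = improve_policy(gps, theta)
    print(f"episode {episode}: final state pos={x[0]:+.3f}, vel={x[1]:+.3f}")

In the actual framework, propagating full Gaussian state distributions analytically rather than by sampling is what makes the long-term predictions deterministic and the policy gradients available in closed form, which in turn accounts for PILCO's data efficiency.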
References
Deisenroth, M., Rasmussen, C.: PILCO: a model-based and data-efficient approach to policy search. In: Proceedings of the International Conference on Machine Learning (2011)
Shin, K., McKay, N.: A dynamic programming approach to trajectory planning of robotic manipulators. IEEE Trans. Autom. Control 31(6), 491–500 (1986)
Verscheure, D., Demeulenaere, B., Swevers, J., De Schutter, J., Diehl, M.: Time-optimal path tracking for robots: a convex optimization approach. IEEE Trans. Autom. Control 54(10), 2318–2327 (2009)
Shin, K., McKay, N.: Minimum-time control of robotic manipulators with geometric path constraints. IEEE Trans. Autom. Control 30(6), 531–541 (1985)
Lamiraux, F., Laumond, J.: From paths to trajectories for multibody mobile robots. In: Proceedings of the 5th International Symposium on Experimental Robotics, pp. 301–309 (1998)
Polydoros, A.S., Nalpantidis, L.: Survey of model-based reinforcement learning: applications on robotics. J. Intell. Rob. Syst. 86(2), 153–173 (2017)
Rasmussen, C., Kuss, M.: Gaussian processes in reinforcement learning. In: Advances in Neural Information Processing Systems, vol. 16, pp. 751–759 (2004)
Mayne, D.Q., Rawlings, J.B., Rao, C.V., Scokaert, P.O.: Constrained model predictive control: stability and optimality. Automatica 36, 789–814 (2000)
Acknowledgement
We thank Dr. Marc Peter Deisenroth for his kind help in implementing the PILCO algorithm in our project. His advice on handling system constraints was very useful to us.
Cite this paper
Liao, H.C., Liu, J.S. (2019). A Model-Based Reinforcement Learning Approach to Time-Optimal Control Problems. In: Wotawa, F., Friedrich, G., Pill, I., Koitz-Hristov, R., Ali, M. (eds.) Advances and Trends in Artificial Intelligence. From Theory to Practice. IEA/AIE 2019. Lecture Notes in Computer Science, vol. 11606. Springer, Cham. https://doi.org/10.1007/978-3-030-22999-3_56