Learning Footstep Planning for the Quadrupedal Locomotion with Model Predictive Control

Byun, Joo Woong; Youm, Donghoon; Jeon, Seunghoon; Hwangbo, Jemin; Park, Hae-Won

doi:10.1007/978-3-030-97672-9_4

Joo Woong Byun¹⁶,
Donghoon Youm¹⁶,
Seunghoon Jeon¹⁶,
Jemin Hwangbo¹⁶ &
…
Hae-Won Park¹⁶

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 429))

Included in the following conference series:

International Conference on Robot Intelligence Technology and Applications

1832 Accesses

Abstract

This paper presents a combined framework with nonlinear model predictive control (NMPC) reinforcement learning (RL) for locomotion of a legged robot. A neural network trained by RL works as a footstep planner which decides where to put the feet of the robot on the ground. Given the constraints of footsteps and dynamics of the model, ground reaction forces exerting on each legs are obtained through NMPC and applied to the robot. This framework increases sample efficiency compared to the end-to-end RL and shows better performances than base NMPC controller which decides its footsteps in a heuristic manner. The proposed framework is verified on a simulation environment by performing challenging tasks such as push recovery and rough terrain walking.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Control of Wheeled-Legged Quadrupeds Using Deep Reinforcement Learning

3-Dimensional A* for Collision Free Walking Gait

A modular framework to generate robust biped locomotion: from planning to control

Article Open access 11 August 2021

References

Kuindersma, S., et al.: Optimization-based locomotion planning, estimation, and control design for the atlas humanoid robot. Auton. Robots 40(3), 429–455 (2015). https://doi.org/10.1007/s10514-015-9479-3
Article Google Scholar
Hong, S., Kim, J.-H., Park, H.-W.: Real-time constrained nonlinear model predictive control on SO(3) for dynamic legged locomotion. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3982–3989 (2020)
Google Scholar
Ding, Y., Pandala, A., Park, H.-W., et al.: Representation-free model predictive control for dynamic motions in quadrupeds. IEEE Trans. Robot. 37, 1154–1171 (2021)
Article Google Scholar
Carlo, J., Wensing, P.M., Kim, S., et al.: Dynamic locomotion in the MIT cheetah 3 through convex model-predictive control. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1–9 (2018)
Google Scholar
Raibert, M.H., Brown, H.B., Chepponis, M.: Experiments in balance with a 3D one-legged hopping machine. Int. J. Robot. Res. 3, 75–92 (1984)
Article Google Scholar
Pratt, J., Carff, J., Goswami, A., et al.: Capture point: a step toward humanoid push recovery. In: IEEE-RAS International Conference on Humanoid Robots, vol. 6, pp. 200–207 (2006)
Google Scholar
Mordatch, I., Todorov, E., Popovic, Z.: Discovery of complex behaviors through contact-invariant optimization. ACM Trans. Graph. 31(4), 1–8 (2012)
Article Google Scholar
Winkler, A.W., Bellicoso, C.D., Hutterm, M., et al.: Gait and trajectory optimization for legged systems through phase-based end-effector parameterization. IEEE Robot. Autom. Lett. 3, 1560–1567 (2018)
Article Google Scholar
Li, C., Ding, Y., Park, H.-W.: Centroidal-momentum-based trajectory generation for legged locomotion. Mechatronics 68, 102364 (2020)
Google Scholar
Hwangbo, J., Lee, J., Hutter, M., et al.: Learning agile and dynamic motor skills for legged robots. Sci. Robot. 4, 26 (2019)
Article Google Scholar
Carius, J., Farshidian, F., Hutter, M.: MPC-Net: a first principles guided policy search. IEEE Robot. Autom. Lett. 5, 2897–2904 (2020)
Google Scholar
Peng, X.B., Coumans, E., Levine, S., et al.: Learning agile robotic locomotion skills by imitating animals. arXiv:2004.00784 (2020)
Mahony, R., Hamel, T., Pflimlin, J.: Nonlinear complementary filters on the special orthogonal group. IEEE Trans. Autom. Control 53(5), 1203–1218 (2008)
Article MathSciNet Google Scholar
Kim, D., Carlo, J., Kim, S., et al.: Highly dynamic quadruped locomotion via whole-body impulse control and model predictive control. arXiv:1909.06586 (2019)
Schulman, J., Wolski, F., Klimov, O., et al.: Proximal policy optimization algorithm. arXiv:1707. 06347 (2017)
Tan, J., Coumans, E., Vanhoucke, V., et al.: Sim-to-real: learning agile locomotino for quadruped robots. arXiv:1804.10332 (2018)
Hwangbo, J., Lee, J., Hutter, M.: Per-contact iteration method for solving contact dynamics. IEEE Robot. Autom. Lett. 3(2), 895–902 (2018)
Article Google Scholar
Tsounis, V., Alge, M., Hutter, M. et al.: DeepGait: planning and control of quadrupedal gaits using deep reinforcement learning. arXiv:1909.08339 (2019)
Yang, Y., Zhang, T, Boots, B., et al.: Fast and efficient locomotion via learned gait transitions. arXiv:2104.04644 (2021)

Download references

Author information

Authors and Affiliations

Korea Advanced Institute of Science and Technology, Daejeon, South Korea
Joo Woong Byun, Donghoon Youm, Seunghoon Jeon, Jemin Hwangbo & Hae-Won Park

Authors

Joo Woong Byun
View author publications
You can also search for this author in PubMed Google Scholar
Donghoon Youm
View author publications
You can also search for this author in PubMed Google Scholar
Seunghoon Jeon
View author publications
You can also search for this author in PubMed Google Scholar
Jemin Hwangbo
View author publications
You can also search for this author in PubMed Google Scholar
Hae-Won Park
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hae-Won Park .

Editor information

Editors and Affiliations

Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Jinwhan Kim
Mechanical Engineering, Stevens Institute of Technology, Hoboken, NJ, USA
Brendan Englot
Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Hae-Won Park
Aerospace Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Han-Lim Choi
Civil and Environmental Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Hyun Myung
School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Junmo Kim
School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Jong-Hwan Kim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Byun, J.W., Youm, D., Jeon, S., Hwangbo, J., Park, HW. (2022). Learning Footstep Planning for the Quadrupedal Locomotion with Model Predictive Control. In: Kim, J., et al. Robot Intelligence Technology and Applications 6. RiTA 2021. Lecture Notes in Networks and Systems, vol 429. Springer, Cham. https://doi.org/10.1007/978-3-030-97672-9_4

Download citation

DOI: https://doi.org/10.1007/978-3-030-97672-9_4
Published: 01 April 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-97671-2
Online ISBN: 978-3-030-97672-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics