Bipedal Walking Robot Control Using PMTG Architecture

Danilov, Vladimir; Klimov, Konstantin; Kapytov, Dmitrii; Diane, Sekou

doi:10.1007/978-3-031-47272-5_8

Vladimir Danilov^13,15,
Konstantin Klimov^14,15,
Dmitrii Kapytov^14,15 &
…
Sekou Diane¹³

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 811))

Included in the following conference series:

Climbing and Walking Robots Conference

Abstract

Reinforcement learning based methods can achieve excellent results for robot locomotion control. However, their serious disadvantage is the long agent training time and large number of parameters defining its behavior. In this paper, we propose a method that significantly reduces training time. It is based on the Policy Modulating Trajectory Generator (PMTG) architecture, which uses Central Pattern Generators (CPG) as a gait generator. We tested this approach on an OpenAI BipedalWalker-v3 environment. The paper presents the results of this algorithm, showing its effectiveness in solving a locomotion problem over challenging terrain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Adapting Biped Locomotion to Sloped Environments

Article 01 February 2015

Learning Footstep Planning for the Quadrupedal Locomotion with Model Predictive Control

Biped Walking Learning from Imitation Using Dynamic Movement Primitives

Notes

References

McGhee, R.B.: Finite state control of quadruped locomotion. Simulation 9(3), 135–140 (1967). https://doi.org/10.1177/003754976700900308
Article Google Scholar
Raibert, M.H.: Legged Robots that Balance. MIT Press, Cambridge (1986)
Book Google Scholar
Villarreal, O., Barasuol, V., Wensing, P.M., Caldwell, D.G., Semini, C.: Mpc-based controller with terrain insight for dynamic legged locomotion. In: 2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 2436–2442 (2020)
Google Scholar
Sleiman, J.P., Farshidian, F., Minniti, M.V., Hutter, M.: A unified mpc framework for whole-body dynamic locomotion and manipulation. IEEE Robot. Autom. Lett. 6(3), 4688–4695 (2021)
Article Google Scholar
Bjelonic, M., Grandia, R., Harley, O., Galliard, C., Zimmermann, S., Hutter, M.: Whole-body mpc and online gait sequence generation for wheeled-legged robots. In: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 8388–8395 (2021)
Google Scholar
Di Carlo, J., Wensing, P.M., Katz, B., Bledt, G., Kim, S.: Dynamic locomotion in the mit cheetah 3 through convex model-predictive control. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1–9 (2018)
Google Scholar
Iscen, A., Caluwaerts, K., Tan, J., Zhang, T., Coumans, E., Sindhwani, V., Vanhoucke, V.: Policies modulating trajectory generators. In: Conference on Robot Learning, pp. 916–926 (2018)
Google Scholar
Tan, J., Zhang, T., Coumans, E., Iscen, A., Bai, Y., Hafner, D., Vanhoucke, V.: Sim-to-real: Learning agile locomotion for quadruped robots (2018). arXiv:1804.10332
Kumar, A., Fu, Z., Pathak, D., Malik, J.: Rma: rapid motor adaptation for legged robots (2021). arXiv:2107.04034
Hwangbo, J., Lee, J., Dosovitskiy, A., Bellicoso, D., Tsounis, V., Koltun, V., Hutter, M.: Learning agile and dynamic motor skills for legged robots. Sci. Robot. 4(26) (2019). https://doi.org/10.1126/scirobotics.aau5872
Margolis, G.B., Yang, G., Paigwar, K., Chen, T., Agrawal, P.: Rapid locomotion via reinforcement learning (2022). arXiv:2205.02824
Margolis, G.B., Chen, T., Paigwar, K., Fu, X., Kim, D., Kim, S., Agrawal, P: Learning to jump from pixels (2021). arXiv:2110.15344
Alexander, R.M.: Optimization and gaits in the locomotion of vertebrates. Physiol. Rev. 69(4), 1199–1227 (1989)
Article MathSciNet Google Scholar
Haarnoja, T., Zhou, A., Hartikainen, K., Tucker, G., Ha, S., Tan, J., Kumar, V., Zhu, H., Gupta, A., Abbeel, P., et al.: Soft actor-critic algorithms and applications (2018). arXiv:1812.0590
Rudin, N., Hoeller, D., Reist, P., Hutter, M.: Learning to walk in minutes using massively parallel deep reinforcement learning. In: Conference on Robot Learning, pp. 91–100 (2022)
Google Scholar
Danilov, V., Diane, S.: CPG-based gait generator for a quadruped robot with sidewalk and turning operations. In: Robotics in Natural Settings: CLAWAR 2022, pp. 276–288 (2022). https://doi.org/10.1007/978-3-031-15226-9_27
Brockman, G., Cheung, V., Pettersson, L., Schneider, J., Schulman, J., Tang, J., Zaremba, W.: Openai gym (2016). arXiv:1606.01540
Yu, J., Tan, M., Chen, J., Zhang, J.: A survey on CPG-inspired control models and system implementation. IEEE Trans. Neural Netw. Learn. Syst. 25(3), 441–456 (2013)
Article Google Scholar
Yu, J., Tan, M., Chen, J., Zhang, J.: A survey on CPG-inspired control models and system implementation. IEEE Trans. Neural Netw. Learn. Syst. 25(3), 441–456 (2013)
Article Google Scholar
Akiba, T., Sano, S., Yanase, T., Ohta, T., Koyama, M.: Optuna: a next-generation hyperparameter optimization framework. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2623–2631 (2019)
Google Scholar
OpenAI Gym Leaderboard. https://github.com/openai/gym/wiki/Leaderboard

Download references

Author information

Authors and Affiliations

Institute of Control Problems of Russian Academy of Sciences, Moscow, Russia
Vladimir Danilov & Sekou Diane
Robotics Laboratory, Institute of Mechanics Lomonosov Moscow State University, Michurinsky prosp.1, 119192, Moscow, Russia
Konstantin Klimov & Dmitrii Kapytov
Voltbro LCC, Moscow, Russia
Vladimir Danilov, Konstantin Klimov & Dmitrii Kapytov

Authors

Vladimir Danilov
View author publications
You can also search for this author in PubMed Google Scholar
Konstantin Klimov
View author publications
You can also search for this author in PubMed Google Scholar
Dmitrii Kapytov
View author publications
You can also search for this author in PubMed Google Scholar
Sekou Diane
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vladimir Danilov .

Editor information

Editors and Affiliations

Universidade Federal de Santa Catarina, Blumenau, Santa Catarina, Brazil
Ebrahim Samer El Youssef
School of Engineering, London South Bank University, London, UK
Mohammad Osman Tokhi
School of Engineering, Polytechnic Institute of Porto, Porto, Portugal
Manuel F. Silva
Universidade Federal de Santa Catarina, Blumenau, Santa Catarina, Brazil
Leonardo Mejia Rincon

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Danilov, V., Klimov, K., Kapytov, D., Diane, S. (2024). Bipedal Walking Robot Control Using PMTG Architecture. In: Youssef, E.S.E., Tokhi, M.O., Silva, M.F., Rincon, L.M. (eds) Synergetic Cooperation between Robots and Humans. CLAWAR 2023. Lecture Notes in Networks and Systems, vol 811. Springer, Cham. https://doi.org/10.1007/978-3-031-47272-5_8

Download citation

DOI: https://doi.org/10.1007/978-3-031-47272-5_8
Published: 04 January 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47271-8
Online ISBN: 978-3-031-47272-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Bipedal Walking Robot Control Using PMTG Architecture

Abstract

Access this chapter

Similar content being viewed by others

Adapting Biped Locomotion to Sloped Environments

Learning Footstep Planning for the Quadrupedal Locomotion with Model Predictive Control

Biped Walking Learning from Imitation Using Dynamic Movement Primitives

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Bipedal Walking Robot Control Using PMTG Architecture

Abstract

Access this chapter

Similar content being viewed by others

Adapting Biped Locomotion to Sloped Environments

Learning Footstep Planning for the Quadrupedal Locomotion with Model Predictive Control

Biped Walking Learning from Imitation Using Dynamic Movement Primitives

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation