
Reinforcement learning in dynamic environment: abstraction of state-action space utilizing properties of the robot body and environment

  • Original Article
  • Artificial Life and Robotics

Abstract

In this paper, we address the autonomous control of a 3D snake-like robot using reinforcement learning and apply it in a dynamic environment. Snake-like robots generally achieve high mobility through their many degrees of freedom, which lets them traverse dynamically shifting environments such as rubble. However, this freedom and flexibility lead to a state-explosion problem, and the complexity of the dynamic environment prevents the robot's learning from converging. To solve these problems, we focus on the properties of the actual operating environment and the dynamics of the mechanical body. We design the robot's body so that it abstracts a small but sufficient state-action space by exploiting these properties, which makes reinforcement learning applicable. To demonstrate the effectiveness of the proposed snake-like robot, we conduct experiments; the results show that learning completes within a reasonable time and that the robot acquires effective behaviors for adapting to an unknown 3D dynamic environment.
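The core idea of the abstract, learning over a small abstracted state-action space instead of the robot's full joint space, can be illustrated with a minimal tabular Q-learning sketch. The contact states, motor commands, and toy reward dynamics below are hypothetical stand-ins for the paper's body- and environment-based abstraction, not the authors' actual design.

```python
import random

# Hypothetical abstracted state-action space: a handful of contact
# states and motor commands rather than the full many-DOF joint space.
STATES = ["head_contact", "body_contact", "no_contact"]
ACTIONS = ["bend", "extend"]
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1  # learning rate, discount, exploration

def step(state, action):
    """Toy environment: bending while in contact yields progress (+1)."""
    if state != "no_contact" and action == "bend":
        return "no_contact", 1.0
    return random.choice(STATES), 0.0

def train(episodes=2000, seed=0):
    """Standard tabular Q-learning over the abstracted space."""
    random.seed(seed)
    q = {(s, a): 0.0 for s in STATES for a in ACTIONS}
    for _ in range(episodes):
        s = random.choice(STATES)
        for _ in range(10):  # short episode
            # Epsilon-greedy action selection.
            if random.random() < EPSILON:
                a = random.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda x: q[(s, x)])
            s2, r = step(s, a)
            # One-step Q-learning update.
            best_next = max(q[(s2, x)] for x in ACTIONS)
            q[(s, a)] += ALPHA * (r + GAMMA * best_next - q[(s, a)])
            s = s2
    return q
```

Because the table has only six entries, learning converges in seconds; this is the practical payoff of the abstraction the abstract describes, since a naive encoding of a many-DOF snake body would make the same tabular approach intractable.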





Acknowledgments

This study was partially supported by the Ministry of Education, Culture, Sports, Science and Technology (MEXT) (Grant-in-Aid for Young Scientists (B), 22700156, 2011).

Author information


Corresponding author

Correspondence to Kazuyuki Ito.

About this article


Cite this article

Ito, K., Takeuchi, Y. Reinforcement learning in dynamic environment: abstraction of state-action space utilizing properties of the robot body and environment. Artif Life Robotics 21, 11–17 (2016). https://doi.org/10.1007/s10015-015-0258-1
