Online exploratory behavior acquisition model based on reinforcement learning

Gouko, Manabu; Kobayashi, Yuichi; Kim, Chyon Hae

doi:10.1007/s10489-014-0567-4

Published: 27 July 2014

Volume 42, pages 75–86, (2015)

Applied Intelligence

Manabu Gouko¹,
Yuichi Kobayashi² &
Chyon Hae Kim³


Abstract

Discernment behavior is an exploratory behavior that supports object feature extraction and is a fundamental tool that robots use to orient themselves, operate objects, and establish knowledge. The main contributions of this paper are an active perception model and an analysis of the motion patterns it acquires. In the proposed model, a robot autonomously learns discernment behavior by interacting with multiple objects in its environment. During these interactions, the robot receives reinforcement signals according to the cluster distance of the observed data; in other words, a reinforcement learning approach rewards the successful recognition of objects. We apply the model to a mobile robot simulation to evaluate its effectiveness. The results show that the model establishes intelligent strategies based on the relationship between object features and the robot's configuration. In addition, we run experiments with real mobile robots and examine the suitability of the learned behaviors.
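The abstract does not give the reward formula, so the following is only a minimal sketch of what a reward based on cluster distance could look like. It assumes that the sensor observations collected while interacting with each object form one cluster, and that the reward is the mean Euclidean distance between cluster centroids. The function names, the centroid-based distance, and the example data are all hypothetical and are not taken from the paper.

```python
import math
from itertools import combinations


def centroid(points):
    """Return the mean of a non-empty list of equal-length sensor vectors."""
    n = len(points)
    return [sum(p[i] for p in points) / n for i in range(len(points[0]))]


def cluster_distance_reward(observations_by_object):
    """Hypothetical reward: mean Euclidean distance between per-object centroids.

    Assumption (not from the paper): a behavior earns a larger reward when the
    observation clusters of different objects lie farther apart, i.e. when the
    objects are easier to tell apart.
    """
    centroids = [centroid(obs) for obs in observations_by_object.values()]
    if len(centroids) < 2:
        return 0.0  # nothing to discriminate with fewer than two objects
    pairs = list(combinations(centroids, 2))
    return sum(math.dist(a, b) for a, b in pairs) / len(pairs)


# Made-up two-dimensional sensor readings for two illustrative objects.
observations = {
    "box": [[0.0, 0.0], [0.0, 0.0]],
    "cylinder": [[3.0, 4.0], [3.0, 4.0]],
}
reward = cluster_distance_reward(observations)
```

In a reinforcement learning loop, a value like `reward` would be fed back to the learner after each interaction episode, so that behaviors producing well-separated observations are reinforced. How the paper actually defines and normalizes the cluster distance is not stated in the abstract.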



Acknowledgments

This research was partially supported by the Ministry of Education, Science, Sports and Culture, Grant-in-Aid for Young Scientists (B), 24700196.

Author information

Authors and Affiliations

Department of Mechanical Engineering and Intelligent Systems, Faculty of Engineering, Tohoku Gakuin University, 1-13-1 Chuo, Tagajo-shi, Miyagi, 985-8537, Japan
Manabu Gouko
Department of Mechanical Engineering, Graduate School of Engineering, Shizuoka University, 3-5-1 Johoku, Naka-ku, Hamamatsu-shi, Shizuoka, 432-8561, Japan
Yuichi Kobayashi
Department of Electrical Engineering and Computer Science, Faculty of Engineering, Iwate University, 4-3-5, Ueda, Morioka-shi, Iwate, 020-8551, Japan
Chyon Hae Kim


Corresponding author

Correspondence to Manabu Gouko.

About this article

Cite this article

Gouko, M., Kobayashi, Y. & Kim, C.H. Online exploratory behavior acquisition model based on reinforcement learning. Appl Intell 42, 75–86 (2015). https://doi.org/10.1007/s10489-014-0567-4

Published: 27 July 2014
Issue Date: January 2015
DOI: https://doi.org/10.1007/s10489-014-0567-4
