Abstract
Discernment behavior is an exploratory behavior that supports object feature extraction and is a fundamental tool used by robots to orient themselves, operate objects, and establish knowledge. The main contribution of this paper is to propose an active perception model and analyzes the acquired motion patterns. In this study, we propose an active perception model in which a robot autonomously learns discernment behavior by interacting with multiple objects in its environment. During such interactions, the robot receives reinforcement signals according to the cluster distance of the observed data. In other words, we use a reinforcement learning approach to reward the successful recognition of objects. We apply our proposed model to a mobile robot simulation to observe its effectiveness. Results show that our proposed model effectively established intelligent strategies based on the relationship between object features and the robot’s configuration. In addition, we perform our experiments using real mobile robots and observe the suitability of the observed learned behaviors.














Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Anderson CW (1989) Learning to Control an Inverted Pendulum Using Neural Networks. IEEE Control Syst Mag 9:31–37
İlkay A, Dag̃ N, Kalkan S, Şahin E (2010) Affordances and emergence of concepts. In: Proceedings of the 10th International Conference on Epigenetic Robotics, 11–18
Gibson E (1988) Exploratory behavior in the development of perceiving, acting, and the acquiring of knowledge. Annu Rev Psychol 39(1):1–42
Gibson JJ (1962) Observations on active touch. Psychol Rev 69(6):477–491
Griffith S, Sinapov J, Sukhoy V, Stoytchev A (2010) How to separate containers from non-containers? a behavior-grounded approach to acoustic object categorization. In: Proceedings of 2010 IEEE International Conference on Robotics and Automation (ICRA2010), 1852–1859
Held R, Hein A (1963) Movement-produced stimulation in the development of visually guided behavior. J Comp Physiol Psychol 56(5):872–876
Shibata K, Nishinoz T, Okabe Y (2001) Actor-q based active perception learning system. In: Proceedings of International Conference on Robotics and Automation 2001 (ICRA2001), 1000–1005
Maris M, Boeckhorst R (1996) Exploiting physical constraints: heap formation through behavioral error in a group of robots. In: Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS96), 1655–1660
Morimoto G, Ikegami T (2004) Evolution of plastic sensory-motor coupling and dynamic categorization. In: Pollack J, et al. (eds) Artificial Life IX. The MIT press, Massachusetts, pp 188–193
Nishide S, Ogata T, Tani J, Komatani K, Okuno G H (2008) Predicting object dynamics from visual images through active sensing experiences. Adv Robot 22(5):527–546
Nolfi S, Marocco D (2002) Active perception: A sensorimotor account of object categorization. In: Proceedings of the 7th international conference on simulation of adaptive behavior on From animals to animats (SAB2002), 266–271
Pfeifer R, Scheier C (1999) Understanding Intelligence. The MIT Press, Cambridge
Sinapov J, Sukhoy V, Sahai R, Stoytchev A (2009) Vibrotactile Recognition and Categorization of Surfaces by a Humanoid Robot. IEEE Trans Robot 27(3):488–497
Sinapov J, Wiemer M, Stoytchev A (2009) Interactive learning of the acoustic properties of household objects. In: Proceedings of the 2009 IEEE International Conference on Robotics and Automation (ICRA2009), 2518–2524
Stoytchev A (2005) Behavior-grounded representation of tool affordances. In: Proceedings of the 2005 IEEE International Conference on Robotics and Automation (ICRA2005), 3060–3065
Sutton RS, Barto AG (1998) Reinforcement Learning: An Introduction. The MIT Press, Massachusetts
Takamuku S, Hosoda K, Asada M (2008) Object category acquisition by dynamic touch. Adv Robot 22(10):1143–1154
Takamuku S, Takahashi Y, Asada M (2006) Lexicon acquisition based on object-oriented behavior learning. Adv Robot 20(10):1127–1145
Tsukada M, Madokoro H, Sato K (2010) Unsupervised and adaptive category classification for a vision-based mobile robot. In: Proceedings of the 2010 International Joint Conference on Neural Networks (IJCNN2010), 1–6
Turvey MT (1996) Dynamic touch. Am Psychol 51(11):1134–1152
Acknowledgments
This research was partially supported by the Ministry of Education, Science, Sports and Culture, Grant-in-Aid for Young Scientists (B), 24700196.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gouko, M., Kobayashi, Y. & Kim, C.H. Online exploratory behavior acquisition model based on reinforcement learning. Appl Intell 42, 75–86 (2015). https://doi.org/10.1007/s10489-014-0567-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-014-0567-4