Abstract
To develop truly autonomous mobile robots, we proposed to introduce internal rewards such as the desire for existence, specific curiosity, diversive curiosity, boredom, and novelty into reinforcement learning. They are expected to make mobile robots capable of behaving appropriately without being told what to do. Firstly, we proposed to use multiple sources of rewards to endow mobile robots with ability to behave properly in the real world. Secondly, we proposed task-independent internal rewards. Thirdly, we proposed to attain engineering merit of internal rewards in addition to scientific interest. A pursuit-evasion game comprising a predator and its prey on a robotic field was selected as a testbed to demonstrate the effectiveness of internal rewards in reinforcement learning. The present paper focuses on learning of pursuit timing to maximize accumulated future rewards by Q-learning and SARSA.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Schmidhuber, J.: Self-motivated development through rewards for predictor errors/ improvements, Developmental Robotics. In: 2005 AAAI Spring Symposium (2005)
Oudeyer, P.-Y., Kaplan, F., Hafner, V.V.: Intrinsic Motivation Systems for Autonomous Mental Development. IEEE Trans. EC 11(2), 265–286 (2007)
Stout, A., Konidaris, G.D., Barto, A.G.: Intrinsically motivated reinforcement learning: A promising framework for developmental robot learning. In: Developmental Robotics AAAI Spring Symp. (2005)
Hagiwara, T., Ishikawa, M.: Emergence of behaviors based on the desire for existence. In: BrainIT 2007, p. 41 (2007)
Ishikawa, M., Hagiwara, T., Yamamoto, N., Kiriake, F.: Brain-inspired emergence of behaviors in mobile robots by reinforcement learning with internal rewards. In: HIS 2008, pp. 138–143 (2008)
Sutton, R.S., Barto, A.G.: Reinforcement Learning. MIT Press, Cambridge (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Morita, M., Ishikawa, M. (2009). Brain-Inspired Emergence of Behaviors Based on the Desire for Existence by Reinforcement Learning. In: Köppen, M., Kasabov, N., Coghill, G. (eds) Advances in Neuro-Information Processing. ICONIP 2008. Lecture Notes in Computer Science, vol 5506. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02490-0_93
Download citation
DOI: https://doi.org/10.1007/978-3-642-02490-0_93
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02489-4
Online ISBN: 978-3-642-02490-0
eBook Packages: Computer ScienceComputer Science (R0)