Incremental Skill Acquisition for Self-motivated Learning Animats

Bonarini, Andrea; Lazaric, Alessandro; Restelli, Marcello

doi:10.1007/11840541_30

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4095)

Included in the following conference series:

International Conference on Simulation of Adaptive Behavior

Abstract

A central role in the development process of children is played by self-exploratory activities. Through a playful interaction with the surrounding environment, they test their own capabilities, explore novel situations, and understand how their actions affect the world. During this kind of exploration, interesting situations may be discovered. By learning to reach these situations, a child incrementally develops more and more complex skills. Inspired by studies from psychology, neuroscience, and machine learning, we designed SMILe (Self-Motivated Incremental Learning), a learning framework that allows artificial agents to autonomously identify and learn a set of abilities useful for facing several different tasks, through an iterated three-phase process: by means of a random exploration of the environment (babbling phase), the agent identifies interesting situations and generates an intrinsic motivation (motivating phase) aimed at learning how to reach these situations (skill acquisition phase). This process incrementally increases the skills of the agent, so that new interesting configurations can be experienced. We present results on two gridworld environments to show how SMILe makes it possible to learn skills that enable the agent to perform well and robustly in many different tasks.
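
The iterated three-phase loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how SMILe measures interest or how skills are learned, so the 5x5 open grid, the rarity-based interest criterion, the use of tabular Q-learning, and all names (`babble`, `motivate`, `acquire_skill`, `smile`) are assumptions made purely for illustration.

```python
import random
from collections import Counter

ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # primitive moves
SIZE = 5  # hypothetical 5x5 open gridworld (assumption, not from the paper)

def step(state, action):
    """Move on the grid; bumping into the border leaves the state unchanged."""
    x, y = state[0] + action[0], state[1] + action[1]
    return (x, y) if 0 <= x < SIZE and 0 <= y < SIZE else state

def run_skill(state, policy, max_steps=4 * SIZE):
    """Follow a learned policy until its target (the one state it omits) or a step limit."""
    for _ in range(max_steps):
        if state not in policy:
            break
        state = step(state, policy[state])
    return state

def babble(start, skills, rng, n_steps=300):
    """Babbling phase: random exploration using primitive actions and learned skills."""
    visits, state = Counter(), start
    for _ in range(n_steps):
        choice = rng.choice(ACTIONS + skills)
        state = run_skill(state, choice) if isinstance(choice, dict) else step(state, choice)
        visits[state] += 1
    return visits

def motivate(visits, known_targets):
    """Motivating phase (assumed criterion): the rarest visited state not yet targeted."""
    candidates = [s for s in visits if s not in known_targets]
    return min(candidates, key=lambda s: (visits[s], s)) if candidates else None

def acquire_skill(target, rng, episodes=500, alpha=1.0, gamma=0.9):
    """Skill acquisition phase (assumed method): tabular Q-learning with an
    intrinsic reward of 1 for entering the target, under a random behavior policy."""
    states = [(x, y) for x in range(SIZE) for y in range(SIZE)]
    q = {(s, a): 0.0 for s in states for a in ACTIONS}
    for _ in range(episodes):
        state = rng.choice(states)
        for _ in range(4 * SIZE):
            if state == target:
                break
            action = rng.choice(ACTIONS)
            nxt = step(state, action)
            reward = 1.0 if nxt == target else 0.0
            best_next = 0.0 if nxt == target else max(q[(nxt, a)] for a in ACTIONS)
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state = nxt
    # Greedy policy over every state except the target itself.
    return {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in states if s != target}

def smile(rounds=3, seed=0):
    """Iterate babbling -> motivating -> skill acquisition, growing the skill set."""
    rng = random.Random(seed)
    skills, targets = [], []
    for _ in range(rounds):
        visits = babble((0, 0), skills, rng)
        target = motivate(visits, targets)
        if target is None:
            break
        skills.append(acquire_skill(target, rng))
        targets.append(target)
    return targets, skills

if __name__ == "__main__":
    targets, skills = smile()
    print("targets chosen per round:", targets)
```

Each learned skill is added to the action set used in the next babbling phase, which is one simple way to model the idea that acquired skills let the agent experience new configurations. Any other interest measure or learning algorithm could be substituted in `motivate` and `acquire_skill` without changing the loop.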

Author information

Authors and Affiliations

Department of Electronics and Informatics, Politecnico di Milano, piazza Leonardo da Vinci 32, I-20133, Milan, Italy
Andrea Bonarini, Alessandro Lazaric & Marcello Restelli

Editor information

Editors and Affiliations

Institute of Cognitive Sciences and Technologies, LARRAL, Via S. Martino della Battaglia 44, 00185, Roma, Italy
Stefano Nolfi
Institute of Cognitive Science and Technology, (ISTC-CNR), Via San Martino della Battaglia 44, 00185, Rome, Italy
Gianluca Baldassarre
Institute of Cognitive Science and Technology, ISTC-CNR, Via S. Martino della Battaglia 44, 00185, Rome, Italy
Raffaele Calabretta & Davide Marocco
The Mærsk Mc-Kinney Møller Institute, University of Southern Denmark, Campusvej 55, 5230, Odense M, Denmark
John C. T. Hallam
UPMC Univ Paris 6, FRE2507, ISIR, F-75016, Paris, France
Jean-Arcady Meyer
Laboratory of Autonomous Robotics and Artificial Life, Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
Orazio Miglino
Institute of Cognitive Sciences and Technologies, National Research Council, Via San Martino della Battaglia 44, 00185, Rome, Italy
Domenico Parisi

About this paper

Cite this paper

Bonarini, A., Lazaric, A., Restelli, M. (2006). Incremental Skill Acquisition for Self-motivated Learning Animats. In: Nolfi, S., et al. From Animals to Animats 9. SAB 2006. Lecture Notes in Computer Science, vol 4095. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11840541_30

DOI: https://doi.org/10.1007/11840541_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-38608-7
Online ISBN: 978-3-540-38615-5
eBook Packages: Computer Science, Computer Science (R0)
