Intrinsically Motivated Decision Making for Situated, Goal-Driven Agents

Oubbati, Mohamed; Fischer, Christian; Palm, Günther

doi:10.1007/978-3-319-08864-8_16

Mohamed Oubbati²⁴,
Christian Fischer²⁴ &
Günther Palm²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8575))

Included in the following conference series:

International Conference on Simulation of Adaptive Behavior

1533 Accesses

Abstract

Goal-driven agents are generally expected to be capable of pursuing simultaneously a variety of goals. As these goals may compete in certain circumstances, the agent must be able to constantly trade them off and shift their priorities in a rational way. One aspect of rationality is to evaluate its needs and make decisions accordingly. We endow the agent with a set of needs, or drives, that change over time as a function of external stimuli and internal consumption, and the decision making process hast to generate actions that maintain balance between these needs. The proposed framework pursues an approach in which decision making is considered as a multiobjective problem and approximately solved using a hierarchical reinforcement learning architecture. At a higher-level, a Q-learning learns to select the best learning strategy that improves the well-being of the agent. At a lower-level, an actor-critic design executes the selected strategy while interacting with a continuous, partially observable environment. We provide simulation results to demonstrate the efficiency of the approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Model-Free Reinforcement Learning-Based Control for Continuous-Time Systems

Reinforcement Learning

References

Ram, A., Leake, D.: Goal-Driven Learning. MIT Press (1995)
Google Scholar
Choi, D.: Reactive goal management in a cognitive architecture. Cognitive Systems Research 12(3-4), 293–308 (2011)
Article Google Scholar
Jaidee, U., Munoz-Avila, H., Aha, D.: Integrated learning for goal-driven autonomy. In: IJCAI, pp. 2450–2455 (2011)
Google Scholar
Zilberstein, S.: Metareasoning and Bounded Rationality. In: AAAI Workshop on Metareasoning: Thinking about Thinking (2008)
Google Scholar
da Costa Pereira, C., Tettamanzi, A.: An integrated possibilistic framework for goal generation in cognitive agents. In: AAMAS, International Foundation for Autonomous Agents and Multiagent Systems, pp. 1239–1246 (2010)
Google Scholar
Michalski, R.: Inferential Theory of Learning: Developing Foundations for Multistrategy Learning. In: Machine Learning, A Multistrategy Approach. Morgan K (1994)
Google Scholar
Konidaris, G., Barto, A.: An adaptive robot motivational system. In: Nolfi, S., Baldassarre, G., Calabretta, R., Hallam, J.C.T., Marocco, D., Meyer, J.-A., Miglino, O., Parisi, D. (eds.) SAB 2006. LNCS (LNAI), vol. 4095, pp. 346–356. Springer, Heidelberg (2006)
Chapter Google Scholar
Dayan, P.: Goal-directed control and its antipodes. Neural Networks 22(3), 213–219 (2009)
Article Google Scholar
Dezfouli, Balleine: Actions, action sequences and habits: Evidence that goal-directed and habitual action control are hierarchically organized. PLoS Comp. Biol. 9(12) (2013)
Google Scholar
Butz, M., Shirinov, E., Reif, K.: Self-organizing sensorimotor maps plus internal motivations yield animal-like behavior. Adaptive Behaviour 18(3-4), 315–337 (2010)
Article Google Scholar
Sutton, R., Precup, D., Singh, S.: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artif. Intel. 112(1-2), 181–211 (1999)
Article MATH MathSciNet Google Scholar
Salichs, M., Malfaz, M.: A new approach to modeling emotions and their use on a decision-making system for artificial agents. IEEE Trans. Affect. Comput. 3(1), 56–68 (2012)
Article Google Scholar
Cos-Aguilera, I., Canamero, L., Hayes, G., Gillies, A.: Hedonic value: Enhancing adaptation for motivated agents. Adaptive Behaviour 21(6), 465–483 (2013)
Article Google Scholar
Oubbati, M., Kord, B., Koprinkova-Hristova, P., Palm, G.: Learning of embodied interaction dynamics with recurrent neural networks: some exploratory experiments. Journal of Neural Engineering 11(2), 026019 (2014)
Google Scholar
Bellman, R.E.: Dynamic Programming. Princeton Univ. Press, NJ (1957)
MATH Google Scholar
Deb, K.: Multi-objective genetic algorithms: Problem difficulties and construction of test problems. Evolutionary Computation 7(3), 205–230 (1999)
Article Google Scholar
Prokhorov, D., Wunsch, D.: Adaptive critic designs. IEEE Transactions on Neural Networks 8, 997–1007 (1997)
Article Google Scholar
Jaeger, H.: The ’echo state’ approach to analysing and training recurrent neural networks. Technical Report 148, AIS Fraunhofer, St. Augustin, Germany (2001)
Google Scholar
Parisi, D.: Internal robotics. Connection Science 16, 325–338 (2004)
Article Google Scholar
Konidaris, G.D., Hayes, G.M.: An architecture for behavior-based reinforcement learning. Adaptive Behavior 13(1), 5–32 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Neural Information Processing, University of Ulm, Germany
Mohamed Oubbati, Christian Fischer & Günther Palm

Authors

Mohamed Oubbati
View author publications
You can also search for this author in PubMed Google Scholar
Christian Fischer
View author publications
You can also search for this author in PubMed Google Scholar
Günther Palm
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Robotic Intelligence Lavoratory, Jaume I University, Avda. Sos Baynat s/n, 12071, Castellón de la Plana, Spain
Angel P. del Pobil
School of Computing, University of Leeds, LS2 9JT, Leeds, UK
Eris Chinellato
Robotic Intelligence Laboratory, Jaume I University, Avda. Sos Baynat s/n, 12071, Castellón de la Plana, Spain
Ester Martinez-Martin & Enric Cervera &
Mærsk McKinney Møller Institute, University of Southern Denmark, Campusvej 55, 5230, Odense, Denmark
John Hallam
Robotic Intelligence Laboratory, Jaume I University, Avda. Sos Baynat s/n, 12071, Castellón de la Plana, spain
Antonio Morales

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Oubbati, M., Fischer, C., Palm, G. (2014). Intrinsically Motivated Decision Making for Situated, Goal-Driven Agents. In: del Pobil, A.P., Chinellato, E., Martinez-Martin, E., Hallam, J., Cervera, E., Morales, A. (eds) From Animals to Animats 13. SAB 2014. Lecture Notes in Computer Science(), vol 8575. Springer, Cham. https://doi.org/10.1007/978-3-319-08864-8_16

Download citation

DOI: https://doi.org/10.1007/978-3-319-08864-8_16
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08863-1
Online ISBN: 978-3-319-08864-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Intrinsically Motivated Decision Making for Situated, Goal-Driven Agents

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Model-Free Reinforcement Learning-Based Control for Continuous-Time Systems

Model-Free Reinforcement Learning-Based Control for Continuous-Time Systems

Reinforcement Learning

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us