Advice-Exchange Between Evolutionary Algorithms and Reinforcement Learning Agents: Experiments in the Pursuit Domain

Nunes, Luís; Oliveira, Eugénio

doi:10.1007/978-3-540-32274-0_12


Conference paper


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3394)

Abstract

This research studies the effects of exchanging information during the learning process in multiagent systems. The concept of advice-exchange, introduced in previous contributions, consists of enabling an agent to request extra feedback, in the form of episodic advice, from other agents that are solving similar problems. Previous work focused on the exchange of information between agents solving detached problems; the present work is concerned with groups of learning agents that share the same environment. This change added new difficulties to the task. The experiments reported below were conducted to detect the causes of, and to correct, the shortcomings that emerged when moving from environments where agents worked on detached problems to those where agents interact in the same environment. New concepts, such as self-confidence, trust, and advisor preference, are introduced in this text.
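The abstract gives no algorithmic detail, so the following is only a minimal sketch of how advice-exchange with self-confidence, trust, and advisor preference might be organised. Every name, threshold, and update rule here (`Learner`, `wants_advice`, `choose_advisor`, `update_trust`, the additive margin, the trust-weighted score) is a hypothetical assumption made for illustration, not the authors' method. Scores are assumed to be non-negative, with higher being better.

```python
from dataclasses import dataclass, field


@dataclass
class Learner:
    """Hypothetical learning agent; all fields are illustrative assumptions."""
    name: str
    avg_score: float                            # recent average episode score (>= 0)
    trust: dict = field(default_factory=dict)   # advisor name -> trust in [0, 1]

    def wants_advice(self, peers, margin=0.1):
        """Self-confidence check: ask only if some peer is clearly better."""
        best = max((p.avg_score for p in peers), default=self.avg_score)
        return best > self.avg_score + margin

    def choose_advisor(self, peers, default_trust=0.5):
        """Advisor preference: highest trust-weighted score among better peers."""
        better = [p for p in peers if p.avg_score > self.avg_score]
        if not better:
            return None
        return max(
            better,
            key=lambda p: self.trust.get(p.name, default_trust) * p.avg_score,
        )

    def update_trust(self, advisor_name, score_after, rate=0.2, default_trust=0.5):
        """Move trust toward 1 if the advised episode beat the average, else toward 0."""
        old = self.trust.get(advisor_name, default_trust)
        target = 1.0 if score_after > self.avg_score else 0.0
        self.trust[advisor_name] = old + rate * (target - old)
```

In this sketch an agent with a low recent score would call `wants_advice`, select a peer with `choose_advisor`, follow that peer's advice for an episode, and then call `update_trust` with the resulting score. Weighting peer scores by trust is one simple way to keep an agent from repeatedly following an advisor whose advice has not helped it in the shared environment.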



Author information

Authors and Affiliations

ISCTE/FEUP/LIACC-NIAD&R ISCTE, Av. Forças Armadas, 1649-026, Lisbon, Portugal
Luís Nunes
FEUP/LIACC-NIAD&R FEUP, Av. Dr. Roberto Frias, 4200-465, Porto, Portugal
Eugénio Oliveira


Editor information

Editors and Affiliations

Department of Computer Science, University of York, YO10 5DD, York, UK
Daniel Kudenko
Artificial Intelligence Group, Department of Computer Science, University of York, Heslington, York, UK
Dimitar Kazakov
Department of Computing, City University, P.O. Box, EC1V 0HB, London, United Kingdom
Eduardo Alonso


Cite this paper

Nunes, L., Oliveira, E. (2005). Advice-Exchange Between Evolutionary Algorithms and Reinforcement Learning Agents: Experiments in the Pursuit Domain. In: Kudenko, D., Kazakov, D., Alonso, E. (eds) Adaptive Agents and Multi-Agent Systems II. AAMAS 2003, AAMAS 2004. Lecture Notes in Computer Science (LNAI), vol 3394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-32274-0_12


DOI: https://doi.org/10.1007/978-3-540-32274-0_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25260-3
Online ISBN: 978-3-540-32274-0
eBook Packages: Computer Science, Computer Science (R0)
