Comparing Distributed Reinforcement Learning Approaches to Learn Agent Coordination

Bianchi, Reinaldo A.C.; Costa, Anna H.R.

doi:10.1007/3-540-36131-6_59

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2527)

Included in the following conference series:

Ibero-American Conference on Artificial Intelligence

Abstract

This work compares the performance of the Ant-ViBRA system with approaches based on Distributed Q-learning and Q-learning when they are applied to learning coordination among agent actions in a Multi-Agent System. Ant-ViBRA is a modified version of a Swarm Intelligence algorithm called the Ant Colony System (ACS), which combines a Reinforcement Learning (RL) approach with Heuristic Search. Ant-ViBRA uses a priori domain knowledge to decompose the domain task into subtasks and to define the relationship between actions and states based on interactions among subtasks. In this way, Ant-ViBRA can cope with planning when several agents are involved in a combinatorial optimization problem that requires interleaved execution. The comparison is made in the domain of a manipulator performing visually guided pick-and-place tasks in an assembly cell. The experimental results are encouraging: Ant-ViBRA performed better than both the Distributed Q-learning and the Q-learning algorithms.
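
As a rough illustration of the two families of update rules the abstract contrasts, the sketch below shows a textbook tabular Q-learning step alongside the standard ACS action-selection and local pheromone update rules. It is not the authors' Ant-ViBRA implementation: the function names, the dictionary-based tables, the state and action labels, and all parameter values (alpha, gamma, beta, q0, rho, tau0) are assumptions chosen only for illustration.

```python
import random
from collections import defaultdict


def q_learning_update(Q, state, action, reward, next_state, actions,
                      alpha=0.1, gamma=0.9):
    """One tabular Q-learning step: move Q(s, a) toward r + gamma * max Q(s', .)."""
    best_next = max(Q[(next_state, a)] for a in actions)
    Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])
    return Q[(state, action)]


def acs_choose(tau, eta, state, actions, beta=2.0, q0=0.9, rng=random):
    """ACS pseudo-random proportional rule.

    tau holds pheromone values and eta holds heuristic values (both assumed
    positive). With probability q0 the best-scoring action is exploited;
    otherwise an action is sampled in proportion to its score.
    """
    scores = {a: tau[(state, a)] * eta[(state, a)] ** beta for a in actions}
    if rng.random() <= q0:
        return max(scores, key=scores.get)
    return rng.choices(list(scores), weights=list(scores.values()))[0]


def acs_local_update(tau, state, action, rho=0.1, tau0=0.01):
    """ACS local pheromone update: decay the used entry toward the baseline tau0."""
    tau[(state, action)] = (1 - rho) * tau[(state, action)] + rho * tau0
    return tau[(state, action)]


# Hypothetical pick-and-place labels, used only to exercise the functions.
ACTIONS = ["pick", "place"]
Q = defaultdict(float)
q_value = q_learning_update(Q, "s0", "pick", 1.0, "s1", ACTIONS)

tau = defaultdict(lambda: 1.0)
eta = {("s0", "pick"): 2.0, ("s0", "place"): 1.0}
greedy_action = acs_choose(tau, eta, "s0", ACTIONS, q0=1.0)  # q0=1.0 forces exploitation
tau_value = acs_local_update(tau, "s0", greedy_action)
```

In this sketch the heuristic table eta stands in for the a priori domain knowledge mentioned in the abstract; how Ant-ViBRA actually encodes that knowledge is not specified here.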

Author information

Authors and Affiliations

Laboratório de Técnicas Inteligentes – LTI/PCS, Escola Politécnica da Universidade de São Paulo, Av. Prof. Luciano Gualberto, trav. 3, 158, 05508-900, São Paulo, SP, Brazil
Reinaldo A.C. Bianchi & Anna H.R. Costa

Editor information

Editors and Affiliations

Telefónica Investigación y Desarrollo, Emilio Vargas 6, 28043, Madrid, Spain
Francisco J. Garijo
Dpto. Lenguajes y Sistemas Informáticos, Universidad de Sevilla, ETS Ingeniería Informática, 41012, Seville, Spain
José C. Riquelme & Miguel Toro

About this paper

Cite this paper

Bianchi, R.A., Costa, A.H. (2002). Comparing Distributed Reinforcement Learning Approaches to Learn Agent Coordination. In: Garijo, F.J., Riquelme, J.C., Toro, M. (eds) Advances in Artificial Intelligence — IBERAMIA 2002. IBERAMIA 2002. Lecture Notes in Computer Science, vol 2527. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36131-6_59

DOI: https://doi.org/10.1007/3-540-36131-6_59
Published: 05 November 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00131-7
Online ISBN: 978-3-540-36131-2
eBook Packages: Springer Book Archive
