Abstract
This work compares the performance of the Ant-ViBRA system to approaches based on Distributed Q-learning and Q-learning, when they are applied to learn coordination among agent actions in a Multi Agent System. Ant-ViBRA is a modified version ofa Swarm Intelligence Algorithm called the Ant Colony System algorithm (ACS), which combines a Reinforcement Learning (RL) approach with Heuristic Search. Ant-ViBRA uses a priori domain knowledge to decompose the domain task into subtasks and to define the relationship between actions and states based on interactions among subtasks. In this way, Ant-ViBRA is able to cope with planning when several agents are involved in a combinatorial optimization problem where interleaved execution is needed. The domain in which the comparison is made is that ofa manipulator performing visually-guided pick-and-place tasks in an assembly cell. The experiments carried out are encouraging, showing that Ant- ViBRA presents better results than the Distributed Q-learning and the Q-learning algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. A. C. Bianchi and A. H. R. Costa. Ant-vibra: a swarm intelligence approach to learn task coordination. Lecture Notes in Artificial Intelligence-XVI Brazilian Symposium on Artificial Intelligence-SBIA’02, 2002.
E. Bonabeau, M. Dorigo, and G. Theraulaz. Swarm Intelligence: From Natural to Artificial Systems. Oxford University Press, New York, 1999.
E. Bonabeau, M. Dorigo, and G. Theraulaz. Inspiration for optimization from social insect behaviour. Nature 406 [6791], 2000.
A. H. R. Costa, L. N. Barros, and R. A. C. Bianchi. Integrating purposive vision with deliberative and reactive planning: An engineering support on robotics applications. Journal of the Brazilian Computer Society, 4(3): em52-60, April 1998.
A. H. R. Costa and R. A. C. Bianchi. L-vibra: Learning in the vibra architecture. Lecture Notes in Artificial Intelligence, 1952:280–289, 2000.
M. Dorigo and L. M. Gambardella. Ant colony system: A cooperative learning approach to the traveling salesman problem. IEEE Transactions on Evolutionary Computation, 1(1), 1997.
C. Mariano and E. Morales. A new distributed reinforcement learning algorithm for multiple objective optimization problems. Lecture Notes in Artificial Intelligence, 1952:290–299, 2000.
C. J. C. H. Watkins. Learning from Delayed Rewards. PhD Thesis, University of Cambridge, 1989.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bianchi, R.A., Costa, A.H. (2002). Comparing Distributed Reinforcement Learning Approaches to Learn Agent Coordination. In: Garijo, F.J., Riquelme, J.C., Toro, M. (eds) Advances in Artificial Intelligence — IBERAMIA 2002. IBERAMIA 2002. Lecture Notes in Computer Science(), vol 2527. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36131-6_59
Download citation
DOI: https://doi.org/10.1007/3-540-36131-6_59
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00131-7
Online ISBN: 978-3-540-36131-2
eBook Packages: Springer Book Archive