
Cooperation and Coordination Regimes by Deep Q-Learning in Multi-agent Task Executions

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 11727)

Abstract

We investigate the coordination structures generated by a deep Q-network (DQN) with various types of input, using a distributed task execution game. Although cooperation and coordination are mandatory for efficiency in multi-agent systems (MAS), they require sophisticated structures or regimes for effective behaviors. Recently, deep Q-learning has been applied to multi-agent systems to facilitate their coordinated behavior. However, the characteristics of the learned results have not yet been fully clarified. We investigate how the information input to DQNs affects the resultant coordination and cooperation structures. We examine inputs generated from local observations with and without the agent's estimated location in the environment. Experimental results show that agents form two types of coordination structures: the division of labor, and the targeting of nearby tasks while avoiding conflicts. The latter structure is more efficient in our game. We clarify the mechanism behind the generated coordination behaviors and their characteristics.
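To make the setting concrete, the sketch below illustrates one possible form of a DQN whose input is an agent's local observation grid, optionally augmented with a channel encoding the agent's estimated location, as described in the abstract. The network architecture, channel layout, view size, and action set are our own assumptions for illustration and are not taken from the paper.

# Minimal sketch (assumed architecture, not the authors' implementation) of a DQN
# that maps an agent's local observation to Q-values over movement actions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LocalObsDQN(nn.Module):
    def __init__(self, view_size=7, n_actions=5, with_location_channel=True):
        super().__init__()
        # Assumed channel layout: 0 = nearby tasks, 1 = nearby agents,
        # optional 2 = encoding of the agent's estimated location.
        in_ch = 3 if with_location_channel else 2
        self.conv1 = nn.Conv2d(in_ch, 32, kernel_size=3, padding=1)
        self.conv2 = nn.Conv2d(32, 32, kernel_size=3, padding=1)
        self.fc1 = nn.Linear(32 * view_size * view_size, 128)
        self.out = nn.Linear(128, n_actions)  # one Q-value per action

    def forward(self, obs):
        x = F.relu(self.conv1(obs))
        x = F.relu(self.conv2(x))
        x = F.relu(self.fc1(x.flatten(start_dim=1)))
        return self.out(x)

def select_action(q_net, obs, epsilon=0.1):
    # Epsilon-greedy selection over the network's Q-values.
    if torch.rand(1).item() < epsilon:
        return torch.randint(0, q_net.out.out_features, (1,)).item()
    with torch.no_grad():
        return q_net(obs.unsqueeze(0)).argmax(dim=1).item()

# Usage: one agent observing a 7x7 window with a location channel (dummy input).
net = LocalObsDQN(view_size=7, n_actions=5, with_location_channel=True)
observation = torch.zeros(3, 7, 7)
action = select_action(net, observation)

Whether the location channel is present corresponds to the two input conditions compared in the paper; everything else in this sketch is an assumption made only to show the overall input-to-Q-value pipeline.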




Acknowledgements

This work was partly supported by JSPS KAKENHI Grant Number 17KT0044.

Author information

Correspondence to Yuki Miyashita or Toshiharu Sugawara.



Copyright information

© 2019 Springer Nature Switzerland AG

About this paper


Cite this paper

Miyashita, Y., Sugawara, T. (2019). Cooperation and Coordination Regimes by Deep Q-Learning in Multi-agent Task Executions. In: Tetko, I., Kůrková, V., Karpov, P., Theis, F. (eds) Artificial Neural Networks and Machine Learning – ICANN 2019: Theoretical Neural Computation. ICANN 2019. Lecture Notes in Computer Science, vol 11727. Springer, Cham. https://doi.org/10.1007/978-3-030-30487-4_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-30487-4_42


  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-30486-7

  • Online ISBN: 978-3-030-30487-4

  • eBook Packages: Computer Science, Computer Science (R0)
