Multi-agent Cooperation and Competition with Two-Level Attention Network

Wu, Shiguang; Pu, Zhiqiang; Yi, Jianqiang; Wang, Huimu

doi:10.1007/978-3-030-63833-7_44

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 12533)

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

Multi-agent reinforcement learning (MARL) has made significant advances in multi-agent systems. However, it is hard to learn a stable policy in complicated and changeable environments. To address this issue, a two-level attention network is proposed, composed of an across-group observation attention network (AGONet) and an intentional communication network (ICN). AGONet is designed to distinguish the different semantic meanings of observations (friend group, foe group, and object/entity group) and to extract the underlying information of each group with across-group attention. Based on AGONet, the proposed network framework is invariant to the number of agents in the system, so it can be applied to large-scale multi-agent systems. Furthermore, to enhance cooperation among agents in the same group, ICN aggregates the intentions of same-group neighbors, which are extracted by AGONet. Each agent thereby obtains the understanding and intentions of its neighbors in the same group, which enlarges its receptive field. The simulation results demonstrate that the agents can learn complicated cooperative and competitive strategies and that our method is superior to existing methods.
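
The abstract gives no architectural details, so the following is only a minimal sketch of the two ideas it names, not the authors' implementation. It assumes plain scaled dot-product attention with no learned weights; the names attention_pool, encode_observation, and aggregate_neighbors, the embedding size d = 4, and the random inputs are all hypothetical. The first level pools each observation group (friend, foe, entity) separately, which keeps the encoding length fixed however many agents are present; the second level attends over the encodings of same-group neighbors.

```python
import math
import random

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(query, entities):
    """Scaled dot-product attention of one query over a non-empty group.

    query:    list of d floats (hypothetical embedding of the agent itself)
    entities: list of n lists of d floats (hypothetical group embeddings)
    Returns a list of d floats, whatever n is.
    """
    d = len(query)
    scores = [sum(q * e for q, e in zip(query, ent)) / math.sqrt(d)
              for ent in entities]
    weights = softmax(scores)
    return [sum(w * ent[k] for w, ent in zip(weights, entities))
            for k in range(d)]

def encode_observation(query, groups):
    """First level (assumed form): pool each group, concatenate in fixed order."""
    pooled = []
    for name in ("friend", "foe", "entity"):
        pooled.extend(attention_pool(query, groups[name]))
    return pooled

def aggregate_neighbors(own_encoding, neighbor_encodings):
    """Second level (assumed form): attend over same-group neighbors' encodings."""
    if not neighbor_encodings:
        return list(own_encoding)  # no neighbors: keep the agent's own encoding
    return attention_pool(own_encoding, neighbor_encodings)

def random_group(rng, n, d):
    """Hypothetical stand-in for n embedded observations of size d."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(d)] for _ in range(n)]

def random_scene(rng, n_friend, n_foe, n_entity, d):
    return {"friend": random_group(rng, n_friend, d),
            "foe": random_group(rng, n_foe, d),
            "entity": random_group(rng, n_entity, d)}

rng = random.Random(0)
d = 4
query = [rng.uniform(-1.0, 1.0) for _ in range(d)]
small = random_scene(rng, 2, 3, 1, d)
large = random_scene(rng, 20, 30, 5, d)

own = encode_observation(query, small)
neighbors = [encode_observation(query, random_scene(rng, 2, 3, 1, d))
             for _ in range(3)]
message = aggregate_neighbors(own, neighbors)

# Encoding length is 3 * d for both scene sizes, and so is the message.
print(len(own), len(encode_observation(query, large)), len(message))  # 12 12 12
```

Because the pooled vector is a softmax-weighted sum, it also does not depend on the order in which the entities of a group are listed, which is one common way to obtain the kind of invariance the abstract describes.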

Research supported by the National Key Research and Development Program of China under Grant 2018AAA0102404, and by the Innovation Academy for Light-duty Gas Turbine, Chinese Academy of Sciences, under Grant No. CXYJJ19-ZD-02.

Author information

Authors and Affiliations

School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, 100049, China
Shiguang Wu, Zhiqiang Pu, Jianqiang Yi & Huimu Wang
Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
Shiguang Wu, Zhiqiang Pu, Jianqiang Yi & Huimu Wang

Corresponding author

Correspondence to Zhiqiang Pu.

Editor information

Editors and Affiliations

Department of AI, Ping An Life, Shenzhen, China
Haiqin Yang
Faculty of Information Technology, King Mongkut’s Institute of Technology Ladkrabang, Bangkok, Thailand
Kitsuchart Pasupa
City University of Hong Kong, Kowloon, China
Andrew Chi-Sing Leung
Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, Hong Kong
James T. Kwok
School of Information Technology, King Mongkut’s University of Technology Thonburi, Bangkok, Thailand
Jonathan H. Chan
The Chinese University of Hong Kong, New Territories, Hong Kong
Irwin King

About this paper

Cite this paper

Wu, S., Pu, Z., Yi, J., Wang, H. (2020). Multi-agent Cooperation and Competition with Two-Level Attention Network. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Lecture Notes in Computer Science, vol 12533. Springer, Cham. https://doi.org/10.1007/978-3-030-63833-7_44

DOI: https://doi.org/10.1007/978-3-030-63833-7_44
Published: 20 November 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63832-0
Online ISBN: 978-3-030-63833-7
eBook Packages: Computer Science, Computer Science (R0)
