Abstract
Applying deep reinforcement learning to multi-agent systems introduces additional challenges. With many agents, a central open problem is how to foster effective collaboration among diverse agents. To address this problem, we model agent interaction through neighborhoods and propose a multi-agent reinforcement learning (MARL) algorithm based on the actor-critic method. The algorithm adaptively constructs a hypergraph structure representing agent interactions and then performs information extraction and representation learning through hypergraph convolution networks, leading to effective cooperation. Based on different hypergraph generation methods, we present two variants: Actor Hypergraph Convolutional Critic Network (HGAC) and Actor Attention Hypergraph Critic Network (ATT-HGAC). Experiments under different settings demonstrate the advantages of our approach over existing methods.
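To make the central operation concrete, the sketch below shows a generic hypergraph convolution layer of the kind the abstract refers to (in the style popularized by hypergraph neural networks): agent features are mixed through an incidence matrix H whose hyperedges group neighboring agents. This is a minimal illustration, not the paper's HGAC/ATT-HGAC implementation; the function name `hypergraph_conv`, the ReLU nonlinearity, and the example incidence matrix are all assumptions for illustration.

```python
import numpy as np

def hypergraph_conv(X, H, Theta, edge_w=None):
    """One hypergraph convolution layer:
        X_out = ReLU( Dv^{-1/2} H W De^{-1} H^T Dv^{-1/2} X Theta )
    X:      (n_agents, d_in)  per-agent feature matrix
    H:      (n_agents, n_edges) incidence matrix, H[v, e] = 1 if agent v is in hyperedge e
    Theta:  (d_in, d_out)     learnable weight matrix
    edge_w: optional (n_edges,) hyperedge weights (defaults to 1)
    """
    n, m = H.shape
    w = np.ones(m) if edge_w is None else np.asarray(edge_w, dtype=float)
    dv = H @ w              # vertex degrees: sum of weights of incident hyperedges
    de = H.sum(axis=0)      # hyperedge degrees: number of agents per hyperedge
    Dv_inv_sqrt = np.diag(1.0 / np.sqrt(np.maximum(dv, 1e-8)))
    De_inv = np.diag(1.0 / np.maximum(de, 1e-8))
    # Normalized propagation: agents exchange information via shared hyperedges.
    A = Dv_inv_sqrt @ H @ np.diag(w) @ De_inv @ H.T @ Dv_inv_sqrt
    return np.maximum(A @ X @ Theta, 0.0)

# Tiny example: 4 agents, 2 neighborhood hyperedges
# (agents 0, 1, 3 form one group; agents 2, 3 form another).
X = np.eye(4)
H = np.array([[1, 0],
              [1, 0],
              [0, 1],
              [1, 1]], dtype=float)
out = hypergraph_conv(X, H, np.eye(4))
```

Agents that share a hyperedge exchange features (e.g. agents 0 and 1), while agents with no common hyperedge (agents 0 and 2) do not influence each other in a single layer; the adaptive part of the paper's method lies in how H itself is generated, either directly (HGAC) or via attention (ATT-HGAC).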
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Zhang, B., Bai, Y., Xu, Z., Li, D., Fan, G. (2023). Efficient Policy Generation in Multi-agent Systems via Hypergraph Neural Network. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds) Neural Information Processing. ICONIP 2022. Lecture Notes in Computer Science, vol 13624. Springer, Cham. https://doi.org/10.1007/978-3-031-30108-7_19
Print ISBN: 978-3-031-30107-0
Online ISBN: 978-3-031-30108-7