Generating collective foraging behavior for robotic swarm using deep reinforcement learning

Jin, Boyin; Liang, Yupeng; Han, Ziyao; Ohkura, Kazuhiro

doi:10.1007/s10015-020-00642-2

Generating collective foraging behavior for robotic swarm using deep reinforcement learning

Original Article
Published: 19 October 2020

Volume 25, pages 588–595, (2020)
Cite this article

Artificial Life and Robotics Aims and scope Submit manuscript

Boyin Jin¹,
Yupeng Liang¹,
Ziyao Han¹ &
…
Kazuhiro Ohkura¹

488 Accesses
Explore all metrics

Abstract

This paper mainly discussed the generation of collective behaviors with raw camera images as the primary information input. The swarm robotic system exhibits considerable advantages when faced with individual-level failure or the lack of global information. Spatial information has always been a necessity in generating collective transport behavior. The rise of deep neural network technology makes it possible for a robot to perceive the environment from its visual input. In this paper, the use of deep reinforcement learning in training a robotic swarm to generate collective foraging behavior is shown. The collective foraging behavior is evaluated in a transportation task, where robots need to learn to process image information while cooperatively transport foods to the nest. We applied a deep Q-Learning algorithm and several improved versions to develop controllers for robotic swarms. The results of computer simulations show that using images as the main information input can successfully generate collective foraging behavior. Besides, we also combine the advantages of several algorithms to improve performance and perform experiments to examine the flexibility of the developed controllers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Visualizing Deep Q-Learning to Understanding Behavior of Swarm Robotic System

A hierarchical training method of generating collective foraging behavior for a robotic swarm

Article 26 November 2021

Learning to Cooperate in Decentralized Multi-robot Exploration of Dynamic Environments

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Şahin E (2004) Swarm robotics: from sources of inspiration to domains of application. International workshop on swarm robotics, pp. 10–20
Stephan C, Wilkinson A (2012) Have we met before? Pigeons recognise familiar human faces. Avian Biol Res 5(2):75–80
Article Google Scholar
Yu H, Yang J (2001) A direct LDA algorithm for high-dimensional data-with application to face recognition. Pattern Recogn 34(10):2067–2070
Article Google Scholar
LeCun Y, Yoshua B (2015) Deep learning. Nature 521(7553):436
Article Google Scholar
Hsu K, Gupta HV (1995) Artificial neural network modeling of the rainfall-runoff process. Water Resour Res 31(10):2517–2530
Article Google Scholar
Mnih V, Kavukcuoglu K (2013) Playing atari with deep reinforcement learning. arXiv:1312.5602
Mnih V et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529
Article Google Scholar
Mnih V, Badia AP (2016) Asynchronous methods for deep reinforcement learning. Int Conf Mach Learn 2017:1928–1937
Google Scholar
Hasselt V (2016) Hado and Guez. In: Deep reinforcement learning with double q-learning, Thirtieth AAAI conference on artificial intelligence
Brambilla M et al (2013) Deep reinforcement learning with double q-learning. Swarm Intell 7(1):1–41
Article Google Scholar
Liu W, Winfield AFT (2010) Modeling and optimization of adaptive foraging in swarm robotic systems. Int J Robot Res 29(14):1743–1760
Article Google Scholar
Wei Y, Toshiyuki Y, Kazuhiro O (2017) Collective cognition: a case study of evolutionary swarm robotics in the collective foraging problem with poison. In: 2017 IEEE/SICE international symposium on system integration (SII), pp 865–868
Angeline PJ, Saunders GM, Pollack JB (1994) An evolutionary algorithm that constructs recurrent neural networks. IEEE Trans Neural Netw 5(1):54–65
Article Google Scholar
OpenAI Five Website (2018) https://openai.com/five/
AlphaGo Website (2017) https://deepmind.com/research/alphago/
BayndIr L (2016) A review of swarm robotics tasks. Neurocomputing 172:292–321
Article Google Scholar

Download references

Author information

Authors and Affiliations

Hiroshima University, 1-4-1, Kagamiyama, Higashi-Hiroshima, Hiroshima, 739-8527, Japan
Boyin Jin, Yupeng Liang, Ziyao Han & Kazuhiro Ohkura

Authors

Boyin Jin
View author publications
You can also search for this author inPubMed Google Scholar
Yupeng Liang
View author publications
You can also search for this author inPubMed Google Scholar
Ziyao Han
View author publications
You can also search for this author inPubMed Google Scholar
Kazuhiro Ohkura
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Kazuhiro Ohkura.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Jin, B., Liang, Y., Han, Z. et al. Generating collective foraging behavior for robotic swarm using deep reinforcement learning. Artif Life Robotics 25, 588–595 (2020). https://doi.org/10.1007/s10015-020-00642-2

Download citation

Received: 20 May 2020
Accepted: 14 September 2020
Published: 19 October 2020
Issue Date: November 2020
DOI: https://doi.org/10.1007/s10015-020-00642-2

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Generating collective foraging behavior for robotic swarm using deep reinforcement learning

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Visualizing Deep Q-Learning to Understanding Behavior of Swarm Robotic System

A hierarchical training method of generating collective foraging behavior for a robotic swarm

Learning to Cooperate in Decentralized Multi-robot Exploration of Dynamic Environments

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now