Particle Swarm Based Reinforcement Learning

Duan, Jianyu; Guo, Yanxiao; Wang, Zhigang; Ke, Liangjun

doi:10.1007/978-981-19-9297-1_3

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1744))

Included in the following conference series:

International Conference on Data Mining and Big Data

522 Accesses

Abstract

With the vigorous development of computer-related technology, the “perception + decision” paradigm of the combination of deep learning and reinforcement learning has become a research hotspot. Nowadays, deep reinforcement learning algorithms have been successfully applied to the fields of games, industry and commerce. However, deep reinforcement learning algorithms often fall into the dilemma of “exploration” and “exploitation”, and the effect of these algorithms is easily affected by the quality of hyperparameters. In order to make up for the defects mentioned above, this paper introduces the particle swarm based reinforcement learning framework (PRL). Compared with the standard reinforcement learning algorithms, this framework greatly improves the exploration ability and obtains better scores in a series of gym experimental tests.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Cheng, C.A., Kolobov, A., Swaminathan, A.: Heuristic-guided reinforcement learning. Adv. Neural. Inf. Process. Syst. 34, 13550–13563 (2021)
Google Scholar
Degrave, J., et al.: Magnetic control of tokamak plasmas through deep reinforcement learning. Nature 602(7897), 414–419 (2022)
Article Google Scholar
Ding, S., Du, W., Zhao, X., Wang, L., Jia, W.: A new asynchronous reinforcement learning algorithm based on improved parallel PSO. Appl. Intell. 49(12), 4211–4222 (2019)
Article Google Scholar
François-Lavet, V., Henderson, P., Islam, R., Bellemare, M.G., Pineau, J.: An introduction to deep reinforcement learning. arXiv preprint arXiv:1811.12560 (2018)
Fujimoto, S., Gu, S.S.: A minimalist approach to offline reinforcement learning. In: Advances in Neural Information Processing Systems, vol. 34 (2021)
Google Scholar
Huang, C., et al.: Multi-hop RIS-empowered terahertz communications: a DRL-based hybrid beamforming design. IEEE J. Sel. Areas Commun. 39(6), 1663–1677 (2021)
Article Google Scholar
Ibarz, J., Tan, J., Finn, C., Kalakrishnan, M., Pastor, P., Levine, S.: How to train your robot with deep reinforcement learning: lessons we have learned. Int. J. Robot. Res. 40(4–5), 698–721 (2021)
Article Google Scholar
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of ICNN 1995-International Conference on Neural Networks, vol. 4, pp. 1942–1948. IEEE (1995)
Google Scholar
Khadka, S., et al.: Collaborative evolutionary reinforcement learning. In: International Conference on Machine Learning, pp. 3341–3350. PMLR (2019)
Google Scholar
Khadka, S., Tumer, K.: Evolution-guided policy gradient in reinforcement learning. In: Advances in Neural Information Processing Systems, vol. 31 (2018)
Google Scholar
Kiran, B.R., et al.: Deep reinforcement learning for autonomous driving: a survey. IEEE Trans. Intell. Transp. Syst. (2021)
Google Scholar
Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Article Google Scholar
Movahedi, Z., Bastanfard, A.: Toward competitive multi-agents in polo game based on reinforcement learning. Multimedia Tools Appl. 80(17), 26773–26793 (2021)
Article Google Scholar
Pourchot, A., Sigaud, O.: CEM-RL: combining evolutionary and gradient-based methods for policy search. arXiv preprint arXiv:1810.01222 (2018)
Silver, D., et al.: Mastering the game of go with deep neural networks and tree search. Nature 529(7587), 484–489 (2016)
Article Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (2018)
MATH Google Scholar
Wang, B., Liu, F., Lin, W.: Energy-efficient VM scheduling based on deep reinforcement learning. Futur. Gener. Comput. Syst. 125, 616–628 (2021)
Article Google Scholar
Zhan, Z.H., et al.: Matrix-based evolutionary computation. IEEE Trans. Emerg. Top. Comput. Intell. 6(2), 315–328 (2021)
Article Google Scholar
Zhang, F., Li, J., Li, Z.: A TD3-based multi-agent deep reinforcement learning method in mixed cooperation-competition environment. Neurocomputing 411, 206–215 (2020)
Article Google Scholar
Zhou, J., Xue, S., Xue, Y., Liao, Y., Liu, J., Zhao, W.: A novel energy management strategy of hybrid electric vehicle via an improved TD3 deep reinforcement learning. Energy 224, 120118 (2021)
Article Google Scholar

Download references

Acknowledgement

This work was supported by the National Natural Science Foundation of China under Grant 61973244 and Grant 61573277. It is also supported by the open fund of CETC Key Laboratory of Data Link Technology (CLDL-20202101-1).

Author information

Authors and Affiliations

State Key Laboratory for Manufacturing Systems Engineering, School of Automation Science and Engineering, Xi’an Jiaotong University, Xi’an, China
Jianyu Duan, Yanxiao Guo & Liangjun Ke
CETC Key Laboratory of Data Link Technology Xi’an, Xi’an, China
Zhigang Wang

Authors

Jianyu Duan
View author publications
You can also search for this author in PubMed Google Scholar
Yanxiao Guo
View author publications
You can also search for this author in PubMed Google Scholar
Zhigang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Liangjun Ke
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Liangjun Ke .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Ying Tan
Southern University of Science and Technology, Shenzhen, China
Yuhui Shi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Duan, J., Guo, Y., Wang, Z., Ke, L. (2022). Particle Swarm Based Reinforcement Learning. In: Tan, Y., Shi, Y. (eds) Data Mining and Big Data. DMBD 2022. Communications in Computer and Information Science, vol 1744. Springer, Singapore. https://doi.org/10.1007/978-981-19-9297-1_3

Download citation

DOI: https://doi.org/10.1007/978-981-19-9297-1_3
Published: 20 January 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-9296-4
Online ISBN: 978-981-19-9297-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics