Abstract
The mobile robot path planning problem is one of the main contents of reinforcement learning research. In traditional reinforcement learning, the agent obtains the cumulative reward value in the process of interacting with the environment and finally converges to the optimal strategy. The Dyna learning framework in reinforcement learning obtains an estimation model in the real environment. The virtual samples generated by the estimation model are updated together with the empirical samples obtained in the real environment to update the value function or strategy function to improve the convergence efficiency. At present, when reinforcement learning is used for path planning tasks, continuous motion cannot be solved in a large-scale continuous environment, and the convergence is poor. In this paper, we use RBFNN to approximate the Q-value table in the Dyna-Q algorithm to solve the drawbacks in traditional algorithms. The experimental results show that the convergence speed of the improved Dyna-RQ algorithm is significantly faster, which improves the efficiency of mobile robot path planning.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Guo, N., Li, C., Wang, D., Zhang, N., Liu, G.: Path planning of mobile robot based on prediction and fuzzy control. Comput. Eng. Appl. 56(8), 104–109 (2020)
Wang, Y., Chirikjian, G.S.: A new potential field method for robot path planning. In: IEEE International Conference on Robotics and Automation, Proceedings. ICRA. IEEE, vol. 2, pp. 977–982 (2000)
Roberge, V., Tarbouchi, M., Labonte, G.: Comparison of parallel genetic algorithm and particle swarm optimization for real-time UAV path planning. IEEE Trans. Ind. Inform. 9(1), 132–141 (2013)
Zhang, Y., Li, S., Guo, H.: A type of biased consensus-based distributed neural network for path planning. Nonlinear Dyn. 1–13 (2017)
Hu, W., Lei, B.: Application research of computer vision technology based on deep neural network in path. Planning 6(13), 113–116 (2022)
Cao, J., Liu, Q.: Research on path planning algorithm based on deep reinforcement learning. 39(11), 231–237 (2022)
Sutton, R.S., Barto, A.G.: Reinforcement learning: an introduction. Mach. Learn. 8(3–4), 225–227 (1992)
Santos, M., Botella, G.: Dyna-H: a heuristic planning reinforcement learning algorithm applied to role-playing game strategy decision systems. Knowl.-Based Syst. 32(8), 28–36 (2012)
Zajdel, R.: Epoch-incremental dyna learning and prioritized sweeping algorithms. Neurocomputing 319, 13–20 (2018)
Liu, S., Tong, X.: Urban transportation path planning based on reinforcement learning. J. Comput. Appl. 41(1), 185–190 (2021)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, Z., Li, X., Wang, Y. (2023). Research on Path Planning of Mobile Robots Based on Dyna-RQ. In: Yu, Z., et al. Data Science. ICPCSEE 2023. Communications in Computer and Information Science, vol 1879. Springer, Singapore. https://doi.org/10.1007/978-981-99-5968-6_5
Download citation
DOI: https://doi.org/10.1007/978-981-99-5968-6_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-5967-9
Online ISBN: 978-981-99-5968-6
eBook Packages: Computer ScienceComputer Science (R0)