Abstract
As an important class of sampling-based path planning methods, the Rapidly-exploring Random Trees (RRT) algorithm has been widely studied and applied in the literature. In RRT, how to select a tree to extend or connect is a critical factor, which will greatly influence the efficiency of path planning. In this paper, a novel learning-based multi-RRTs (LM-RRT) approach is proposed for robot path planning in narrow passages. The LM-RRT approach models the tree selection process as a multi-armed bandit problem and uses a reinforcement learning algorithm that learns action values and selects actions with an improved ε-greedy strategy (ε t -greedy). Compared with previous RRT algorithms, LM-RRT can not only enhance the local space exploration ability of each tree, but also guarantee the efficiency of global path planning. The probabilistic completeness and combinatory optimality of LM-RRT are proved based on the geometric characteristics of the configuration space. Simulation and experimental results show the effectiveness of the proposed LM-RRT approach in single-query path planning problems with narrow passages.
Similar content being viewed by others
References
Tsardoulias, E.G., Iliakopoulou, A., Kargakos, A., Petrou, L.: A review of global path planning methods for occupancy grid maps regardless of obstacle density. J. Intell. Robot. Syst. 84(1-4), 829–858 (2016)
LaValle, S.: Planning algorithms. Cambridge University Press, New York (2006)
Yang, K., Moon, S., Yoo, S., Kang, J., Doh, N., Kim, H., Joo, S.: Spline-based RRT path planner for non-holonomic robots. J. Intell. Robot. Syst. 73(1-4), 763–782 (2014)
Plaku, E., Kavraki, L.E., Vardi, M.Y.: Motion planning with dynamics by a synergistic combination of layers of planning. IEEE Trans. Robot. 26(3), 469–482 (2010)
Howard, T., Kelly, A.: Optimal rough terrain trajectory generation for wheeled mobile robots. Int. J. Robot. Res. 26(2), 141–166 (2007)
Jaillet, L., Cortés, J., Siméon, T.: Sampling-based path planning on configuration-space costmaps. IEEE Trans. Robot. 26(3), 647–659 (2010)
Tang, X., Thomas, S., Coleman, P., Amato, N.: Reachable distance space: Efficient sampling-based planning for spatially constrained systems. Int. J. Robot. Res. 29(7), 916–934 (2010)
Karaman, S., Frazzoli, E.: Sampling-based algorithms for optimal motion planning. Int. J. Robot. Res. 30(7), 846–894 (2011)
Kavraki, L.E., Švetska, P., Latombe, J.C., Overmars, M.H.: Probabilistic roadmaps for path planning in high-dimensional configuration space. IEEE Trans. Robot. Autom. 12(4), 566–580 (1996)
Kavraki, L.E., Kolountzakis, M.N., Latombe, J.C.: Analysis of probabilistic roadmaps for path planning. IEEE Trans. Robot. Autom. 14(1), 166–171 (1998)
LaValle, S.M., Kuffner Jr., J.J.: Randomized kinodynamic planning. Int. J. Robot. Res. 20(5), 378–400 (2001)
Hsu, D., Latombe, J.-C., Motwani, R.: Path planning in expansive configuration spaces. Proc IEEE Int. Conf. Robot. Autom. 3, 2719–2726 (1997)
Hsu, D., Kindel, R., Latombe, J.-C., Rock, S.: Randomized kinodynamic motion planning with moving obstacles. Int. J. Robot. Res. 21(3), 233–255 (2002)
Yershova, A., Jaillet, L., Siméon, T., LaValle, S.M.: Dynamic Domain RRTs: Efficient exploration by controlling the sampling domain. In: IEEE International Conference on Robotics and Automation, pp 3856–3861, Barcelona (2005)
Jaillet, L., Yershova, A., LaValle, S.M., Siméon, T.: Adaptive tuning of the sampling domain for dynamic-domain RRTs. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp 2851–2856 (2005)
Burns, B., Brock, O.: Single-query motion planning with utility guided random trees. In: IEEE International Conference on Robotics and Automation, pp 3307–3312, Rome (2007)
Dalibard, S., Laumond, J.P.: Control of probabilistic diffusion in motion planning. In: International Workshop on Algorithmic Foundations of Robotics, pp 467–481 (2008)
Kuffner, J.J., LaValle, S.M.: RRT-connect: An efficient approach to single-query path planning. In: International Conference on Robotics and Automation, pp 995–1001, San Francisco (2000)
LaValle, S.M., Kuffner, J.J.: Rapidly exploring random trees: Progress and prospect. In: Donald, B.R., Lynch, K., Rus, D. (eds.) New Directions in Algorithmic and Computational Robotics, pp 293–308. A. K. Peters, London (2001)
Lie, T.Y., Shie, Y.C.: An incremental learning approach to motion planning with roadmap management. In: IEEE International Conference on Robotics and Automation, pp 3411–3416, Washington (2002)
Morales, M., Rodriguez, S., Amato, N.: Improving the connectivity of PRM roadmaps. In: IEEE International Conference on Robotics and Automation, pp 4427–4432 (2003)
Strandberg, M.: Augmenting RRT-planners with local trees. In: IEEE International Conference on Robotics and Automation, pp 3258–3262, New Orleans (2004)
Plaku, E., Bekris, K.E., Chen, B.Y., Ladd, A.M., Kavraki, L.: Sampling based roadmap of trees for parallel motion planning. IEEE Trans. Robot. 21(4), 597–608 (2005)
Otte, M., Correll, N.: C-FOREST: parallel shortest path planning with superlinear speedup. IEEE Trans. Robot. 29(3), 798–806 (2013)
Wang, W., Xu, X., Li, Y., Song, J., He, H.: Triple RRTs: an effective method for path planning in narrow passage. Advanced Robot. (RSJ) 24(7), 943–962 (2010)
Sun, Z., Hsu, D., Jiang, T., Kurniawati, H., Reif, J.H.: Narrow passage sampling for probabilistic roadmap planning. IEEE Trans. Robot. 21(6), 1105–1115 (2005)
Wang, W., Li, Y., Xu, X., Yang, S.X.: An adaptive roadmap guided multi-RRTs strategy for single query path planning. In: Proceedings of IEEE International Conference on Robotics and Automation, pp 2871–2876, Anchorage (2010)
Elbanhawi, M., Simic, M.: Sampling-based robot motion planning: a review. IEEE Access 2(1), 56–77 (2014)
He, W., Ge, S.S.: Cooperative control of a nonuniform gantry crane with constrained tension. Automatica 66, 146–154 (2016)
He, W., Chen, Y., Yin, Z.: Adaptive neural network control of an uncertain robot with full-state constraints. IEEE Trans. Cybern. 46(3), 620–629 (2016)
He, W., Dong, Y., Sun, C.: Adaptive neural impedance control of a robotic manipulator with input saturation. IEEE Trans. Syst. Man Cybern. Syst. 46(3), 334–344 (2016)
He, W., Ouyang, Y., Hong, J.: Vibration control of a flexible robotic manipulator in the presence of input deadzone. IEEE Trans. Indust. Inform. 13(1), 48–59 (2016)
He, W., Zhang, S.: Control design for nonlinear flexible wings of a robotic aircraft. IEEE Trans. Control Syst. Technol. 25(1), 351–357 (2017)
Sutton, R., Barto, A.: Reinforcement learning: an introduction. MIT Press, Cambridge (1998)
Auer, P., Bianchi, N.C., Fischer, P.: Finite-time analysis of the multi-armed bandit problem. Mach. Learn. 47(2–3), 235–256 (2002)
Berry, D., Fristedt, B.: Bandit problems. Chapman and Hall, London (1985)
Holland, J.: Adaptation in Natural and Artificial Systems. MIT Press/Bradford Books, Cambridge (1992)
Hoeffding, W.: Probability inequalities for sums of bounded random variables. J. Amer. Stat. Assoc. 58(301), 13–30 (1963)
Chernoff, H.: A note on an inequality involving the normal distribution. Ann. Probab. 9(3), 533–535 (1981)
Acknowledgments
This work is sponsored by he NSFC Innovation and Development Joint Foundation of Chinese Automobile Industry under Grant U1564214. The authors would like to thank Dr. Chunming Liu for helpful discussion and conversation. In addition, the authors would like to thank the anonymous reviewers for their valuable comments.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported by the NSFC Innovation and Development Joint Foundation of Chinese Automobile Industry under Grant U1564214 and the NSFC International Cooperation Project under grant 61611540348.
Rights and permissions
About this article
Cite this article
Wang, W., Zuo, L. & Xu, X. A Learning-based Multi-RRT Approach for Robot Path Planning in Narrow Passages. J Intell Robot Syst 90, 81–100 (2018). https://doi.org/10.1007/s10846-017-0641-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10846-017-0641-3