Path planning of modular robots on various terrains using Q-learning versus optimization algorithms

Haghzad Klidbary, Sajad; Bagheri Shouraki, Saeed; Sheikhpour Kourabbaslou, Soroush

doi:10.1007/s11370-017-0217-x

Path planning of modular robots on various terrains using Q-learning versus optimization algorithms

Original Research Paper
Published: 24 January 2017

Volume 10, pages 121–136, (2017)
Cite this article

Intelligent Service Robotics Aims and scope Submit manuscript

Sajad Haghzad Klidbary¹,
Saeed Bagheri Shouraki¹ &
Soroush Sheikhpour Kourabbaslou¹

1125 Accesses
25 Citations
Explore all metrics

Abstract

Self-reconfigurable modular robots (SRMRs) have recently attracted considerable attention because of their numerous potential applications in the real world. In this paper, we draw a comprehensive comparison among five different algorithms in path planning of a novel SRMR system called ACMoD through an environment comprised of various terrains in a static condition. The contribution of this work is that the reconfiguration ability of ACMoD has been taken into account. This consideration, though raises new algorithmic challenges, equips the robot with new capability to pass difficult terrains rather than bypassing them, and consequently the robot can achieve better performance in terms of traversal time and energy consumption. In this work, four different optimization algorithms, including Adaptive Genetic Algorithm, Elitist Ant System, Dijkstra and Dynamic Weighting A*, along with a well-known reinforcement learning algorithm called Q-Learning, are proposed to solve this path planning problem. The outputs of these algorithms are the optimal path through the environment and the associated configuration on each segment of the path. The challenges involved in mapping the path planning problem to each algorithm are discussed in full details. Eventually, all algorithms are compared in terms of the quality of their solutions and convergence rate.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Path Planning and Trajectory Planning Algorithms: A General Overview

Swarm intelligence algorithms for multiple unmanned aerial vehicles collaboration: a comprehensive review

Article 26 September 2022

Game-theoretic multi-agent motion planning in a mixed environment

Article 15 March 2024

References

Bhat P et al (2006) Hierarchical motion planning for self-reconfigurable modular robots. In: 2006 IEEE/RSJ on international conference on intelligent robots and systems
Liu T et al (2009) The adaptive path planning research for a shape-shifting robot using particle swarm optimization. In: Fifth international conference on natural computation, 2009
Sun X et al (2015) A reconfiguration approach for self-reconfigurable modular robot using assisted modules. In: 2015 IEEE international conference on mechatronics and automation (ICMA)
Christensen D et al (2014) Collective modular underwater robotic system for long-term autonomous operation. Science 19(1):35–40
MathSciNet Google Scholar
Hancher MD, Hornby GS (2006) A modular robotic system with applications to space exploration. In: 2nd IEEE international conference on space mission challenges for information technology (SMC-IT’06)
Liu S et al (2012) A reconfigurable modular robot for detection and rescue. In: Lei J, Wang FL, Deng H, Miao D (eds) Emerging research in artificial intelligence and computational intelligence. Springer, 17–24
Tsai C-C, Huang H-C, Chan C-K (2011) Parallel elite genetic algorithm and its application to global path planning for autonomous robot navigation. IEEE Trans Ind Electron 58(10):4813–4821
Article Google Scholar
Soltani AR et al (2002) Path planning in construction sites: performance evaluation of the Dijkstra, A*, and GA search algorithms. Adv Eng Inform 16(4):291–303
Article Google Scholar
Naderan-Tahan M, Manzuri-Shalmani MT (2009) Efficient and safe path planning for a mobile robot using genetic algorithm. In: IEEE congress on evolutionary computation, 2009. CEC’09
Guanghua Z, Zhicheng D, Wei W (2006) Realization of a modular reconfigurable robot for rough terrain. In: Proceedings of the 2006 IEEE international conference on mechatronics and automation
Golestan K, Asadpour M, Moradi H (2013) A new graph signature calculation method based on power centrality for modular robots. In: Martinoli A, Mondada F, Correll N, Mermoud G, Egerstedt M, Hsieh MA, Parker LE, St\(\varnothing \)y K (eds) Distributed autonomous robotic systems. Springer, pp 505–516
Yim M et al (2007) Modular self-reconfigurable robot systems [grand challenges of robotics]. IEEE Robot Autom Mag 14(1):43–52
Article Google Scholar
Rus D, Vona M (2001) Crystalline robots: self-reconfiguration with compressible unit modules. Auton Robots 10(1):107–124
Article MATH Google Scholar
Jorgensen MW, Ostergaard EH, Lund HH (2004) Modular ATRON: modules for a self-reconfigurable robot. In: Proceedings of 2004 IEEE/RSJ international conference on intelligent robots and systems, 2004 (IROS 2004)
Murata S et al (2002) M-TRAN: self-reconfigurable modular robotic system. IEEE/ASME Trans Mechatron 7(4):431–441
Article Google Scholar
Castano A, Shen W-M, Will P (2000) CONRO: towards deployable robots with inter-robots metamorphic capabilities. Auton Robots 8(3):309–324
Article Google Scholar
Ryland GG, Cheng HH (2010) Design of iMobot, an intelligent reconfigurable mobile robot with novel locomotion. In: 2010 IEEE international conference on robotics and automation (ICRA)
Haghzad S, Bagheri S, Faraji S (2013) Finding proper configurations for modular robots by using genetic algorithm on different terrains. Int J Mater Mech Manuf 1:360–365
Google Scholar
Fetanat M, Haghzad S, Shouraki SB (2015) Optimization of dynamic mobile robot path planning based on evolutionary methods. In: IEEE AI & Robotics (IRANOPEN), 2015
Hu Y, Yang SX (2004) A knowledge based genetic algorithm for path planning of a mobile robot. In: Proceedings of 2004 IEEE international conference on robotics and automation, 2004. ICRA’04
Konar A et al (2013) A deterministic improved Q-learning for path planning of a mobile robot. IEEE Trans Syst Man Cybern Syst 43(5):1141–1153
Article Google Scholar
Wang H, Yu Y, Yuan Q (2011) Application of Dijkstra algorithm in robot path-planning. In: 2011 second international conference on mechanic automation and control engineering
Cai C, Ferrari S (2009) Information-driven sensor path planning by approximate cell decomposition. IEEE Trans Syst Man Cybern Part B Cybern 39(3):672–689
Article Google Scholar
Yu-qin W, Xiao-peng Y (2012) Research for the robot path planning control strategy based on the immune particle swarm optimization algorithm. In: 2012 second international conference on intelligent system design and engineering application (ISDEA)
Dong H et al (2010) The path planning for mobile robot based on Voronoi diagram. In: 2010 3rd international conference on intelligent networks and intelligent systems (ICINIS)
Janet JA, Luo RC, Kay MG (1995) The essential visibility graph: an approach to global motion planning for autonomous mobile robots. In: Proceedings of 1995 IEEE international conference on robotics and automation, 1995
Xin D, Hua-hua C, Wei-kang G (2005) Neural network and genetic algorithm based global path planning in a static environment. J Zhejiang Univ Sci A 6(6):549–554
Google Scholar
Rimon E, Koditschek DE (1992) Exact robot navigation using artificial potential functions. IEEE Trans Robot Autom 8(5):501–518
Article Google Scholar
Ge SS, Cui YJ (2000) New potential functions for mobile robot path planning. IEEE Trans Robot Autom 16(5):615–620
Article Google Scholar
Guan-Zheng T, Huan H, Sloman A (2007) Ant colony system algorithm for real-time globally optimal path planning of mobile robots. Acta Autom Sin 33(3):279–285
Article MATH Google Scholar
Sariff NB, Buniyamin N (2009) Comparative study of genetic algorithm and ant colony optimization algorithm performances for robot path planning in global static environments of different complexities. In: CIRA
Brand M et al (2010) Ant colony optimization algorithm for robot path planning. In: 2010 international conference on computer design and applications (ICCDA)
Kala R et al (2009) Mobile robot navigation control in moving obstacle environment using genetic algorithm, artificial neural networks and A* algorithm. In: 2009 WRI world congress on computer science and information engineering
Zeng C, Zhang Q, Wei X (2011) Robotic global path-planning based modified genetic algorithm and A* algorithm. In: 2011 third international conference on measuring technology and mechatronics automation (ICMTMA)
Kang HI, Lee B, Kim K (2008) Path planning algorithm using the particle swarm optimization and the improved Dijkstra algorithm. In: Pacific-Asia workshop on computational intelligence and industrial application, 2008 (PACIIA’08)
Das P, Behera H, Panigrahi B (2015) Intelligent-based multi-robot path planning inspired by improved classical Q-learning and improved particle swarm optimization with perturbed velocity. Int J Eng Sci Technol 19:651–669
Li Y, Li C, Zhang Z (2006) Q-learning based method of adaptive path planning for mobile robot. In: 2006 IEEE international conference on information acquisition
Banerjee D, et al (2012) Path-planning of mobile agent using Q-learning and real-time communication in an unfavourable situation. In: 2012 world congress on information and communication technologies (WICT)
Li S, Xu X, Zuo L (2015) Dynamic path planning of a mobile robot with improved Q-learning algorithm. In: 2015 IEEE international conference on information and automation
Meng Y et al (2011) Cross-ball: a new morphogenetic self-reconfigurable modular robot. In: 2011 IEEE international conference on robotics and automation (ICRA)
Kuffner JJ, LaValle SM (2000) RRT-connect: an efficient approach to single-query path planning. In: Proceedings of IEEE international conference on robotics and automation, 2000 (ICRA’00)
Desaraju VR, How JP (2011) Decentralized path planning for multi-agent teams in complex environments using rapidly-exploring random trees. In: 2011 IEEE international conference on robotics and automation (ICRA)
Bai W, Xue B, Sun Y (2011) Research on path planning for soccer robot based on improved genenic algorithm. In: 2011 international conference on mechatronic science, electric engineering and computer (MEC)
Li Q et al (2006) An improved adaptive algorithm for controlling the probabilities of crossover and mutation based on a fuzzy control strategy. In: Sixth international conference on hybrid intelligent systems, 2006 (HIS’06)
Parlaktuna O, Sipahioğlu A, Yazici A (2007) A VRP-based route planning for a mobile robot group. Turk J Electr Eng Comput Sci 15(2):187–197
Google Scholar
Sun X, Druzdzel, MJ, Yuan C (2007) Dynamic weighting A* search-based MAP algorithm for bayesian networks. In: IJCAI
Rosyidi L, et al (2014) Timebase dynamic weight for Dijkstra Algorithm implementation in route planning software. In: 2014 international conference on intelligent green building and smart grid (IGBSG)
Bozdoğan AÖ, Yilmaz AE, Efe M (2010) Performance analysis of swarm optimization approaches for the generalized assignment problem in multi-target tracking applications. Turk J Electr Eng Comput Sci 18(6):1059–1078
Google Scholar
Chia S-H et al (2010) Ant colony system based mobile robot path planning. In: 2010 fourth international conference on genetic and evolutionary computing (ICGEC)
Dorigo M, Maniezzo V, Colorni A (1996) Ant system: optimization by a colony of cooperating agents. IEEE Trans Syst Man Cybern Part B (Cybern) 26(1):29–41
Article Google Scholar
Kaelbling LP, Littman ML, Moore AW (1996) Reinforcement learning: a survey. J Artif Intell Res 4:237–285
Sutton RS, Barto AG (1998) Reinforcement learning: an introduction. Bradford Book, Cambridge
Google Scholar
Watkins CJ, Dayan P (1992) Q-learning. Mach Learn 8(3–4):279–292
MATH Google Scholar
Huang B-Q, Cao G-Y, Guo M (2005) Reinforcement learning neural network to the problem of autonomous mobile robot obstacle avoidance. In: Proceedings of 2005 international conference on machine learning and cybernetics, 2005
Gao Q, et al (2006) An improved q-learning algorithm based on exploration region expansion strategy. In: Intelligent Control and Automation, 2006. WCICA 2006. The Sixth World Congress on. IEEE
Tokic M (2010) Adaptive \(\varepsilon \)-greedy exploration in reinforcement learning based on value differences. In: Annual conference on artificial intelligence. Springer
Sutton RS, Barto AG (1998) Introduction to reinforcement learning., vol 135. MIT Press, Cambridge
Google Scholar

Download references

Acknowledgements

The authors would like to thank Sina Maleki, Salman Faraji and Hossein Davari for their generous help in software design of the robot.

Author information

Authors and Affiliations

Sharif University of Technology, Tehran, Islamic Republic of Iran
Sajad Haghzad Klidbary, Saeed Bagheri Shouraki & Soroush Sheikhpour Kourabbaslou

Authors

Sajad Haghzad Klidbary
View author publications
You can also search for this author in PubMed Google Scholar
Saeed Bagheri Shouraki
View author publications
You can also search for this author in PubMed Google Scholar
Soroush Sheikhpour Kourabbaslou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sajad Haghzad Klidbary.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Haghzad Klidbary, S., Bagheri Shouraki, S. & Sheikhpour Kourabbaslou, S. Path planning of modular robots on various terrains using Q-learning versus optimization algorithms. Intel Serv Robotics 10, 121–136 (2017). https://doi.org/10.1007/s11370-017-0217-x

Download citation

Received: 11 August 2016
Accepted: 10 January 2017
Published: 24 January 2017
Issue Date: April 2017
DOI: https://doi.org/10.1007/s11370-017-0217-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Path planning of modular robots on various terrains using Q-learning versus optimization algorithms

Abstract

Access this article

Similar content being viewed by others

Path Planning and Trajectory Planning Algorithms: A General Overview

Swarm intelligence algorithms for multiple unmanned aerial vehicles collaboration: a comprehensive review

Game-theoretic multi-agent motion planning in a mixed environment

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Path planning of modular robots on various terrains using Q-learning versus optimization algorithms

Abstract

Access this article

Similar content being viewed by others

Path Planning and Trajectory Planning Algorithms: A General Overview

Swarm intelligence algorithms for multiple unmanned aerial vehicles collaboration: a comprehensive review

Game-theoretic multi-agent motion planning in a mixed environment

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation