Deep Q-Learning for Navigation of Robotic Arm for Tokamak Inspection

Jain, Swati; Sharma, Priyanka; Bhoiwala, Jaina; Gupta, Sarthak; Dutta, Pramit; Gotewal, Krishan Kumar; Rastogi, Naveen; Raju, Daniel

doi:10.1007/978-3-030-05063-4_6

Swati Jain¹⁵,
Priyanka Sharma¹⁵,
Jaina Bhoiwala¹⁵,
Sarthak Gupta¹⁵,
Pramit Dutta¹⁶,
Krishan Kumar Gotewal¹⁶,
Naveen Rastogi¹⁶ &
…
Daniel Raju^16,17

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11337))

Included in the following conference series:

International Conference on Algorithms and Architectures for Parallel Processing

2104 Accesses
2 Citations

Abstract

Computerized human-machine interfaces are used to control the manipulators and robots for inspection and maintenance activities in Tokamak. The activities embrace routine and critical activities such as tile inspection, dust cleaning, equipment handling and replacement tasks. Camera(s) is deployed on the robotic arm which moves inside the chamber to accomplish the inspection task. For navigating the robotic arm to the desired position, an inverse kinematic solution is required. Such closed-form inverse kinematic solutions become complex in the case of dexterous hyper-redundant robotic arms that have high degrees of freedom and can be used for inspections in narrow gaps. To develop real-time inverse kinematic solver for robots, a technique called Reinforcement Learning is used. There are various strategies to solve Reinforcement problem in polynomial time, one of them is Q-Learning. It can handle problems with stochastic transitions and rewards, without requiring adaption or probabilities of actions to be taken at a certain point. It is observed that Deep Q-Network successfully learned optimal policies from high dimension sensory inputs using Reinforcement Learning.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Wang, H., Chen, W., Lai, Y., He, T.: Trajectory planning of tokamak flexible in-vessel inspection robot. Fusion Eng. Des. 98–99, 1678–1682 (2015)
Article Google Scholar
Vijayakumari, D., Dhivya, K.: Conceptual framework of robot with nanowire sensor in nuclear reactor. Int. J. Inf. Futur. Res. 1(11), 146–151 (2014)
Google Scholar
Hyper-Redundant Robotics Research. http://robby.caltech.edu/~jwb/hyper.html. Accessed 15 02 2018
Dutta, P., Gotewal, K.K., Rastogi, N., Tiwari, R.: A hyper-redundant robot development for tokamak inspection. In: AIR 2017, p. 6 (2017)
Google Scholar
Wang, H., Xu, L., Chen, W.: Design and implementation of visual inspection system handed in tokamak flexible in-vessel robot. Fusion Eng. Des. 106, 21–28 (2016)
Article Google Scholar
Andrew, G., Gryniewski, M., Campbell, T.: AARM: a robot arm for internal operations in nuclear reactors. In: 2010 1st International Conference on Applied Robotics for the Power Industry, CARPI, pp. 1–5 (2010)
Google Scholar
Peng, X., Yuan, J., Zhang, W., Yang, Y., Song, Y.: Kinematic and dynamic analysis of a serial-link robot for inspection process in EAST vacuum vessel. Fusion Eng. Des. 87(5), 905–909 (2012)
Article Google Scholar
Liu, J., Wang, Y., Li, B., Ma, S.: Neural network based kinematic control of the hyper-redundant snake-like manipulator. In: Advances in Neural Networks – ISNN 2007, vol. 4491, pp. 339–348, April 2015
Google Scholar
Liu, J., Wang, Y., Ma, S., Li, B.: RBF neural network based shape control of hyper-redundant manipulator with constrained end-effector. In: Wang, J., Yi, Z., Zurada, Jacek M., Lu, B.-L., Yin, H. (eds.) ISNN 2006. LNCS, vol. 3972, pp. 1146–1152. Springer, Heidelberg (2006). https://doi.org/10.1007/11760023_168
Chapter Google Scholar
James, S., Johns, E.: 3D Simulation for Robot Arm Control with Deep Q-Learning, p. 6 (2016)
Google Scholar
Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518, 529–533 (2015)
Article Google Scholar

Download references

Acknowledgment

This work is conducted at Nirma University, Ahmedabad underfunded research project by the Board of Research in Nuclear Sciences under Department of Atomic Energy.

Author information

Authors and Affiliations

Department of Computer Engineering, Institute of Technology, Nirma University, Ahmedabad, Gujarat, India
Swati Jain, Priyanka Sharma, Jaina Bhoiwala & Sarthak Gupta
Institute of Plasma Research, Bhat, Gandhinagar, Gujarat, India
Pramit Dutta, Krishan Kumar Gotewal, Naveen Rastogi & Daniel Raju
Homi Bhabha National Institute, Mumbai, Maharashtra, India
Daniel Raju

Authors

Swati Jain
View author publications
You can also search for this author in PubMed Google Scholar
Priyanka Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Jaina Bhoiwala
View author publications
You can also search for this author in PubMed Google Scholar
Sarthak Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Pramit Dutta
View author publications
You can also search for this author in PubMed Google Scholar
Krishan Kumar Gotewal
View author publications
You can also search for this author in PubMed Google Scholar
Naveen Rastogi
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Raju
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Swati Jain , Priyanka Sharma or Jaina Bhoiwala .

Editor information

Editors and Affiliations

Rutgers University, Newark, NJ, USA
Jaideep Vaidya
Guangzhou University, Guangzhou, China
Jin Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jain, S. et al. (2018). Deep Q-Learning for Navigation of Robotic Arm for Tokamak Inspection. In: Vaidya, J., Li, J. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2018. Lecture Notes in Computer Science(), vol 11337. Springer, Cham. https://doi.org/10.1007/978-3-030-05063-4_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-05063-4_6
Published: 07 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05062-7
Online ISBN: 978-3-030-05063-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics