Deep Q-Learning for Navigation of Robotic Arm for Tokamak Inspection

  • Conference paper
Algorithms and Architectures for Parallel Processing (ICA3PP 2018)

Abstract

Computerized human-machine interfaces are used to control manipulators and robots for inspection and maintenance activities in the Tokamak. These range from routine to critical tasks, such as tile inspection, dust cleaning, equipment handling and replacement. One or more cameras are mounted on a robotic arm that moves inside the chamber to carry out the inspection. Navigating the arm to a desired position requires an inverse kinematic solution. Closed-form inverse kinematic solutions become complex for dexterous hyper-redundant robotic arms, which have many degrees of freedom and can be used for inspection in narrow gaps. To develop a real-time inverse kinematic solver for such robots, Reinforcement Learning is used. Of the various strategies for solving a Reinforcement Learning problem in polynomial time, one is Q-Learning. It can handle problems with stochastic transitions and rewards without requiring adaptations or the probabilities of the actions to be taken at a given point. It is observed that a Deep Q-Network successfully learned optimal policies from high-dimensional sensory inputs using Reinforcement Learning.
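To make the idea of Q-Learning as an inverse kinematic solver concrete, the following is a minimal sketch and not the method of the paper: it uses a plain Q-table rather than a Deep Q-Network, and a planar two-link arm rather than a hyper-redundant one. The link lengths, joint range, discretization, reward of -1 per step, hyperparameters and all function names are assumptions chosen for illustration only.

```python
import math
import numpy as np

N = 8                                          # assumed number of angle bins per joint
ANGLES = np.linspace(0.0, math.pi, N)          # assumed joint range [0, pi]
L1, L2 = 1.0, 1.0                              # assumed link lengths
ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]   # move one joint by one bin


def forward_kinematics(i, j):
    """End-effector (x, y) of a planar two-link arm at joint bins (i, j)."""
    t1, t2 = ANGLES[i], ANGLES[j]
    x = L1 * math.cos(t1) + L2 * math.cos(t1 + t2)
    y = L1 * math.sin(t1) + L2 * math.sin(t1 + t2)
    return x, y


def step(state, action, target, tol=0.05):
    """Apply an action, clipped to the joint limits; return (next_state, reward, done)."""
    i = min(max(state[0] + ACTIONS[action][0], 0), N - 1)
    j = min(max(state[1] + ACTIONS[action][1], 0), N - 1)
    x, y = forward_kinematics(i, j)
    done = math.hypot(x - target[0], y - target[1]) < tol
    return (i, j), -1.0, done                  # constant step cost favours short paths


def train(target, episodes=5000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-Learning with an epsilon-greedy behaviour policy."""
    rng = np.random.default_rng(seed)
    q = np.zeros((N, N, len(ACTIONS)))
    for _ in range(episodes):
        state = (int(rng.integers(N)), int(rng.integers(N)))  # random start pose
        for _ in range(100):
            if rng.random() < epsilon:
                action = int(rng.integers(len(ACTIONS)))
            else:
                action = int(np.argmax(q[state[0], state[1]]))
            nxt, reward, done = step(state, action, target)
            # Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
            best_next = 0.0 if done else float(np.max(q[nxt[0], nxt[1]]))
            q[state[0], state[1], action] += alpha * (
                reward + gamma * best_next - q[state[0], state[1], action]
            )
            state = nxt
            if done:
                break
    return q


def greedy_rollout(q, start, target, max_steps=50):
    """Follow the learned greedy policy from start; return (path, reached)."""
    state, path = start, [start]
    for _ in range(max_steps):
        state, _, done = step(state, int(np.argmax(q[state[0], state[1]])), target)
        path.append(state)
        if done:
            return path, True
    return path, False


target = forward_kinematics(5, 2)              # a reachable end-effector position
q_table = train(target)
path, reached = greedy_rollout(q_table, (0, 0), target)
print(reached, path)
```

The learned table plays the role of the inverse kinematic solver: given a target position, following the greedy action at each joint configuration yields a sequence of joint moves toward it. A Deep Q-Network replaces the table with a neural network so that the same update can be applied to high-dimensional inputs such as camera images.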

Acknowledgment

This work was conducted at Nirma University, Ahmedabad, under a research project funded by the Board of Research in Nuclear Sciences under the Department of Atomic Energy.

Author information

Corresponding authors

Correspondence to Swati Jain, Priyanka Sharma or Jaina Bhoiwala.

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Jain, S. et al. (2018). Deep Q-Learning for Navigation of Robotic Arm for Tokamak Inspection. In: Vaidya, J., Li, J. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2018. Lecture Notes in Computer Science, vol 11337. Springer, Cham. https://doi.org/10.1007/978-3-030-05063-4_6

  • DOI: https://doi.org/10.1007/978-3-030-05063-4_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-05062-7

  • Online ISBN: 978-3-030-05063-4

  • eBook Packages: Computer Science, Computer Science (R0)
