Skip to main content

A Hybrid Deep Reinforcement Learning Algorithm for Intelligent Manipulation

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11743))

Abstract

Conventional collaborative robots can solve complex problems through programming approaches. But the current tasks are different and nonrepetitive, many problems cannot be solved by conventional programming methods. Deep reinforcement learning provides a framework for solving robotic control tasks using machine learning techniques. However, the existing model-free deep reinforcement learning algorithms lack unified framework for comparing sample efficiency with final performance. In this paper, a hybrid deep reinforcement learning framework and its application in robot control are proposed based on the existing model-free deep reinforcement learning algorithms. In the acting process, the distributed actors acting with the environment are used to acquire the data, while prior actors are used to solve the cold boot problem of the algorithm. In the learning process, prioritized experience replay and multi-step learning are designed for the improvement on the final performance. Simulations are represented to show the practicality and potential of the proposed algorithm. Results show that the hybrid deep reinforcement learning algorithm in this paper has a significant improvement on the final performance and sample efficiency while it can ensure the stability and convergence.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Vecerik, M., Hester, T., Scholz, J., et al.: Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards. arXiv preprint arXiv:1707.08817 (2017)

  2. Tai, L., Zhang, J., Liu, M., et al.: A survey of deep network solutions for learning control in robotics: from reinforcement to imitation. arXiv preprint arXiv:1612.07139 (2016)

  3. Barto, G., Sutton, S., Anderson, W.: Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Trans. Syst. Man Cybern. 5, 834–846 (1984)

    Google Scholar 

  4. Mnih, V., Kavukcuoglu, K., Silver, D., et al.: Playing Atari with deep reinforcement learning. In: Neural Information Processing Systems (2013)

    Google Scholar 

  5. Lillicrap, P., Hunt, J., Pritzel, A., et al.: Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 (2015)

  6. Mnih, V., Badia, P., Mirza, M., et al.: Asynchronous methods for deep reinforcement learning. arXiv preprint arXiv:1602.01783 (2016)

  7. Wu, Y., Mansimov, E., Grosse, B., et al.: Scalable trust-region method for deep reinforcement learning using kronecker-factored approximation. In: Neural Information Processing Systems, pp. 5279–5288 (2017)

    Google Scholar 

  8. Schulman, J., Levine, S., Abbeel, P., et al.: Trust region policy optimization. In: 32nd International Conference on Machine Learning (ICML 2015), pp. 1889–1897 (2015)

    Google Scholar 

  9. Schulman, J., Wolski, F., Dhariwal, P., et al.: Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)

  10. Goodfellow, I., Pouget, J., Mirza, M., et al.: Generative adversarial nets. In: Neural Information Processing Systems, pp. 2672–2680 (2014)

    Google Scholar 

  11. Dean, J., Corrado, G., Monga, R., et al.: Large scale distributed deep networks. In: 25th International Conference on Neural Information Processing Systems (2012)

    Google Scholar 

  12. Nair, A., Srinivasan, P., Blackwell, S., et al.: Massively parallel methods for deep reinforcement learning. arXiv preprint arXiv:1507.04296 (2015)

  13. Horgan, D., Quan, J., Budden, D., et al.: Distributed prioritized experience replay. arXiv preprint arXiv:1803.00933 (2018)

  14. Hessel, M., Modayil, J., Hasselt, H., et al.: Rainbow: combining improvements in deep reinforcement learning. arXiv preprint arXiv:1710.02298 (2017)

  15. Babaeizadeh, M., Frosio, I., Tyree, S., et al.: Reinforcement learning through asynchronous advantage actor-critic on a gpu. arXiv preprint arXiv:1611.06256 (2016)

  16. Google. https://github.com/openai/gym/tree/master/gym/envs/mujoco. Accessed 06 May 2019

  17. Li, J., Liu, L., Wang, Y., et al.: Adaptive hybrid impedance control of robot manipulators with robustness against environment’s uncertainties. In: 2015 IEEE International Conference on Mechatronics and Automation, pp. 1846–1851. IEEE (2015)

    Google Scholar 

Download references

Acknowledgment

This research was supported, in part, by the National Natural Science Foundation of China (No. 51875393) and by the China Advance Research for Manned Space Project (No. 030601).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yaobing Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ma, C., Li, J., Bai, J., Wang, Y., Liu, B., Sun, J. (2019). A Hybrid Deep Reinforcement Learning Algorithm for Intelligent Manipulation. In: Yu, H., Liu, J., Liu, L., Ju, Z., Liu, Y., Zhou, D. (eds) Intelligent Robotics and Applications. ICIRA 2019. Lecture Notes in Computer Science(), vol 11743. Springer, Cham. https://doi.org/10.1007/978-3-030-27538-9_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-27538-9_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-27537-2

  • Online ISBN: 978-3-030-27538-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics