Improving deep reinforcement learning by safety guarding model via hazardous experience planning

Peng, Pai; Zhu, Fei; Ling, Xinghong; Zhao, Peiyao; Liu, Quan

doi:10.1007/s11704-021-0250-y

Improving deep reinforcement learning by safety guarding model via hazardous experience planning

Letter
Published: 03 December 2021

Volume 16, article number 164320, (2022)
Cite this article

Frontiers of Computer Science Aims and scope Submit manuscript

Pai Peng¹,
Fei Zhu¹,
Xinghong Ling¹,
Peiyao Zhao¹ &
…
Quan Liu¹

46 Accesses
1 Citation
1 Altmetric
Explore all metrics

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Kai A, Deisenroth M P, Brundage M, Bharath A A. Deep reinforcement learning: a brief survey. IEEE Signal Processing Magazine, 2017, 34(6): 26–38
Article Google Scholar
Cheng R, Orosz G, Murray R M, Burdick J W. End-to-end safe reinforcement learning through barrier functions for safety-critical continuous control tasks. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2019, 3387–3395
Saunders W, Sastry G, Stuhlmueller A, Evans O. Trial without error: towards safe reinforcement learning via human intervention. In: Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems. 2018, 2067–2069
Achiam J, Held D, Tamar A, Abbeel P. Constrained policy optimization. In: Proceedings of the International Conference on Machine Learning. 2017, 22–31
García J, Fernández F. A comprehensive survey on safe reinforcement learning. Journal of Machine Learning Research, 2015, 16: 1437–1480
MathSciNet MATH Google Scholar
Chatzilygeroudis K, Vassiliades V, Mouret J B. Reset-free trial-and-error learning for robot damage recovery. Robotics and Autonomous Systems, 2018, 100: 236–250
Article Google Scholar
Zhu F, Wu W, Fu Y, Liu Q. A dual deep network based secure deep reinforcement learning method. Chinese Journal of Computers, 2019, 42(8): 1812–1826
Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Grant No. 61303108), Natural Science Foundation of Jiangsu Province (BK20211102), Suzhou Key Industries Technological Innovation-Prospective Applied Research Project (SYG 201804); A Project Funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions.

Author information

Authors and Affiliations

School of Computer Science and Technology, Soochow University, Suzhou, 215006, China
Pai Peng, Fei Zhu, Xinghong Ling, Peiyao Zhao & Quan Liu

Authors

Pai Peng
View author publications
Search author on:PubMed Google Scholar
Fei Zhu
View author publications
Search author on:PubMed Google Scholar
Xinghong Ling
View author publications
Search author on:PubMed Google Scholar
Peiyao Zhao
View author publications
Search author on:PubMed Google Scholar
Quan Liu
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Fei Zhu.

Additional information

Supporting information

The supporting information is available online at journal. hep. com. cn and link. springer. com

Electronic supplementary material