
Agent manipulator: Stealthy strategy attacks on deep reinforcement learning

Published in Applied Intelligence.

Abstract

Deep reinforcement learning (DRL) is a primary machine learning approach for solving sequential decision problems. To exploit the potential vulnerabilities of DRL, we propose a poisoning attack that injects a backdoor into a DRL model by manipulating its training data with triggers. Existing attack methods are easily detected by defenders, and their interpretability and transferability have not yet been studied. To address these issues, we propose the agent manipulator, a stealthy targeted poisoning method. The agent manipulator generates stealthy poisoning examples and fine-tunes the model on them together with clean examples. It achieves state-of-the-art attack performance and, because its poisoned examples transfer across models, it is also the first black-box poisoning method for DRL. Experimental results show that, even with a single poisoning example, the poisoned model reaches a 60% trigger success rate for the target action. The effectiveness of the agent manipulator can be interpreted through heat-map visualization and the neuron coverage rate. In addition, the agent manipulator disrupts both the model's deep feature extraction and its execution of actions. We further verify that the agent manipulator evades existing defenses.
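The core mechanism described above — stamping a trigger pattern into a fraction of training observations and relabeling them with the attacker's target action before fine-tuning — can be sketched as follows. This is a minimal illustrative sketch, not the paper's exact method: the trigger pattern, its size and position, the poisoning rate, and the function names are all assumptions made here for illustration.

```python
import numpy as np

def stamp_trigger(obs, trigger_value=255, size=3):
    """Stamp a small square trigger patch into the top-left corner of an
    image observation. Pattern, size, and position are illustrative
    choices, not the paper's actual trigger design."""
    poisoned = obs.copy()
    poisoned[:size, :size] = trigger_value
    return poisoned

def poison_batch(states, actions, target_action, rate=0.1, rng=None):
    """Poison a fraction `rate` of (state, action) training pairs:
    stamp the trigger into the state and relabel the action to the
    attacker-chosen target. Returns poisoned copies; the originals
    are left untouched."""
    rng = rng or np.random.default_rng(0)
    states, actions = states.copy(), actions.copy()
    n_poison = max(1, int(rate * len(states)))
    idx = rng.choice(len(states), size=n_poison, replace=False)
    for i in idx:
        states[i] = stamp_trigger(states[i])
        actions[i] = target_action
    return states, actions
```

The poisoned batch would then be mixed with clean examples during fine-tuning so that the model behaves normally on clean observations but emits the target action whenever the trigger is present.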

Figures 1–12 appear in the full article.



Acknowledgment

This research was supported by the National Natural Science Foundation of China under Grant No. 62072406, the Natural Science Foundation of Zhejiang Province under Grant No. LY19F020025, the National Key Research and Development Program of China under Grant No. 2018AAA0100801, the Key R&D Projects of Zhejiang Province under Grant No. 2021C01117, the 2020 Industrial Internet Innovation Development Project under Grant No. TC200H01V, and the “Ten Thousand Talents Program” Science and Technology Innovation Leading Talent Project of Zhejiang Province under Grant No. 2020R52011.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Jinyin Chen or Liang Bao.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article


Cite this article

Chen, J., Wang, X., Zhang, Y. et al. Agent manipulator: Stealthy strategy attacks on deep reinforcement learning. Appl Intell 53, 12831–12858 (2023). https://doi.org/10.1007/s10489-022-03882-w

