Abstract
Deep neural networks are fragile: they are easily fooled by inputs with deliberately crafted perturbations, which is a key concern for image security. Given a trained neural network, we would like to know whether it has actually learned the concept it was intended to learn, and whether it contains vulnerabilities that could be exploited by attackers. A tool that non-experts can use to test a trained neural network and search for such vulnerabilities would therefore be valuable. In this paper, we introduce AdverseGen, a tool for generating adversarial examples against a trained deep neural network using black-box approaches, i.e., without using any information about the network's architecture or gradients. Our tool provides customized adversarial attacks for both non-professional users and developers, and attacks can be launched through a graphical user interface or from the command line. Moreover, the tool supports different attack goals (targeted and non-targeted) and different distance metrics.
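To make the black-box, query-only setting concrete, below is a minimal illustrative sketch of a pixel-coordinate search attack in the spirit of SimBA (Guo et al., 2019). It is not AdverseGen's actual API: the function name `simple_blackbox_attack`, its parameters, and the toy model in the demo are assumptions for illustration only. The classifier is treated purely as a query oracle (`predict_fn`), with no access to architecture or gradients.

```python
# Illustrative sketch only -- not AdverseGen's implementation or API.
# A query-only attack: perturb one pixel coordinate at a time by +/-epsilon and
# keep the step only if it lowers the true-class probability (non-targeted) or
# raises the target-class probability (targeted).
import numpy as np

def simple_blackbox_attack(predict_fn, x, true_label, epsilon=0.05,
                           max_steps=1000, targeted=False, target_label=None):
    """predict_fn maps an image (NumPy array) to a probability vector.
    max_steps bounds the number of coordinates tried; each step issues at most
    two model queries."""
    if targeted and target_label is None:
        raise ValueError("targeted attack requires a target_label")
    obj_label = target_label if targeted else true_label
    x_adv = x.copy()
    p = predict_fn(x_adv)[obj_label]
    for idx in np.random.permutation(x.size)[:max_steps]:
        for sign in (+epsilon, -epsilon):
            candidate = x_adv.copy()
            candidate.flat[idx] = np.clip(candidate.flat[idx] + sign, 0.0, 1.0)
            p_new = predict_fn(candidate)[obj_label]
            improved = (p_new > p) if targeted else (p_new < p)
            if improved:
                x_adv, p = candidate, p_new
                break
    return x_adv

if __name__ == "__main__":
    # Toy stand-in "model": softmax over a random linear map of a 28x28 image.
    rng = np.random.default_rng(0)
    W = rng.normal(size=(10, 28 * 28))

    def toy_predict(img):
        logits = W @ img.ravel()
        e = np.exp(logits - logits.max())
        return e / e.sum()

    x0 = rng.uniform(0.0, 1.0, size=(28, 28))
    label = int(np.argmax(toy_predict(x0)))
    x_adv = simple_blackbox_attack(toy_predict, x0, label, epsilon=0.1, max_steps=500)
    print("original:", label, "adversarial:", int(np.argmax(toy_predict(x_adv))))
```

Because the loop only compares output probabilities before and after each candidate step, the same search covers both targeted and non-targeted goals by flipping the acceptance condition, which mirrors the attack-goal options described in the abstract.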
This work was supported by the Research Institute of Trustworthy Autonomous Systems, the Guangdong Provincial Key Laboratory (Grant No. 2020B121201001), the Program for Guangdong Introducing Innovative and Entrepreneurial Teams (Grant No. 2017ZT07X386) and the Shenzhen Science and Technology Program (Grant No. KQTD2016112514355531).
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, K., Wu, K., Chen, S., Zhao, Y., Yao, X. (2021). AdverseGen: A Practical Tool for Generating Adversarial Examples to Deep Neural Networks Using Black-Box Approaches. In: Bramer, M., Ellis, R. (eds.) Artificial Intelligence XXXVIII. SGAI-AI 2021. Lecture Notes in Computer Science, vol. 13101. Springer, Cham. https://doi.org/10.1007/978-3-030-91100-3_25
DOI: https://doi.org/10.1007/978-3-030-91100-3_25
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91099-0
Online ISBN: 978-3-030-91100-3
eBook Packages: Computer Science (R0)