
Poisonous Label Attack: Black-Box Data Poisoning Attack with Enhanced Conditional DCGAN


Abstract

Data poisoning is identified as a security threat for machine learning models. This paper explores the poisoning attack against the convolutional neural network under black-box conditions. The proposed attack is “black-box,” which means the attacker has no knowledge about the targeted model’s structure and parameters when attacking the model, and it uses “poisonous-labels” images, fake images with crafted wrong labels, as poisons. We present a method for generating “poisonous-label” images that use Enhanced Conditional DCGAN (EC-DCGAN) to synthesizes fake images and uses asymmetric poisoning vectors to mislabel them. We evaluate our method by generating “poisonous-label” images from MNIST and FashionMNIST datasets and using them to manipulate image classifiers. Our experiments demonstrate that, similarly to white box data poisoning attacks, the poisonous label attack can also dramatically increase the classification error.
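Illustration (not the authors' released code; the generator interface, the function names, and the NumPy-based data handling are our assumptions): the attack described above amounts to synthesizing fake images of a target class with a conditional generator, mislabeling them via a poisoning vector, and appending the result to the victim's training data. A minimal sketch:

    # Minimal sketch of the poisonous-label attack pipeline (illustrative only;
    # `generator` stands in for a trained conditional generator such as EC-DCGAN,
    # and its call signature is an assumption).
    import numpy as np

    def craft_poisons(generator, target_class, n_poisons, poison_vector, latent_dim=100):
        """Synthesize fake images of `target_class` and assign them crafted wrong labels.

        poison_vector: length-C array; poison_vector[c] is the probability of labeling
        a fake image of the target class as class c.
        """
        z = np.random.randn(n_poisons, latent_dim)          # latent noise
        fake_images = generator(z, target_class)            # conditional synthesis (assumed API)
        poison_labels = np.random.choice(len(poison_vector), size=n_poisons, p=poison_vector)
        return fake_images, poison_labels

    def inject(train_x, train_y, fake_images, poison_labels):
        """Append the poisonous-label samples to the victim's training set
        (black-box: no access to the victim model is required)."""
        return np.concatenate([train_x, fake_images]), np.concatenate([train_y, poison_labels])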



Availability of data and material

Not applicable.

References

  1. Aghakhani H, Meng D, Wang Y, Kruegel C, Vigna G (2020) Bullseye polytope: A scalable clean-label poisoning attack with improved transferability. CoRR arXiv:2005.00191

  2. Awasthi P, Balcan M, Long PM (2017) The power of localization for efficiently learning linear separators with noise. J. ACM 63(6):50:1-50:27. https://doi.org/10.1145/3006384


  3. Barreno M, Nelson B, Sears R, Joseph AD, Tygar JD (2006) Can machine learning be secure? In: Lin F, Lee D, Lin BP, Shieh S, Jajodia S (eds) Proceedings of the 2006 ACM symposium on information, computer and communications security, ASIACCS 2006, Taipei, Taiwan, March 21-24, 2006, pp 16–25. ACM https://doi.org/10.1145/1128817.1128824

  4. Bshouty NH, Eiron N, Kushilevitz E (2002) PAC learning with nasty noise. Theor. Comput. Sci. 288(2):255–275. https://doi.org/10.1016/S0304-3975(01)00403-0


  5. Charikar M, Steinhardt J, Valiant G (2017) Learning from untrusted data. In: Hatami H, McKenzie P, King V (eds) Proceedings of the 49th annual ACM SIGACT symposium on theory of computing, STOC 2017, Montreal, QC, Canada, June 19-23, 2017, pp 47–60. ACM https://doi.org/10.1145/3055399.3055491

  6. Chen B, Carvalho W, Baracaldo N, Ludwig H, Edwards B, Lee T, Molloy I, Srivastava B (2019) Detecting backdoor attacks on deep neural networks by activation clustering. In: CEUR Workshop Proceedings, vol 2301. http://ceur-ws.org/Vol-2301/paper_18.pdf

  7. Chen C, Seff A, Kornhauser AL, Xiao J (2015) Deepdriving: Learning affordance for direct perception in autonomous driving. In: 2015 IEEE international conference on computer vision, ICCV 2015, Santiago, Chile, December 7–13, 2015, pp 2722–2730. IEEE Computer Society https://doi.org/10.1109/ICCV.2015.312

  8. Chen P, Liao B, Chen G, Zhang S (2019) Understanding and utilizing deep neural networks trained with noisy labels. In: Chaudhuri K, Salakhutdinov R (eds) Proceedings of the 36th international conference on machine learning, ICML 2019, 9–15 June 2019, Long Beach, California, USA, Proceedings of Machine Learning Research, vol 97, pp 1062–1070. PMLR http://proceedings.mlr.press/v97/chen19g.html

  9. Chen X, Liu C, Li B, Lu K, Song D (2017) Targeted backdoor attacks on deep learning systems using data poisoning. CoRR arXiv:1712.05526

  10. Diakonikolas I, Kamath G, Kane DM, Li J, Moitra A, Stewart A (2016) Robust estimators in high dimensions without the computational intractability. In: Dinur I (ed) IEEE 57th annual symposium on foundations of computer science, FOCS 2016, 9–11 October 2016, Hyatt Regency, New Brunswick, New Jersey, USA, pp 655–664. IEEE Computer Society https://doi.org/10.1109/FOCS.2016.85

  11. Diakonikolas I, Kamath G, Kane DM, Li J, Steinhardt J, Stewart A (2018) Sever: a robust meta-algorithm for stochastic optimization. CoRR arXiv:1803.02815

  12. Frénay B, Verleysen M (2014) Classification in the presence of label noise: a survey. IEEE Trans Neural Netw Learn Syst 25(5):845–869. https://doi.org/10.1109/TNNLS.2013.2292894


  13. Ghosh A, Kumar H, Sastry PS (2017) Robust loss functions under label noise for deep neural networks. In: Singh SP, Markovitch S (eds) Proceedings of the thirty-first AAAI conference on artificial intelligence, February 4–9, 2017, San Francisco, California, USA, pp 1919–1925. AAAI Press http://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14759

  14. Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville AC, Bengio Y (2014) Generative adversarial nets. In: Ghahramani Z, Welling M, Cortes C, Lawrence ND, Weinberger KQ (eds) Advances in neural information processing systems 27: annual conference on neural information processing systems 2014, December 8–13 2014, Montreal, Quebec, Canada, pp 2672–2680 http://papers.nips.cc/paper/5423-generative-adversarial-nets

  15. Goodfellow IJ, Shlens J, Szegedy C (2015) Explaining and harnessing adversarial examples. In: Bengio Y, LeCun Y (eds) 3rd international conference on learning representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, conference track proceedings arXiv:1412.6572

  16. Huang WR, Geiping J, Fowl L, Taylor G, Goldstein T (2020) Metapoison: practical general-purpose clean-label data poisoning. In: Larochelle H, Ranzato M, Hadsell R, Balcan M, Lin H (eds) Advances in neural information processing systems 33: annual conference on neural information processing systems 2020, NeurIPS 2020, December 6–12, 2020, virtual https://proceedings.neurips.cc/paper/2020/hash/8ce6fc704072e351679ac97d4a985574-Abstract.html

  17. Isola P, Zhu J, Zhou T, Efros AA (2017) Image-to-image translation with conditional adversarial networks. In: 2017 IEEE conference on computer vision and pattern recognition, CVPR 2017, Honolulu, HI, USA, July 21–26, 2017, pp 5967–5976. IEEE Computer Society https://doi.org/10.1109/CVPR.2017.632

  18. Kearns MJ, Li M (1993) Learning in the presence of malicious errors. SIAM J Comput 22(4):807–837. https://doi.org/10.1137/0222052


  19. Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. In: Bengio Y, LeCun Y (eds) 3rd international conference on learning representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, conference track proceedings arXiv:1412.6980

  20. Koh PW, Liang P (2017) Understanding black-box predictions via influence functions. In: Precup D, Teh YW (eds) Proceedings of the 34th international conference on machine learning, ICML 2017, Sydney, NSW, Australia, 6–11 August 2017, Proceedings of Machine Learning Research, vol 70, pp. 1885–1894. PMLR http://proceedings.mlr.press/v70/koh17a.html

  21. Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90. https://doi.org/10.1145/3065386


  22. LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324


  23. Lee C, Gallagher PW, Tu Z (2016) Generalizing pooling functions in convolutional neural networks: mixed, gated, and tree. In: Gretton A, Robert CC (eds) Proceedings of the 19th international conference on artificial intelligence and statistics, AISTATS 2016, Cadiz, Spain, May 9–11, 2016, JMLR Workshop and Conference Proceedings, vol 51, pp 464–472. JMLR.org http://proceedings.mlr.press/v51/lee16a.html

  24. Li W, Wang L, Li W, Agustsson E, Gool LV (2017) Webvision database: visual learning and understanding from web data. CoRR arXiv:1708.02862

  25. Mahloujifar S, Diochnos DI, Mahmoody M (2018) Learning under \(p\)-tampering attacks. In: Janoos F, Mohri M, Sridharan K (eds) Algorithmic learning theory, ALT 2018, 7–9 April 2018, Lanzarote, Canary Islands, Spain, Proceedings of Machine Learning Research, vol 83, pp 572–596. PMLR http://proceedings.mlr.press/v83/mahloujifar18a.html

  26. Mahloujifar S, Diochnos DI, Mahmoody M (2019) The curse of concentration in robust learning: evasion and poisoning attacks from concentration of measure. In: The Thirty-Third AAAI conference on artificial intelligence, AAAI 2019, the thirty-first innovative applications of artificial intelligence conference, IAAI 2019, the ninth AAAI symposium on educational advances in artificial intelligence, EAAI 2019, Honolulu, Hawaii, USA, January 27–February 1, 2019, pp 4536–4543. AAAI Press https://doi.org/10.1609/aaai.v33i01.33014536

  27. Mohri M, Rostamizadeh A, Talwalkar A (2012) Foundations of machine learning. MIT Press http://mitpress.mit.edu/books/foundations-machine-learning-0

  28. Muñoz-González L, Biggio B, Demontis A, Paudice A, Wongrassamee V, Lupu EC, Roli F (2017) Towards poisoning of deep learning algorithms with back-gradient optimization. In: Thuraisingham BM, Biggio B, Freeman DM, Miller B, Sinha A (eds) Proceedings of the 10th ACM workshop on artificial intelligence and security, AISec@CCS 2017, Dallas, TX, USA, November 3, 2017, pp. 27–38. ACM https://doi.org/10.1145/3128572.3140451

  29. Nelson B, Barreno M, Chi FJ, Joseph AD, Rubinstein BI, Saini U, Sutton C, Tygar JD, Xia K (2009) Misleading learners: co-opting your spam filter. In: Machine learning in cyber trust. Springer, pp 17–51. https://doi.org/10.1007/978-0-387-88735-7

  30. Odena A, Olah C, Shlens J (2017) Conditional image synthesis with auxiliary classifier GANs. In: Proceedings of the 34th international conference on machine learning, ICML 2017, Proceedings of Machine Learning Research, vol 70, pp 2642–2651. PMLR http://proceedings.mlr.press/v70/odena17a.html

  31. Pathak D, Krähenbühl P, Donahue J, Darrell T, Efros, AA (2016) Context encoders: Feature learning by inpainting. In: 2016 IEEE conference on computer vision and pattern recognition, CVPR 2016, Las Vegas, NV, USA, June 27–30, 2016, pp 2536–2544. IEEE Computer Society https://doi.org/10.1109/CVPR.2016.278

  32. Radford A, Metz L, Chintala S (2016) Unsupervised representation learning with deep convolutional generative adversarial networks. In: Bengio Y, LeCun Y (eds) 4th international conference on learning representations, ICLR 2016, San Juan, Puerto Rico, May 2–4, 2016, conference track proceedings arXiv:1511.06434

  33. van Rooyen B, Menon AK, Williamson RC (2015) Learning with symmetric label noise: The importance of being unhinged. In: Cortes C, Lawrence ND, Lee DD, Sugiyama M, Garnett R (eds) Advances in neural information processing systems 28: annual conference on neural information processing systems 2015, December 7–12, 2015, Montreal, Quebec, Canada, pp. 10–18 http://papers.nips.cc/paper/5941-learning-with-symmetric-label-noise-the-importance-of-being-unhinged

  34. Scott C, Blanchard G, Handy G (2013) Classification with asymmetric label noise: Consistency and maximal denoising. In: Shalev-Shwartz S, Steinwart I (eds) COLT 2013—The 26th annual conference on learning theory, June 12-14, 2013, Princeton University, NJ, USA, JMLR Workshop and Conference Proceedings, vol 30, pp 489–511. JMLR.org http://proceedings.mlr.press/v30/Scott13.html

  35. Shafahi A, Huang WR, Najibi M, Suciu O, Studer C, Dumitras T, Goldstein T (2018) Poison frogs! targeted clean-label poisoning attacks on neural networks. In: Bengio S, Wallach HM, Larochelle H, Grauman K, Cesa-Bianchi N, Garnett R (eds) Advances in neural information processing systems 31: annual conference on neural information processing Systems 2018, NeurIPS 2018, December 3-8, 2018, Montréal, Canada, pp 6106–6116 http://papers.nips.cc/paper/7849-poison-frogs-targeted-clean-label-poisoning-attacks-on-neural-networks

  36. Shen S, Tople S, Saxena P (2016) Auror: defending against poisoning attacks in collaborative deep learning systems. In: Schwab S, Robertson WK, Balzarotti D (eds) Proceedings of the 32nd annual conference on computer security applications, ACSAC 2016, Los Angeles, CA, USA, December 5-9, 2016, pp 508–519. ACM http://dl.acm.org/citation.cfm?id=2991125

  37. Sloan RH (1995) Four types of noise in data for PAC learning. Inf Process Lett 54(3):157–162. https://doi.org/10.1016/0020-0190(95)00016-6


  38. Steinhardt J, Koh PW, Liang P (2017) Certified defenses for data poisoning attacks. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Advances in neural information processing systems 30: annual conference on neural information processing systems 2017, December 4–9, 2017, Long Beach, CA, USA, pp. 3517–3529 http://papers.nips.cc/paper/6943-certified-defenses-for-data-poisoning-attacks

  39. Sukhbaatar S, Bruna J, Paluri M, Bourdev L, Fergus R (2015) Training convolutional networks with noisy labels. CoRR arXiv:1406.2080

  40. Szegedy C, Zaremba W, Sutskever I, Bruna J, Erhan D, Goodfellow IJ, Fergus R (2014) Intriguing properties of neural networks. In: Bengio Y, LeCun Y (eds) 2nd international conference on learning representations, ICLR 2014, Banff, AB, Canada, April 14–16, 2014, Conference Track Proceedings arXiv:1312.6199

  41. Taheri R, Javidan R, Shojafar M, Pooranian Z, Miri A, Conti M (2020) On defending against label flipping attacks on malware detection systems. Neural Comput Appl 32(18):14781–14800. https://doi.org/10.1007/s00521-020-04831-9


  42. Valiant LG (1984) A theory of the learnable. Commun ACM 27(11):1134–1142. https://doi.org/10.1145/1968.1972


  43. Valiant LG (1985) Learning disjunctions of conjunctions. In: Joshi AK (ed) Proceedings of the 9th international joint conference on artificial intelligence, Los Angeles, CA, USA, August 1985. Morgan Kaufmann, pp 560–566. http://ijcai.org/Proceedings/85-1/Papers/107.pdf

  44. Weber M, Xu X, Karlas B, Zhang C, Li B (2020) RAB: provable robustness against backdoor attacks. CoRR arXiv:2003.08904

  45. Xiao H, Biggio B, Nelson B, Xiao H, Eckert C, Roli F (2015) Support vector machines under adversarial label contamination. Neurocomputing 160:53–62. https://doi.org/10.1016/j.neucom.2014.08.081


  46. Xiao H, Rasul K, Vollgraf R (2017) Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. CoRR arXiv:1708.07747

  47. Yang C, Wu Q, Li H, Chen Y (2017) Generative poisoning attack method against neural networks. CoRR arXiv:1703.01340

  48. Zhu C, Huang WR, Li H, Taylor G, Studer C, Goldstein T(2019) Transferable clean-label poisoning attacks on deep neural nets. In: Chaudhuri K, Salakhutdinov R (eds) Proceedings of the 36th international conference on machine learning, ICML 2019, 9–15 June 2019, Long Beach, California, USA, Proceedings of Machine Learning Research, vol 97, pp 7614–7623. PMLR http://proceedings.mlr.press/v97/zhu19a.html


Acknowledgements

We thank Chen Hui from North China Electric Power University for his great contribution to the revision of this paper, including supplementary experiments and typesetting.

Funding

This work is supported by the Fundamental Research Funds for the Central Universities (2020YJ003).

Author information

Corresponding author

Correspondence to Yuancheng Li.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Code availability

https://gitee.com/lidaoxing/poisonous-label-attack.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Proof of Proposition 1

Preliminaries. Before the poisoning samples are injected into the training dataset \(D_{train}^*\), let \(P^*(y_r \vert x_r)\) denote the conditional probability that the sample \(x_r\) is correctly labeled as \(y_r\), and let \(P_{\varphi }^*(y_r \vert x_r, \omega )\) denote the predicted conditional probability that the sample \(x_r\) is classified as \(y_r\) by the network \(\varphi \) parameterized by \(\omega \). After the poisoning samples are injected into the training dataset (\(D_{train}^*\) becomes \(D_{train}\)), let \(P(y_r \vert x_r)\) denote the conditional probability that the sample \(x_r\) is correctly labeled as \(y_r\), and let \(P_{\varphi }(y_r \vert x_r, {\hat{\omega }})\) denote the predicted conditional probability that the sample \(x_r\) is classified as \(y_r\) by the network \(\varphi \) parameterized by \({\hat{\omega }}\).

Conjecture 1

After injecting poisoning samples that satisfy \(\exists y_p\ne y_r\) into a training dataset, the conditional probability that the sample \(x_r\) is correctly labeled as \(y_r\) decreases; specifically,

$$\begin{aligned} P^{*}\left( y_{r} | x_{r}\right) -\mathrm {P}\left( y_{r} | x_{r}\right) >a \end{aligned}$$
(12)

where \(0<a<1\) is a constant.

Conjecture 1 holds in most situations; for example, if we randomly label the samples in a test dataset and add them to a training dataset, the probability of drawing a wrongly labeled sample increases. The constant a in Conjecture 1 can be formulated as follows:

$$\begin{aligned} \mathrm {a}=\frac{\alpha \beta R}{p\left( x_{r}\right) +\alpha \beta R} \end{aligned}$$
(13)

where \(p(x_r)\) is the probability that the sample \(x_r\) appears in the training dataset, \(\alpha \) is a coefficient determined by the global coherence of the fake images \(x_p\), \(\beta \) is a coefficient determined by the poisoning matrix H, and R is the percentage of poisoning samples in the training dataset. From the perspective of probability theory, the coefficients can be formulated as follows:

$$\begin{aligned}&\alpha =p\left( x_{r} | x_{p}\right) \end{aligned}$$
(14)
$$\begin{aligned}&\beta =1-\mathrm {p}\left( y_{r} | y_{p}\right) \end{aligned}$$
(15)
$$\begin{aligned}&\mathrm {R}=\mathrm {p}\left( x_{p}\right) \end{aligned}$$
(16)
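As a hedged numerical illustration of Eq. (13) (all values below are assumptions chosen for the example, not measurements reported in the paper), the constant a can be computed directly once \(p(x_r)\), \(\alpha \), \(\beta \) and R are fixed:

    # Worked example of Eq. (13): a = alpha * beta * R / (p_xr + alpha * beta * R).
    # Every number below is an illustrative assumption.
    p_xr = 0.10    # probability that x_r appears in the training dataset
    alpha = 0.90   # global coherence of the fake images, Eq. (14)
    beta = 0.90    # 1 - p(y_r | y_p), determined by the poisoning matrix H, Eq. (15)
    R = 0.05       # fraction of poisoning samples in the training dataset, Eq. (16)

    a = alpha * beta * R / (p_xr + alpha * beta * R)
    print(round(a, 3))  # ~0.288: the lower bound on the drop in P(y_r | x_r) posited by Conjecture 1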

The learning ability of the network should also be considered when proving the feasibility of poisonous label attacks. In the following Lemma 1, we formulate the learning error of the network \(\varphi \).

Lemma 1

If the network \(\varphi \) is a PAC-learning algorithm and the training dataset and the test dataset have the same probability distribution, then after training for T epochs the learning error \(K^*\) of the network \(\varphi \) on the training dataset \(D_{train}^*\) satisfies

$$\begin{aligned} \lim _{T \rightarrow \infty } K^{*}=\left| P^{*}\left( y_{r} | x_{r}\right) -P_{\varphi }^{*}\left( y_{r} | x_{r}, \omega \right) \right| =0 \end{aligned}$$
(17)

and the learning error K of the network \(\varphi \) on the training dataset \(D_{train}\) satisfies

$$\begin{aligned} \lim _{T \rightarrow \infty } K=\left| \mathrm {P}\left( y_{r} | x_{r}\right) -P_{\varphi }\left( y_{r} | x_{r}, {\widehat{\omega }}\right) \right| =0 \end{aligned}$$
(18)

Proof

This is a corollary of the Probably Approximately Correct (PAC) learning framework [42]. \(\square \)

Lemma 1 reveals that if the network is good enough, then after training for an unbounded number of epochs it can learn the conditional probability distribution perfectly; both assumptions approximately hold when the network is a deep neural network.

Proposition 2

If the network \(\varphi \) is strongly learnable and the training dataset and the test dataset have the same probability distribution, then after injecting poisoning samples that satisfy \(\exists y_p\ne y_r \) into the training dataset and training for T epochs, we have

$$\begin{aligned} \lim _{T \rightarrow \infty } P_{\varphi }^*(y_r\vert x_r,\omega )-P_{\varphi }(y_r\vert x_r,{\hat{\omega }})>a \end{aligned}$$
(19)

Proof

Based on the definitions of the learning errors \(K^*\) and K, we have

$$\begin{aligned} P_{\varphi }^{*}\left( y_{r} | x_{r}, \omega \right) -P_{\varphi }\left( y_{r} | x_{r}, {\widehat{\omega }}\right) =P^{*}\left( y_{r} | x_{r}\right) -\mathrm {P}\left( y_{r} | x_{r}\right) \pm K \pm K^{*} \end{aligned}$$
(20)

Hence, combining (20) with the inequality of Conjecture 1, we obtain

$$\begin{aligned} P_{\varphi }^{*}\left( y_{r} | x_{r}, \omega \right) -P_{\varphi }\left( y_{r} | x_{r}, {\widehat{\omega }}\right) >a-K-K^{*} \end{aligned}$$
(21)

By Lemma 1, \(K\rightarrow 0\) and \(K^{*}\rightarrow 0\) as \(T\rightarrow \infty \), so inequality (19) follows:

$$\begin{aligned} \lim _{T \rightarrow \infty } P_{\varphi }^{*}\left( y_{r} | x_{r}, \omega \right) -P_{\varphi }\left( y_{r} | x_{r}, {\widehat{\omega }}\right) >a \end{aligned}$$
(22)

\(\square \)

Proposition 1 indicates that the test accuracy will decrease by a after poisoning samples are injected into a dataset. However, our experiments show that Proposition 1 rarely holds when the poisoning samples are fake images generated by a conditional GAN (cGAN) and labeled with random noise. Two reasons may lead to this result: (1) when the fake images do not exhibit global coherence with the images in the training dataset, injecting them changes the probability distribution of the training dataset, so the assumption that the test dataset and the training dataset share the same distribution no longer holds; (2) deep neural networks are robust to random label noise [11]. Therefore, in Sects. 4.1 and 4.2 we introduce two key methods: (1) Enhanced Conditional DCGAN (EC-DCGAN), which synthesizes fake images exhibiting global coherence, and (2) the asymmetric poisoning vector, which generates poisonous labels that are more harmful to deep neural networks than random label noise (the symmetric poisoning vector).
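The difference between the two kinds of poisoning vector can be made concrete with the following sketch (our own illustration: it assumes each poisoning vector is a categorical distribution over labels, i.e., a row of the poisoning matrix H, and it does not model the paper's additional parameter h):

    import numpy as np

    def symmetric_poison_vector(num_classes, target_class, beta):
        """Symmetric (random-noise) vector: wrong-label mass beta spread uniformly
        over the non-target classes; the remaining mass stays on the true label."""
        v = np.full(num_classes, beta / (num_classes - 1))
        v[target_class] = 1.0 - beta
        return v

    def asymmetric_poison_vector(num_classes, target_class, adversarial_class, beta):
        """Asymmetric vector: wrong-label mass beta concentrated on one chosen class."""
        v = np.zeros(num_classes)
        v[adversarial_class] = beta
        v[target_class] = 1.0 - beta
        return v

    def poison_labels(n, vector):
        """Sample n poisonous labels from a poisoning vector."""
        return np.random.choice(len(vector), size=n, p=vector)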

Fig. 7

Test accuracy w.r.t. the optimization goals. a Test accuracy w.r.t. the loss function goal. b Test accuracy w.r.t. the probability goal. A positive correlation is found between the probability goal and test accuracy

Fig. 8

a Test accuracy curves of symmetric poisonous label attack using different image synthesis models as a generator. b Test accuracy curves of asymmetric poisonous label attack using different image synthesis models as a generator

Fig. 9

Test accuracy curves of the symmetric poisonous label attack and the asymmetric poisonous label attack using EC-DCGAN as the generator. Test accuracy declines sharply in the asymmetric poisonous label attack

Appendix B: Experiment Results About the Parameters of Poisonous Label Attack

1.1 Experiment Evaluation of Changing Optimization Goal

The results of the correlational analysis of optimization goals and test accuracy are shown in Fig. 7. As shown in Fig. 7a, no significant correlation was found between the loss function and test accuracy: the test accuracy does not necessarily decrease when the loss value increases. However, a positive correlation was found between test accuracy and the probability \(P_{\varphi } (y_r \vert x_r,{\hat{\omega }})\); Fig. 7b reveals a marked increase in test accuracy as the probability \(P_{\varphi } (y_r \vert x_r,{\hat{\omega }})\) increases. Comparing the two results, the probability \(P_{\varphi } (y_r \vert x_r,{\hat{\omega }})\) is therefore more suitable than the loss value \(L(D_r,{\hat{\omega }})\) as the optimization goal.
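In practice this suggests that the attacker should monitor the predicted probability of the true label on a clean probe set of the target class (on a surrogate model, since the attack is black-box) rather than the loss. A rough sketch, with the function names and the predict-probability interface assumed:

    import numpy as np

    def probability_goal(predict_proba, probe_images, true_label):
        """Average predicted probability P_phi(y_r | x_r) of the true label over a
        clean probe set of the target class; the attack aims to drive this down."""
        probs = predict_proba(probe_images)      # (n, num_classes) array, assumed interface
        return float(np.mean(probs[:, true_label]))

    def keep_poisoning(predict_proba, probe_images, true_label, threshold=0.5):
        """Continue injecting poisons until the probability goal drops below `threshold`."""
        return probability_goal(predict_proba, probe_images, true_label) >= threshold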

1.2 Importance of Image Synthesis Model

To examine the impact of image synthesis models, we used cGAN (\({\overline{\alpha }}=0.362\)), cDCGAN (\({\overline{\alpha }}=0.509\)), EC-DCGAN (\({\overline{\alpha }}=0.926\)), and ACGAN (\({\overline{\alpha }}=0.923\)) to generate fake images for the poisonous label attack. The target class is 0. Figure 8a shows the test accuracy of the target class as the symmetric poisonous label attack proceeds: the test accuracy decreases sharply when EC-DCGAN or ACGAN is used as the generator. Figure 8b shows the test accuracy of the target class as the asymmetric poisonous label attack proceeds: the test accuracy declines fastest when EC-DCGAN is used as the generator. Overall, these results indicate that the global coherence of the fake images directly influences the effect of the poisonous label attack, and that EC-DCGAN is the most suitable generator for it.

1.3 Comparison of Two Kinds of Poisoning Vector

To examine the impact of poisoning vectors, we used EC-DCGAN as the generator and used a symmetric poisoning vector (\(\beta =0.9,h=1\)) and an asymmetric poisoning vector (\(\beta =1,h=0.913\)) to generate poisonous labels. The target class is also 0. Figure 9 shows the test accuracy of the target class as the attack proceeds. What stands out in the figure is that the accuracy under the asymmetric poisoning vector (red line) degrades much faster than under the symmetric poisoning vector (blue line). The result indicates that the asymmetric poisoning vector is more effective than the symmetric poisoning vector. It also demonstrates that poisonous labels generated by the asymmetric poisoning vector are more threatening to deep neural networks than random noise labels.
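Using the poisoning-vector sketch from Appendix A above, the two settings compared here could be instantiated roughly as follows (an illustration only; the adversarial class and sample count are arbitrary choices, and the parameter h is again not modeled):

    # Rough instantiation for a 10-class dataset such as MNIST, target class 0.
    num_classes, target = 10, 0
    sym = symmetric_poison_vector(num_classes, target, beta=0.9)    # near-uniform random labels
    asym = asymmetric_poison_vector(num_classes, target, adversarial_class=8, beta=1.0)
    labels_sym = poison_labels(1000, sym)     # mass spread over classes 1..9, a little on 0
    labels_asym = poison_labels(1000, asym)   # every poison labeled as the adversarial class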

Appendix C: Architecture of EC-DCGAN and the L1 Loss of Training EC-DCGAN

See Table 6 and Fig. 10.

Table 6 EC-DCGAN architecture
Fig. 10

The L1 loss of training EC-DCGAN. The L1 loss decreases at the beginning of training

About this article


Cite this article

Liu, H., Li, D. & Li, Y. Poisonous Label Attack: Black-Box Data Poisoning Attack with Enhanced Conditional DCGAN. Neural Process Lett 53, 4117–4142 (2021). https://doi.org/10.1007/s11063-021-10584-w
