Abstract:
The susceptibility of deep neural networks (DNNs) to adversarial attacks, exemplified by adversarial examples, is well-documented. Conventional attacks apply unstructured, pixel-wise perturbations to mislead classifiers, which often results in a noticeable departure from natural samples and lacks human-perceptible interpretability. In this work, we present an adversarial attack strategy that applies fine-grained, semantically meaningful structural perturbations. Our proposed methodology manipulates the semantic attributes of images through the use of disentangled latent codes. We craft adversarial perturbations by manipulating either a single latent code or a combination of codes. To this end, we propose two unsupervised semantic manipulation strategies: one based on vector-disentangled representation and the other on feature-map-disentangled representation, taking into account the complexity of the latent codes and the smoothness of the reconstructed images. Our extensive empirical evaluations on real-world image data demonstrate the effectiveness of our attacks, particularly against black-box classifiers. Furthermore, we establish the existence of a universal semantic adversarial example that is agnostic to specific images.
Published in: IEEE Transactions on Neural Networks and Learning Systems (Volume: 35, Issue: 12, December 2024)
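To make the idea of latent-code-based semantic perturbation concrete, the sketch below illustrates the general pattern the abstract describes: encode an image into disentangled latent codes, shift a single code, re-synthesize the image, and query a black-box classifier until its label flips. This is a minimal illustration, not the authors' method; the Encoder, Decoder, BlackBoxClassifier modules, and the semantic_attack routine are hypothetical stand-ins for pretrained models and the paper's actual manipulation strategies.

```python
# Minimal sketch of a semantic adversarial attack via a disentangled latent code.
# All models below are hypothetical placeholders, not the paper's implementation.
import torch
import torch.nn as nn

class Encoder(nn.Module):
    """Placeholder for a pretrained disentangled encoder (image -> latent codes)."""
    def __init__(self, latent_dim=16):
        super().__init__()
        self.net = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, latent_dim))
    def forward(self, x):
        return self.net(x)

class Decoder(nn.Module):
    """Placeholder for the matching decoder/generator (latent codes -> image)."""
    def __init__(self, latent_dim=16):
        super().__init__()
        self.net = nn.Linear(latent_dim, 3 * 64 * 64)
    def forward(self, z):
        return torch.sigmoid(self.net(z)).view(-1, 3, 64, 64)

class BlackBoxClassifier(nn.Module):
    """Target model; the attacker only observes its predicted labels."""
    def __init__(self, num_classes=10):
        super().__init__()
        self.net = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, num_classes))
    def forward(self, x):
        return self.net(x)

@torch.no_grad()
def semantic_attack(x, enc, dec, clf, code_idx=0, step=0.1, max_queries=50):
    """Shift one disentangled latent code until the black-box prediction changes."""
    y_orig = clf(x).argmax(dim=1)
    z = enc(x)
    for i in range(1, max_queries + 1):
        z_adv = z.clone()
        z_adv[:, code_idx] += i * step      # attribute-level, structural edit
        x_adv = dec(z_adv)                  # re-synthesize a natural-looking image
        if clf(x_adv).argmax(dim=1).item() != y_orig.item():
            return x_adv                    # label flipped: attack succeeded
    return None                             # no flip within the query budget

if __name__ == "__main__":
    torch.manual_seed(0)
    enc, dec, clf = Encoder(), Decoder(), BlackBoxClassifier()
    x = torch.rand(1, 3, 64, 64)            # stand-in for a real input image
    adv = semantic_attack(x, enc, dec, clf)
    print("adversarial example found" if adv is not None else "attack failed")
```

Because the search operates in the disentangled latent space rather than pixel space, each candidate is decoded back into a plausible image, which is the intuition behind the "semantically meaningful structural perturbations" described in the abstract.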