skip to main content
10.1145/3613330.3613339acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicdltConference Proceedingsconference-collections
research-article

STFGSM: Intelligent Image Classification Model Based on Swin Transformer and Fast Gradient Sign Method

Authors Info & Claims
Published:28 September 2023Publication History

ABSTRACT

The convolutional neural network is relied upon by the mainstream image classification model to be achieved, but the convolutional neural network itself has defects such as easy loss of data. At the same time, deep learning models are vulnerable to adversarial perturbations, resulting in a decline in model performance. In order to effectively solve the above problems, this paper presents STFGSM, an intelligent image classification model based on Swin Transformer and fast gradient sign method. The attention mechanism is utilized by the Swin Transformer to extract picture features, with the traditional convolution operation being replaced. The field of the image is enhanced and information loss is avoided by this. Furthermore, the anti-interference capability of the model is strengthened through adversarial training that uses adversarial samples generated via the fast gradient sign method algorithm. The experimental results show that the classification performance of STFGSM outperformed other mainstream image classification models, whose speed is faster and adaptability to adversarial samples is stronger. In the future, more complex adversarial training strategies can be introduced on the basis of the model or the model can be extended to tasks in other fields such as target detection and image generation.

References

  1. Fu Su, Qin Lv, and Renze Luo. Review of Image Classification Based on Deep Learning. Telecommunications Science 35, 11 (November 2019), 58-74. https://doi.org/10.11959/j.issn.1000-0801.2019268.Google ScholarGoogle ScholarCross RefCross Ref
  2. Y. LeCun, B. Boser, J. S. Denker, D.Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel. Backpropagation Applied to Handwritten Zip Code Recognition. Neural Computation 1, 4 (December 1989), 541-551. https://doi.org/10.1162/neco.1989.1.4.541.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Krizhevsky Alex, Sutskever Ilya, and E. Hinton Geoffrey. ImageNet Classification with Deep Convolutional Neural Networks. Communications of the ACM 60, 6 (June 2017), 84-90. https://doi.org/10.1145/3065386.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Simonyan Karen, and Zisserman Andrew. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv preprint arXiv:1409.1556 (April 2015). https://arxiv.org/abs/1409.1556.Google ScholarGoogle Scholar
  5. Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going Deeper wth Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE Press, Piscataway, New Jersey, 1-9. https://doi.org/10.1109/CVPR.2015.7298594.Google ScholarGoogle ScholarCross RefCross Ref
  6. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE Press, Piscataway, New Jersey, 770-778. https://doi.org/10.1109/CVPR.2016.90.Google ScholarGoogle ScholarCross RefCross Ref
  7. Wenping Ma, Qifan Yang, Yue Wu, Wei Zhao, and Xiangrong Zhang. Double-Branch Multi-Attention Mechanism Network for Hyperspectral Image Classification. Remote Sensing 11, 11 (June 2019), 1307. https://doi.org/10.3390/rs11111307.Google ScholarGoogle ScholarCross RefCross Ref
  8. Dosovitskiy Alexey, Beyer Lucas, Kolesnikov Alexander, and 2020. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv preprint arXiv:2010.11929 (October 2020). https://arxiv.org/abs/2010.11929.Google ScholarGoogle Scholar
  9. Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. 2021. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision. IEEE Press, Piscataway, New Jersey, 10012-10022. https://doi.org/10.1109/ICCV48922.2021.00986.Google ScholarGoogle ScholarCross RefCross Ref
  10. J. Goodfellow Ian, Shlens Jonathon, and Szegedy Christian. 2014. Explaining and Harnessing Adversarial Examples. arXiv preprint arXiv:1412.6572 (December 2014). https://arxiv.org/abs/1412.6572.Google ScholarGoogle Scholar

Index Terms

  1. STFGSM: Intelligent Image Classification Model Based on Swin Transformer and Fast Gradient Sign Method

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICDLT '23: Proceedings of the 2023 7th International Conference on Deep Learning Technologies
      July 2023
      115 pages
      ISBN:9798400707520
      DOI:10.1145/3613330

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 28 September 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)26
      • Downloads (Last 6 weeks)5

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format