
SRK-Augment: A self-replacement and discriminative region keeping augmentation scheme for better classification

Published in: Neural Processing Letters

Abstract

Data augmentation has proven to be a critical and effective way to alleviate over-fitting in deep learning models. Region-level removal is one of the state-of-the-art solutions: it not only synthesizes vicinity samples but also improves model generalization. However, removing regions with a random strategy tends to make the training samples suffer from excessive information loss and the introduction of negative noise. In this paper, we propose a novel data augmentation scheme called Self-Replacement-and-Keeping-Augment (SRK-Augment), which exploits self-deformation data as the replacement template and keeps the discriminative parts of the input image, guided by the Class Activation Map (CAM). Concretely, we first apply the Grad-CAM++ algorithm to compute the CAM mask of the input image and design a patch-shuffling mechanism (PS-operator) to obtain a structural self-deformation template. Then, we use the self-deformation template to fill the information-removal area and apply the binary CAM mask to recover the discriminative regions. Finally, the augmented data are randomly used for model training. The proposed method is simple to implement and can be combined with existing augmentation strategies at low computational cost. Extensive experiments are conducted on challenging datasets, and with the SRK-Augment strategy, DCNNs achieve clear improvements: on CIFAR-10, the Top-1 error rate drops by up to 2.07%; on CIFAR-100, by up to 3.73%; on Mini-ImageNet, by up to 3.38%; and on Pascal VOC, the mean Average Precision increases by up to 1.38%. The experimental results demonstrate the effectiveness and generality of the proposed method.
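The pipeline described above (patch-shuffle template, self-replacement of a removal region, CAM-guided keeping) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the grid size, the rectangular removal region, and the CAM threshold of 0.5 are illustrative assumptions, and the CAM itself is taken as a given array rather than computed with Grad-CAM++.

```python
import random
import numpy as np

def patch_shuffle(img, grid=4, seed=0):
    """PS-operator sketch: split the image into grid x grid patches and
    shuffle them to build a structural self-deformation template.
    Assumes height and width are divisible by `grid`."""
    h, w = img.shape[:2]
    ph, pw = h // grid, w // grid
    patches = [img[i * ph:(i + 1) * ph, j * pw:(j + 1) * pw]
               for i in range(grid) for j in range(grid)]
    random.Random(seed).shuffle(patches)
    rows = [np.concatenate(patches[r * grid:(r + 1) * grid], axis=1)
            for r in range(grid)]
    return np.concatenate(rows, axis=0)

def srk_augment(img, cam, region, thresh=0.5, grid=4):
    """SRK-Augment sketch.
    img:    (H, W, C) image array
    cam:    (H, W) class activation map in [0, 1]
    region: (y0, y1, x0, x1) rectangle to fill with the template
    """
    template = patch_shuffle(img, grid)
    out = img.copy()
    y0, y1, x0, x1 = region
    # Self-replacement: fill the removal area with the shuffled template
    out[y0:y1, x0:x1] = template[y0:y1, x0:x1]
    # Keeping: binarize the CAM and restore discriminative pixels
    keep = cam >= thresh
    out[keep] = img[keep]
    return out
```

A call such as `srk_augment(img, cam, (0, H, 0, W))` replaces the whole image with its self-deformation template and then restores only the CAM-highlighted regions; smaller regions interpolate between the original and this extreme.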



Acknowledgements

This work was supported by the National Natural Science Foundation of China (Grant No. 62103393).

Author information

Corresponding author

Correspondence to Zonghai Chen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Zhao, H., Wang, J., Chen, Z. et al. SRK-Augment: A self-replacement and discriminative region keeping augmentation scheme for better classification. Neural Process Lett 55, 3533–3549 (2023). https://doi.org/10.1007/s11063-022-11022-1

