Skip to main content

Advertisement

Log in

SuperpixelGridMasks Data Augmentation: Application to Precision Health and Other Real-world Data

  • Research Article
  • Published:
Journal of Healthcare Informatics Research Aims and scope Submit manuscript

Abstract

A novel approach of data augmentation based on irregular superpixel decomposition is proposed. This approach called SuperpixelGridMasks permits to extend original image datasets that are required by training stages of machine learning-related analysis architectures towards increasing their performances. Three variants named SuperpixelGridCut, SuperpixelGridMean, and SuperpixelGridMix are presented. These grid-based methods produce a new style of image transformations using the dropping and fusing of information. Extensive experiments using various image classification models as well as precision health and surrounding real-world datasets show that baseline performances can be significantly outperformed using our methods. The comparative study also shows that our methods can overpass the performances of other data augmentations. SuperpixelGridCut, SuperpixelGridMean, and SuperpixelGridMix codes are publicly available at https://github.com/hammoudiproject/SuperpixelGridMasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Algorithm 1
Algorithm 2
Algorithm 3
Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Code Availability

The source codes permitting to generate the presented SuperpixelGridMasks data augmentations will be publicly made available online at: https://github.com/hammoudiproject/SuperpixelGridMasks.

Notes

  1. Dataset Chest X-Ray Images: https://www.kaggle.com/datasets/paultimothymooney/chest-xray-pneumonia

  2. A PASCAL VOC dataset: http://host.robots.ox.ac.uk/pascal/VOC/databases.html#VOC2005_1

References

  1. Yang Z, Benhabiles H, Hammoudi K, Windal F, He R, Collard D (2021) A generalized deep learning-based framework for assistance to the human malaria diagnosis from microscopic images. Neural Comput. Appl.

  2. Hammoudi K, Benhabiles H, Melkemi M, Dornaika F, Arganda-carreras I, Collard D, Scherpereel A (2021) Deep learning on chest x-ray images to detect and evaluate pneumonia cases at the era of COVID-19. J Medical Syst 45(7):75

    Article  Google Scholar 

  3. Hammoudi K, Cabani A, Benhabiles H, Melkemi M (2020) Validating the correct wearing of protection mask by taking a selfie: design of a mobile application “checkyourmask” to limit the spread of covid-19. Comput Model Eng & Sci 124(3):1049–1059

    Google Scholar 

  4. Cabani A, Hammoudi K, Benhabiles H, Melkemi M (2020) Maskedface-net – a dataset of correctly/incorrectly masked face images in the context of covid-19. Smart Health

  5. Buslaev A, Iglovikov VI, Khvedchenya E, Parinov A, Druzhinin M, Kalinin AA (2020) Albumentations: fast and flexible image augmentations. Information 11:2

    Article  Google Scholar 

  6. Naveed H (2021) Survey: image mixing and deleting for data augmentation. CoRR, abs/2106.07085

  7. Devries T, Taylor GW (2017) Improved regularization of convolutional neural networks with cutout. CoRR, abs/1708.04552

  8. Yun S, Han D, Chun S, Oh S, Yoo Y, Choe J (2019) Cutmix: regularization strategy to train strong classifiers with localizable features. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), (Los Alamitos, CA, USA), pp. 6022–6031, IEEE Computer Society

  9. Huang S, Wang X, Tao D (2021) Snapmix: semantically proportional mixing for augmenting fine-grained data, in AAAI

  10. Zhao C, Lei Y (2021) Intra-class cutmix for unbalanced data augmentation. In: 2021 13th International Conference on Machine Learning and Computing, ICMLC 2021, (New York, NY, USA), p. 246–251 Association for Computing Machinery

  11. Bochkovskiy A, Wang C, Liao HM (2020) Yolov4: optimal speed and accuracy of object detection. CoRR, abs/2004.10934

  12. Chen P, Liu S, Zhao H, Jia J (2020) Gridmask data augmentation. CoRR, abs/2001.04086

  13. Feng S, Yang S, Niu Z, Xie J, Wei M, Li P (2021) Grid cut and mix: flexible and efficient data augmentation. In: Pan Z, Hei X (eds) Twelfth International Conference on Graphics and Image Processing (ICGIP 2020), vol 11720. International Society for Optics and Photonics, SPIE, pp 656–662

  14. Pereira MB, Santos JAD (2021) Chessmix: spatial context data augmentation for remote sensing semantic segmentation

  15. Kim J-H, Choo W, Song HO (2020) Puzzle mix: exploiting saliency and local statistics for optimal mixup. In: III HD, Singh A (eds) Proceedings of the 37th International Conference on Machine Learning. Proceedings of Machine Learning Research, PMLR 13–18, vol 119, pp 5275–5285

  16. Walawalkar D, Shen Z, Liu Z, Savvides M (2020) Attentive cutmix: an enhanced data augmentation approach for deep learning based image classification

  17. Uddin AFMS, Monira MS, Shin W, Chung T, Bae S-H (2021) Saliencymix: a saliency guided data augmentation strategy for better regularization. arXiv:2006.01791

  18. Yang L, Li X, Zhao B, Song R, Yang J (2022) Recursivemix: mixed learning with history

  19. Li C-L, Sohn K, Yoon J, Pfister T (2021) Cutpaste: self-supervised learning for anomaly detection and localization. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 9659–9669

  20. Hendrycks D, Mu N, Cubuk ED, Zoph B, Gilmer J, Lakshminarayanan B (2020) Augmix: a simple data processing method to improve robustness and uncertainty. In: Proceedings of the International Conference on Learning Representations (ICLR)

  21. Zhang Y, Yang L, Zheng H, Liang P, Mangold C, Loreto RG, Hughes DP, Chen DZ (2019) SPDA: Superpixel-based data augmentation for biomedical image segmentation. In: Cardoso MJ, Feragen A, Glocker B, Konukoglu E, Oguz I, Unal G, Vercauteren T (eds) Proceedings of the 2nd International Conference on Medical Imaging with Deep Learning, vol 102, pp 572–587

  22. Acción L, Argüello F, Heras DB (2020) Dual-window superpixel data augmentation for hyperspectral image classification. Appl Sci 10:24

    Article  Google Scholar 

  23. Franchi G, Belkhir N, Ha ML, hu y., Bursuc A, Blanz V, Yao A (Nov. 2021) Robust semantic segmentation with Superpixel-Mix. In: The British machine vision conference (BMVC), Online, United Kingdom

  24. Wang M, Liu X, Gao Y, Ma X, Soomro NQ (2017) Superpixel segmentation: a benchmark. Signal Process Image Commun 56:28–39

    Article  Google Scholar 

  25. Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp 248–255

  26. Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Süsstrunk S. (2012) Slic superpixels compared to state-of-the-art superpixel methods. IEEE Trans Pattern Anal Mach Intell 34(11):2274–2282

    Article  Google Scholar 

  27. van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2605 11

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Karim Hammoudi.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Karim Hammoudi and Adnane Cabani contributed equally to this work.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hammoudi, K., Cabani, A., Slika, B. et al. SuperpixelGridMasks Data Augmentation: Application to Precision Health and Other Real-world Data. J Healthc Inform Res 6, 442–460 (2022). https://doi.org/10.1007/s41666-022-00122-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41666-022-00122-1

Keywords