Smart GAN: a smart generative adversarial network for limited imbalanced dataset

Kumari, Deepa; Vyshnavi, S. K.; Dhar, Rupsa; Rajita, B. S. A. S.; Panda, Subhrakanta; Christopher, Jabez

doi:10.1007/s11227-024-06198-3

Smart GAN: a smart generative adversarial network for limited imbalanced dataset

Published: 03 June 2024

Volume 80, pages 20640–20681, (2024)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Deepa Kumari¹,
S. K. Vyshnavi¹^na1,
Rupsa Dhar¹^na1,
B. S. A. S. Rajita¹^na1,
Subhrakanta Panda¹^na1 &
…
Jabez Christopher¹^na1

368 Accesses
Explore all metrics

Abstract

Advancements in Machine Learning (ML) and Computer Vision have led to notable improvements in the detection of breast cancer. However, the accuracy of the classifier is limited due to imbalanced datasets that cause overfitting. Thus, additional images are needed to improve the classifier’s performance. Generative Adversarial Networks (GAN) are used for image augmentation. Still, its limitations, such as the random use of different types of GAN, can lead to a too-good discriminator, causing generator training to fail due to vanishing gradients. Therefore, selecting the appropriate GAN model for the given scenario is crucial for optimal performance. This paper proposes a novel Smart Generative Adversarial Network (Smart GAN) architecture to develop an efficient and computational classification model for a limited imbalanced dataset. Smart GAN uses a three-fold approach. Different types of GAN augment the dataset in the first phase of experimental work, and their evaluation metrics are calculated. In the second phase, the metric scores are used as rewards for the Reinforcement learning model (Q-learning approach). The best augmentation is chosen based on the best Q-values for each metric score. It compares three different Convolutional Neural Networks (CNN) and selects the best-suited network to classify the augmented datasets. The proposed Smart GAN architecture outperforms other existing approaches by giving a better accuracy of 89.62% and 89.91% on Mammographic Image Analysis Society (MIAS) and Digital Database for Screening Mammography (DDSM) augmented datasets, respectively, representing an approximately 10% increment in detection rate compared to non-augmented datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Conditional Generative Adversarial Networks for Data Augmentation in Breast Cancer Classification

Enhancing Breast Cancer Detection Systems: Augmenting Mammogram Images Using Generative Adversarial Networks

Synergistic transfer learning and adversarial networks for breast cancer diagnosis: benign vs. invasive classification

Article Open access 03 March 2025

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availibility

The datasets generated and/or analyzed during the current study are available from the corresponding author on reasonable request.

Code availability

The custom code developed for this study is available from the corresponding author on reasonable request.

References

Elbatel M (2022) Mammograms classification: a review. arXiv preprint arXiv:2203.03618
Shah SM, Khan RA, Arif S, Sajid U (2021) Artificial intelligence for breast cancer detection: trends & directions. arXiv preprint arXiv:2110.00942
Kumari D, Yannam PKR, Gohel IN, Naidu MVSS, Arora Y, Rajita BSAS, Panda S, Christopher J (2023) Computational model for breast cancer diagnosis using hfse framework. Biomed Signal Process Control 86:105121
Article Google Scholar
Desai SD, Giraddi S, Verma N, Gupta P, Ramya S (2020) Breast cancer detection using GAN for limited labeled dataset. In: 2020 12th International Conference on Computational Intelligence and Communication Networks (CICN), pp 34–39. IEEE
Swiderski B, Gielata L, Olszewski P, Osowski S, Kołodziej M (2021) Deep neural system for supporting tumor recognition of mammograms using modified GAN. Expert Syst Appl 164:113968
Article Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778
Liu R, Dai W, Wu T, Wang M, Wan S, Liu J (2022) Aimic: deep learning for microscopic image classification. Comput Methods Programs Biomed 226:107162
Article Google Scholar
Wu E, Wu K, Cox D, Lotter W (2018) Conditional infilling GANs for data augmentation in mammogram classification. In: Image Analysis for Moving Organ, Breast, and Thoracic Images, vol 4, issue 8, pp 98–106
Guan S, Loew M (2019) Breast cancer detection using synthetic mammograms from generative adversarial networks in convolutional neural networks. J Med Imaging 6(3):031411
Article Google Scholar
Jendele L, Skopek O, Becker AS, Konukoglu E (2019) Adversarial augmentation for enhancing classification of mammography images. arXiv preprint arXiv:1902.07762
Lee J, Nishikawa RM (2020) Simulating breast mammogram using conditional generative adversarial network: application towards finding mammographically-occult cancer. In: Medical Imaging 2020: Computer-Aided Diagnosis, vol 11314, p 1131418. International Society for Optics and Photonics
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1–9
Saxena D, Cao J (2021) Generative adversarial networks (GANs) challenges, solutions, and future directions. ACM Comput Surv (CSUR) 54(3):1–42
Article Google Scholar
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. Adv Neural Inf Process Syst 27
Borji A (2022) Pros and cons of GAN evaluation measures: new developments. Comput Vis Image Understand 215:103329
Article Google Scholar
Borji A (2021) Pros and cons of GAN evaluation measures: new developments. arXiv preprint arXiv:2103.09396
Jang B, Kim M, Harerimana G, Kim JW (2019) Q-learning algorithms: a comprehensive classification and applications. IEEE Access 7:133653–133667
Article Google Scholar
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 4700–4708
Radford A, Metz L, Chintala S (2015) Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434
Mirza M, Osindero S (2014) Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784
Zhu J-Y, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp 2223–2232
Jabbar A, Li X, Omar B (2021) A survey on generative adversarial networks: variants, applications, and training. ACM Comput Surv (CSUR) 54(8):1–49
Article Google Scholar
Doan T, Mazoure B, Lyle C (2018) Gan q-learning. arXiv preprint arXiv:1805.04874
Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S (2017) Gans trained by a two time-scale update rule converge to a local nash equilibrium. Adv Neural Inf Process Syst 30
Shorten C, Khoshgoftaar TM (2019) A survey on image data augmentation for deep learning. J Big Data 6(1):1–48
Article Google Scholar
Lucic M, Kurach K, Michalski M, Gelly S, Bousquet O (2018) Are gans created equal? A large-scale study. Adv Neural Inf Process Syst 31
Zeng Y, Lu H, Borji A (2017) Statistics of deep generated images. arXiv preprint arXiv:1708.02688
Snell J, Ridgeway K, Liao R, Roads BD, Mozer MC, Zemel RS (2017) Learning to generate images with perceptual similarity metrics. In: 2017 IEEE International Conference on Image Processing (ICIP), pp 4277–4281. IEEE
Weng L (2019) From Gan to Wgan. arXiv preprint arXiv:1904.08994
Im DJ, Kim CD, Jiang H, Memisevic R (2016) Generating images with recurrent adversarial networks. arXiv preprint arXiv:1602.05110
Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X (2016) Improved techniques for training Gans. Adv Neural Inf Process Syst 29
Che T, Li Y, Jacob AP, Bengio Y, Li W (2016) Mode regularized generative adversarial networks. arXiv preprint arXiv:1612.02136
Xu Q, Huang G, Yuan Y, Guo C, Sun Y, Wu F, Weinberger K (2018) An empirical study on evaluation metrics of generative adversarial networks. arXiv preprint arXiv:1806.07755
Kaelbling LP, Littman ML, Moore AW (1996) Reinforcement learning: a survey. J Artif Intell Res 4:237–285
Article Google Scholar
Dayan P, Niv Y (2008) Reinforcement learning: the good, the bad and the ugly. Curr Opin Neurobiol 18(2):185–196
Article Google Scholar
O’Doherty JP, Dayan P, Friston K, Critchley H, Dolan RJ (2003) Temporal difference models and reward-related learning in the human brain. Neuron 38(2):329–337
Article Google Scholar
Sewak M (2019) Temporal difference learning, SARSA, and Q-learning. Deep Reinforcement Learning. Springer, Berlin, pp 51–63
Chapter Google Scholar
Watkins CJ, Dayan P (1992) Q-learning. Mach Learn 8(3):279–292
Article Google Scholar
Yu C, Liu J, Nemati S, Yin G (2021) Reinforcement learning in healthcare: a survey. ACM Comput Surv (CSUR) 55(1):1–36
Article Google Scholar
Lo S-CB, Chan H-P, Lin J-S, Li H, Freedman MT, Mun SK (1995) Artificial convolution neural network for medical image pattern recognition. Neural Netw 8(7–8):1201–1214
Article Google Scholar
Nazir S, Dickson DM, Akram MU (2023) Survey of explainable artificial intelligence techniques for biomedical imaging with deep neural networks. Comput Biol Med 106668
Huang Z, Zhu X, Ding M, Zhang X (2020) Medical image classification using a light-weighted hybrid neural network based on pcanet and densenet. IEEE Access 8:24697–24712
Article Google Scholar
Bougourzi F, Dornaika F, Nakib A, Taleb-Ahmed A (2024) Emb-trattunet: a novel edge loss function and transformer-CNN architecture for multi-classes pneumonia infection segmentation in low annotation regimes. Artif Intell Rev 57(4):1–35
Article Google Scholar
Sunnetci KM, Kaba E, Celiker FB, Alkan A (2024) Deep network-based comprehensive parotid gland tumor detection. Acad Radiol 31(1):157–167
Article Google Scholar
Loizidou K, Elia R, Pitris C (2023) Computer-aided breast cancer detection and classification in mammography: a comprehensive review. Comput Biol Med 106554
Rahmaniar W, Hernawan A (2021) Real-time human detection using deep learning on embedded platforms: a review. J Robot Control (JRC) 2(6):462–468
Google Scholar
Dong X, Luo T, Fan R, Zhuge W, Hou C (2023) Active label distribution learning via kernel maximum mean discrepancy. Front Comput Sci 17(4):174327
Article Google Scholar
Jiang H, Diao Z, Shi T, Zhou Y, Wang F, Hu W, Zhu X, Luo S, Tong G, Yao Y-D (2023) A review of deep learning-based multiple-lesion recognition from medical images: classification, detection and segmentation. Comput Biol Med 106726
Chong MJ, Forsyth D (2020) Effectively unbiased FID and inception score and where to find them. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 6070–6079
Balakrishnan V, Shi Z, Law CL, Lim R, Teh LL, Fan Y, Periasamy J (2022) A comprehensive analysis of transformer-deep neural network models in twitter disaster detection. Mathematics 10(24):4664
Article Google Scholar
Rajita B, Halani V, Shah D, Panda S (2022) Gan-c: a generative adversarial network with a classifier for effective event prediction. Comput Intell 38(6):1922–1955
Article Google Scholar
Kumari D, Parmar AS, Goyal HS, Mishra K, Panda S (2024) Healthrec-chain: patient-centric blockchain enabled ipfs for privacy preserving scalable health data. Comput Netw 110223
Wu E, Wu K, Lotter W (2020) Synthesizing Lesions using contextual GANs improves breast cancer classification on mammograms. arXiv preprint arXiv:2006.00086
Alyafi B, Diaz O, Marti R (2020) DCGANs for realistic breast mass augmentation in X-ray mammography. In: Medical Imaging 2020: Computer-Aided Diagnosis, vol 11314, p 1131420. International Society for Optics and Photonics
Muramatsu C, Nishio M, Goto T, Oiwa M, Morita T, Yakami M, Kubo T, Togashi K, Fujita H (2020) Improving breast mass classification by shared data with domain transformation using a generative adversarial network. Comput Biol Med 119:103698
Article Google Scholar
Kansal S, Goel S, Bhattacharya J, Srivastava V (2020) Generative adversarial network-convolution neural network based breast cancer classification using optical coherence tomographic images. Laser Phys 30(11):115601
Article Google Scholar
Shen T, Hao K, Gou C, Wang F-Y (2021) Mass image synthesis in mammogram with contextual information based on GANs. Comput Methods Programs Biomed 202:106019
Article Google Scholar
Pang T, Wong JHD, Ng WL, Chan CS (2021) Semi-supervised GAN-based radiomics model for data augmentation in breast ultrasound mass classification. Comput Methods Programs Biomed 203:106018
Article Google Scholar
Souza R, Lucena O, Garrafa J, Gobbi D, Saluzzi M, Appenzeller S, Rittner L, Frayne R, Lotufo R (2018) An open, multi-vendor, multi-field-strength brain mr dataset and analysis of publicly available skull stripping methods agreement. NeuroImage 170:482–494
Article Google Scholar
Bien N, Rajpurkar P, Ball RL, Irvin J, Park A, Jones E, Bereket M, Patel BN, Yeom KW, Shpanskaya K et al (2018) Deep-learning-assisted diagnosis for knee magnetic resonance imaging: development and retrospective validation of mrnet. PLoS Med 15(11):1002699
Article Google Scholar
Antonelli M, Reinke A, Bakas S, Farahani K, Kopp-Schneider A, Landman BA, Litjens G, Menze B, Ronneberger O, Summers RM et al (2022) The medical segmentation decathlon. Nat Commun 13(1):4128
Article Google Scholar

Download references

Funding

No external funding is received for this research.

Author information

S. K. Vyshnavi, Rupsa Dhar, B. S. A. S. Rajita, Subhrakanta Panda and Jabez Christopher have contributed equally to this work.

Authors and Affiliations

CSIS department, BITS Pilani, Hyderabad Campus, Shameerpet, Hyderabad, Telangana, 500078, India
Deepa Kumari, S. K. Vyshnavi, Rupsa Dhar, B. S. A. S. Rajita, Subhrakanta Panda & Jabez Christopher

Authors

Deepa Kumari
View author publications
You can also search for this author inPubMed Google Scholar
S. K. Vyshnavi
View author publications
You can also search for this author inPubMed Google Scholar
Rupsa Dhar
View author publications
You can also search for this author inPubMed Google Scholar
B. S. A. S. Rajita
View author publications
You can also search for this author inPubMed Google Scholar
Subhrakanta Panda
View author publications
You can also search for this author inPubMed Google Scholar
Jabez Christopher
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

Deepa Kumari involved in the study design, data collection, and manuscript writing; Vyshnavi SK and Rupsa Dhar conducted data analysis and interpretation; BSAS Rajita compared the proposed work with existing methods; Subhrakanta Panda and Jabez Christopher supervised, reviewed and edited the manuscript.

Corresponding author

Correspondence to Deepa Kumari.

Ethics declarations

Conflict of interest

The authors declare that there are no Conflict of interest or Conflict of interest related to this work.

Ethical approval

Not Applicable.

Consent to participate

Informed consent was obtained from all participants (or their legal guardians) included in the study.

Consent for publication

Consent for the publication of potentially identifying information and images was obtained from all participants (or their legal guardians) included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Kumari, D., Vyshnavi, S.K., Dhar, R. et al. Smart GAN: a smart generative adversarial network for limited imbalanced dataset. J Supercomput 80, 20640–20681 (2024). https://doi.org/10.1007/s11227-024-06198-3

Download citation

Accepted: 08 May 2024
Published: 03 June 2024
Issue Date: September 2024
DOI: https://doi.org/10.1007/s11227-024-06198-3

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Smart GAN: a smart generative adversarial network for limited imbalanced dataset

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Conditional Generative Adversarial Networks for Data Augmentation in Breast Cancer Classification

Enhancing Breast Cancer Detection Systems: Augmenting Mammogram Images Using Generative Adversarial Networks

Synergistic transfer learning and adversarial networks for breast cancer diagnosis: benign vs. invasive classification

Explore related subjects

Data availibility

Code availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Consent to participate

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now