Abstract
Advancements in Machine Learning (ML) and Computer Vision have led to notable improvements in the detection of breast cancer. However, the accuracy of the classifier is limited due to imbalanced datasets that cause overfitting. Thus, additional images are needed to improve the classifier’s performance. Generative Adversarial Networks (GAN) are used for image augmentation. Still, its limitations, such as the random use of different types of GAN, can lead to a too-good discriminator, causing generator training to fail due to vanishing gradients. Therefore, selecting the appropriate GAN model for the given scenario is crucial for optimal performance. This paper proposes a novel Smart Generative Adversarial Network (Smart GAN) architecture to develop an efficient and computational classification model for a limited imbalanced dataset. Smart GAN uses a three-fold approach. Different types of GAN augment the dataset in the first phase of experimental work, and their evaluation metrics are calculated. In the second phase, the metric scores are used as rewards for the Reinforcement learning model (Q-learning approach). The best augmentation is chosen based on the best Q-values for each metric score. It compares three different Convolutional Neural Networks (CNN) and selects the best-suited network to classify the augmented datasets. The proposed Smart GAN architecture outperforms other existing approaches by giving a better accuracy of 89.62% and 89.91% on Mammographic Image Analysis Society (MIAS) and Digital Database for Screening Mammography (DDSM) augmented datasets, respectively, representing an approximately 10% increment in detection rate compared to non-augmented datasets.















Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availibility
The datasets generated and/or analyzed during the current study are available from the corresponding author on reasonable request.
Code availability
The custom code developed for this study is available from the corresponding author on reasonable request.
References
Elbatel M (2022) Mammograms classification: a review. arXiv preprint arXiv:2203.03618
Shah SM, Khan RA, Arif S, Sajid U (2021) Artificial intelligence for breast cancer detection: trends & directions. arXiv preprint arXiv:2110.00942
Kumari D, Yannam PKR, Gohel IN, Naidu MVSS, Arora Y, Rajita BSAS, Panda S, Christopher J (2023) Computational model for breast cancer diagnosis using hfse framework. Biomed Signal Process Control 86:105121
Desai SD, Giraddi S, Verma N, Gupta P, Ramya S (2020) Breast cancer detection using GAN for limited labeled dataset. In: 2020 12th International Conference on Computational Intelligence and Communication Networks (CICN), pp 34–39. IEEE
Swiderski B, Gielata L, Olszewski P, Osowski S, Kołodziej M (2021) Deep neural system for supporting tumor recognition of mammograms using modified GAN. Expert Syst Appl 164:113968
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778
Liu R, Dai W, Wu T, Wang M, Wan S, Liu J (2022) Aimic: deep learning for microscopic image classification. Comput Methods Programs Biomed 226:107162
Wu E, Wu K, Cox D, Lotter W (2018) Conditional infilling GANs for data augmentation in mammogram classification. In: Image Analysis for Moving Organ, Breast, and Thoracic Images, vol 4, issue 8, pp 98–106
Guan S, Loew M (2019) Breast cancer detection using synthetic mammograms from generative adversarial networks in convolutional neural networks. J Med Imaging 6(3):031411
Jendele L, Skopek O, Becker AS, Konukoglu E (2019) Adversarial augmentation for enhancing classification of mammography images. arXiv preprint arXiv:1902.07762
Lee J, Nishikawa RM (2020) Simulating breast mammogram using conditional generative adversarial network: application towards finding mammographically-occult cancer. In: Medical Imaging 2020: Computer-Aided Diagnosis, vol 11314, p 1131418. International Society for Optics and Photonics
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1–9
Saxena D, Cao J (2021) Generative adversarial networks (GANs) challenges, solutions, and future directions. ACM Comput Surv (CSUR) 54(3):1–42
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. Adv Neural Inf Process Syst 27
Borji A (2022) Pros and cons of GAN evaluation measures: new developments. Comput Vis Image Understand 215:103329
Borji A (2021) Pros and cons of GAN evaluation measures: new developments. arXiv preprint arXiv:2103.09396
Jang B, Kim M, Harerimana G, Kim JW (2019) Q-learning algorithms: a comprehensive classification and applications. IEEE Access 7:133653–133667
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 4700–4708
Radford A, Metz L, Chintala S (2015) Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434
Mirza M, Osindero S (2014) Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784
Zhu J-Y, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp 2223–2232
Jabbar A, Li X, Omar B (2021) A survey on generative adversarial networks: variants, applications, and training. ACM Comput Surv (CSUR) 54(8):1–49
Doan T, Mazoure B, Lyle C (2018) Gan q-learning. arXiv preprint arXiv:1805.04874
Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S (2017) Gans trained by a two time-scale update rule converge to a local nash equilibrium. Adv Neural Inf Process Syst 30
Shorten C, Khoshgoftaar TM (2019) A survey on image data augmentation for deep learning. J Big Data 6(1):1–48
Lucic M, Kurach K, Michalski M, Gelly S, Bousquet O (2018) Are gans created equal? A large-scale study. Adv Neural Inf Process Syst 31
Zeng Y, Lu H, Borji A (2017) Statistics of deep generated images. arXiv preprint arXiv:1708.02688
Snell J, Ridgeway K, Liao R, Roads BD, Mozer MC, Zemel RS (2017) Learning to generate images with perceptual similarity metrics. In: 2017 IEEE International Conference on Image Processing (ICIP), pp 4277–4281. IEEE
Weng L (2019) From Gan to Wgan. arXiv preprint arXiv:1904.08994
Im DJ, Kim CD, Jiang H, Memisevic R (2016) Generating images with recurrent adversarial networks. arXiv preprint arXiv:1602.05110
Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X (2016) Improved techniques for training Gans. Adv Neural Inf Process Syst 29
Che T, Li Y, Jacob AP, Bengio Y, Li W (2016) Mode regularized generative adversarial networks. arXiv preprint arXiv:1612.02136
Xu Q, Huang G, Yuan Y, Guo C, Sun Y, Wu F, Weinberger K (2018) An empirical study on evaluation metrics of generative adversarial networks. arXiv preprint arXiv:1806.07755
Kaelbling LP, Littman ML, Moore AW (1996) Reinforcement learning: a survey. J Artif Intell Res 4:237–285
Dayan P, Niv Y (2008) Reinforcement learning: the good, the bad and the ugly. Curr Opin Neurobiol 18(2):185–196
O’Doherty JP, Dayan P, Friston K, Critchley H, Dolan RJ (2003) Temporal difference models and reward-related learning in the human brain. Neuron 38(2):329–337
Sewak M (2019) Temporal difference learning, SARSA, and Q-learning. Deep Reinforcement Learning. Springer, Berlin, pp 51–63
Watkins CJ, Dayan P (1992) Q-learning. Mach Learn 8(3):279–292
Yu C, Liu J, Nemati S, Yin G (2021) Reinforcement learning in healthcare: a survey. ACM Comput Surv (CSUR) 55(1):1–36
Lo S-CB, Chan H-P, Lin J-S, Li H, Freedman MT, Mun SK (1995) Artificial convolution neural network for medical image pattern recognition. Neural Netw 8(7–8):1201–1214
Nazir S, Dickson DM, Akram MU (2023) Survey of explainable artificial intelligence techniques for biomedical imaging with deep neural networks. Comput Biol Med 106668
Huang Z, Zhu X, Ding M, Zhang X (2020) Medical image classification using a light-weighted hybrid neural network based on pcanet and densenet. IEEE Access 8:24697–24712
Bougourzi F, Dornaika F, Nakib A, Taleb-Ahmed A (2024) Emb-trattunet: a novel edge loss function and transformer-CNN architecture for multi-classes pneumonia infection segmentation in low annotation regimes. Artif Intell Rev 57(4):1–35
Sunnetci KM, Kaba E, Celiker FB, Alkan A (2024) Deep network-based comprehensive parotid gland tumor detection. Acad Radiol 31(1):157–167
Loizidou K, Elia R, Pitris C (2023) Computer-aided breast cancer detection and classification in mammography: a comprehensive review. Comput Biol Med 106554
Rahmaniar W, Hernawan A (2021) Real-time human detection using deep learning on embedded platforms: a review. J Robot Control (JRC) 2(6):462–468
Dong X, Luo T, Fan R, Zhuge W, Hou C (2023) Active label distribution learning via kernel maximum mean discrepancy. Front Comput Sci 17(4):174327
Jiang H, Diao Z, Shi T, Zhou Y, Wang F, Hu W, Zhu X, Luo S, Tong G, Yao Y-D (2023) A review of deep learning-based multiple-lesion recognition from medical images: classification, detection and segmentation. Comput Biol Med 106726
Chong MJ, Forsyth D (2020) Effectively unbiased FID and inception score and where to find them. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 6070–6079
Balakrishnan V, Shi Z, Law CL, Lim R, Teh LL, Fan Y, Periasamy J (2022) A comprehensive analysis of transformer-deep neural network models in twitter disaster detection. Mathematics 10(24):4664
Rajita B, Halani V, Shah D, Panda S (2022) Gan-c: a generative adversarial network with a classifier for effective event prediction. Comput Intell 38(6):1922–1955
Kumari D, Parmar AS, Goyal HS, Mishra K, Panda S (2024) Healthrec-chain: patient-centric blockchain enabled ipfs for privacy preserving scalable health data. Comput Netw 110223
Wu E, Wu K, Lotter W (2020) Synthesizing Lesions using contextual GANs improves breast cancer classification on mammograms. arXiv preprint arXiv:2006.00086
Alyafi B, Diaz O, Marti R (2020) DCGANs for realistic breast mass augmentation in X-ray mammography. In: Medical Imaging 2020: Computer-Aided Diagnosis, vol 11314, p 1131420. International Society for Optics and Photonics
Muramatsu C, Nishio M, Goto T, Oiwa M, Morita T, Yakami M, Kubo T, Togashi K, Fujita H (2020) Improving breast mass classification by shared data with domain transformation using a generative adversarial network. Comput Biol Med 119:103698
Kansal S, Goel S, Bhattacharya J, Srivastava V (2020) Generative adversarial network-convolution neural network based breast cancer classification using optical coherence tomographic images. Laser Phys 30(11):115601
Shen T, Hao K, Gou C, Wang F-Y (2021) Mass image synthesis in mammogram with contextual information based on GANs. Comput Methods Programs Biomed 202:106019
Pang T, Wong JHD, Ng WL, Chan CS (2021) Semi-supervised GAN-based radiomics model for data augmentation in breast ultrasound mass classification. Comput Methods Programs Biomed 203:106018
Souza R, Lucena O, Garrafa J, Gobbi D, Saluzzi M, Appenzeller S, Rittner L, Frayne R, Lotufo R (2018) An open, multi-vendor, multi-field-strength brain mr dataset and analysis of publicly available skull stripping methods agreement. NeuroImage 170:482–494
Bien N, Rajpurkar P, Ball RL, Irvin J, Park A, Jones E, Bereket M, Patel BN, Yeom KW, Shpanskaya K et al (2018) Deep-learning-assisted diagnosis for knee magnetic resonance imaging: development and retrospective validation of mrnet. PLoS Med 15(11):1002699
Antonelli M, Reinke A, Bakas S, Farahani K, Kopp-Schneider A, Landman BA, Litjens G, Menze B, Ronneberger O, Summers RM et al (2022) The medical segmentation decathlon. Nat Commun 13(1):4128
Funding
No external funding is received for this research.
Author information
Authors and Affiliations
Contributions
Deepa Kumari involved in the study design, data collection, and manuscript writing; Vyshnavi SK and Rupsa Dhar conducted data analysis and interpretation; BSAS Rajita compared the proposed work with existing methods; Subhrakanta Panda and Jabez Christopher supervised, reviewed and edited the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there are no Conflict of interest or Conflict of interest related to this work.
Ethical approval
Not Applicable.
Consent to participate
Informed consent was obtained from all participants (or their legal guardians) included in the study.
Consent for publication
Consent for the publication of potentially identifying information and images was obtained from all participants (or their legal guardians) included in the study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kumari, D., Vyshnavi, S.K., Dhar, R. et al. Smart GAN: a smart generative adversarial network for limited imbalanced dataset. J Supercomput 80, 20640–20681 (2024). https://doi.org/10.1007/s11227-024-06198-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-024-06198-3