Abstract
In this study, we present a Histogram-Matching based novel data augmentation method for Covid-19 and Pneumonia detection, from an imbalanced Radiology (Chest X-Ray) image dataset. Moreover, we have publicly shared a large augmented CXR dataset on the Kaggle site (https://www.kaggle.com/datasets/vibhuti25/histogram-matching-covid-cxr), which is more balanced than the original dataset. For data augmentation, we have incorporated the Histogram Matching technique, CLAHE 0.5, and CLAHE 0.75, in order to increase the number of images in the dataset. The number of images in the augmented dataset per class is chosen by a statistical formula which is computed from the dataset itself. We have implemented two existing Convolutional Neural Network (CNN) models for Covid-19 detection i.e., Covid Net and Covid-Lite models, on the proposed augmented dataset. We have achieved 3–5% and 7–9% improvement of testing accuracy, precision, recall and F1 score for Covid Net and Covid-Lite model respectively, after employing the proposed augmented dataset. Furthermore, we have implemented a popular CNN model i.e., VGG-16 and by the proposed data-augmentation we have achieved the best results of 95% accuracy, 96% F1 score, 96% precision and 96% recall.
Supported by organization x.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Gorbalenya, A.E., et al.: The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. Nat. Microbiol. 5(4), 536–544 (2020)
Del Rio, C., Malani, P.N.: COVID-19 in 2022—the beginning of the end or the end of the beginning? JAMA 327(24), 2389–2390 (2022)
Siddhartha, M., Santra, A.: COVIDLite: a depth-wise separable deep neural network with white balance and CLAHE for detection of Covid-19. arXiv preprint arXiv:2006.13873 (2020)
Wang, L., Lin, Z.Q., Wong, A.: Covid-net: a tailored deep convolutional neural network design for detection of Covid-19 cases from chest X-ray images. Sci. Rep. 10(1), 1–12 (2020)
Sitaula, C., Hossain, M.B.: Attention-based VGG-16 model for Covid-19 chest X-ray image classification. Appl. Intell. 51(5), 2850–2863 (2021)
Kumar, A., Tripathi, A.R., Satapathy, S.C., Zhang, Y.D.: SARS-Net: Covid-19 detection from chest X-rays by combining graph convolutional network and convolutional neural network. Pattern Recogn. 122, 108255 (2022)
Xu, Y., Lam, H.K., Jia, G.: Manet: a two-stage deep learning method for classification of Covid-19 from chest X-ray images. Neurocomputing 443, 96–105 (2021)
Calderon-Ramirez, et al.: Correcting data imbalance for semi-supervised Covid-19 detection using X-ray chest images. Appl. Soft Comput. 111, 107692 (2021)
Tyagi, M., Roy, S., Bansal, V.: Custom weighted balanced loss function for Covid 19 detection from an imbalanced CXR dataset. In: 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, Canada, pp. 2707–2713. IEEE (2022)
Roy, S., Tyagi, M., Bansal, V., Jain, V.: SVD-CLAHE boosting and balanced loss function for Covid-19 detection from an imbalanced chest X-ray dataset. Comput. Biol. Med. 106092 (2022)
Chandra, T.B., Verma, K., Singh, B.K., Jain, D., Netam, S.S.: Coronavirus disease (Covid-19) detection in chest X-ray images using majority voting based classifier ensemble. Expert Syst. Appl. 165, 113909 (2021)
Chamseddine, E., Mansouri, N., Soui, M., Abed, M.: Handling class imbalance in Covid-19 chest X-ray images classification: using smote and weighted loss. Appl. Soft Comput. 129, 109588 (2022)
Chowdhury, M.E., et al.: Can AI help in screening viral and Covid-19 pneumonia? IEEE Access 8, 132665–132676 (2020)
Chen, X., et al.: Automatic histogram specification for glioma grading using multicenter data. J. Healthcare Eng. 2019 (2019)
Gonzalez, R.C.: Digital Image Processing. Pearson Education India (2009)
Papoulis, A., Unnikrishna Pillai, S.: Probability, random variables and stochastic processes (2002)
Roy, S., Bhalla, K., Patel, R.: Mathematical analysis of histogram equalization techniques for medical image enhancement: a tutorial from the perspective of data loss. Multimed. Tools Appl. 1–30 (2023)
Pizer, S.M.: Contrast-limited adaptive histogram equalization: speed and effectiveness Stephen M. Pizer, R. Eugene Johnston, James P. Ericksen, Bonnie C. Yankaskas, Keith E. Muller medical image display research group. In: Proceedings of the First Conference on Visualization in Biomedical Computing, Atlanta, Georgia, vol. 337, p. 1 (1990)
Roy, S., Panda, S., Jangid, M.: Modified reinhard algorithm for color normalization of colorectal cancer histopathology images. In: 29th European Signal Processing Conference (EUSIPCO), Dublin, Ireland, pp. 1231–1235. IEEE (2021)
Al-Waisy, A.S., et al.: COVID-CheXNet: hybrid deep learning framework for identifying COVID-19 virus in chest X-rays images. Soft. Comput. 27(5), 2657–2672 (2023)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Roy, S., Bansal, V. (2024). Histogram Matching Based Data-Augmentation and Its Impact on CNN Model for Covid-19 and Pneumonia Detection from Radiology Images. In: Kaur, H., Jakhetiya, V., Goyal, P., Khanna, P., Raman, B., Kumar, S. (eds) Computer Vision and Image Processing. CVIP 2023. Communications in Computer and Information Science, vol 2009. Springer, Cham. https://doi.org/10.1007/978-3-031-58181-6_12
Download citation
DOI: https://doi.org/10.1007/978-3-031-58181-6_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-58180-9
Online ISBN: 978-3-031-58181-6
eBook Packages: Computer ScienceComputer Science (R0)