skip to main content
10.1145/3638569.3638577acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccbbConference Proceedingsconference-collections
research-article

COViT: Convolutions and ViT based Deep Learning Model for Covid19 and Viral Pneumonia Classification using X-ray Datasets

Published:07 March 2024Publication History

ABSTRACT

Artificial Intelligence based Covid19 through X-ray scans has revolutionized early diagnosis and treatment since the outbreak. There have been remarkable achievements in the research of Covid19 from Normal or other Pneumonia X-ray image classification using a convolutional neural network (CNN). CNN alone face problems in describing low-level features and can miss important information. Moreover, accurate diagnosis is important in the medical field with minimum false alarms. To answer the issue, the researchers of this paper have turned to self-attention mechanism inspired by the ViT, which has displayed state-of-the-art performance in the classification task. The proposed COViT method uses convolutions of 3 × 3 instead of patch embedding as in ViT, then alternate self-attention and MLP with hardswish function are added, and finally, MLP head is proposed which has average pooling, fully connected (FC) layer with ReLU function and kernel L2 as a classifier which improves the accuracy. Exhaustive experiments are carried out on three datasets. We have only considered Covid19 and Viral Pneumonia classes for our problem. The proposed model has achieved 98.98% classification accuracy on dataset1, 99.50% on dataset2 and 99.18% on dataset3, which validates the efficiency of COViT The proposed COViT which uses self-attention and CNN shows superiority over other SOTA models and has better accuracy than the methods in the literature.

References

  1. Q. Li, X. Guan, P. Wu, X. Wang, L. Zhou, Y. Tong, R. Ren, K. S. M. Leung, E. H. Y. Lau , “Early transmission dynamics in Wuhan China, of novel Coronavirus–infected pneumonia,” New England Journal of Medicine, vol. 382, pp. 1199–1207, 2020.Google ScholarGoogle ScholarCross RefCross Ref
  2. V. M. Corman, O. Landt, M. Kaiser, R. Molenkamp, A. Meijer, D. K. Chu, T. Bleicker, S. Brunink, J. Schneider, M. L. Schmidt, , “Detection of 2019 novel coronavirus (2019-ncov) by real-time rt-pcr,” Eurosurveillance, vol. 25, no. 3, pp. 2000045, 2020.Google ScholarGoogle ScholarCross RefCross Ref
  3. B. Van Ginneken, B. M. Ter Haar Romeny and M. A. Viergever, “Computer-aided diagnosis in chest radiography: a survey,” IEEE Transactions on Medical Imaging, vol. 20, no. 12, pp. 1228-1241, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  4. S. Khan and A. Khan, “FFireNet: Deep Learning Based Forest Fire Classification and Detection in Smart Cities,” Symmetry, vol. 14(10), pp. 2155, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  5. A. Khan, S. Khan, B. Hassan, and Z. Zheng, “CNN-Based Smoker Classification and Detection in Smart City Application,” Sensors, vol. 22, no. 3, pp. 892, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  6. X. Wang, R. Girshick, A. Gupta, and K. He, “Non-local Neural Networks,” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7794-7803, 2018.Google ScholarGoogle ScholarCross RefCross Ref
  7. J. B. Cordonnier, A. Loukas, and M. Jaggi, “On the relationship between self-attention and convolutional layers,” In Proceedings of the International Conference on Learning Representation (ICLR), pp. 1-18, 2020.Google ScholarGoogle Scholar
  8. H. Touvron, M. Cord, M. Douze M, “Training data-efficient image transformers & distillation through attention,” In Proceedings of International Conference on Machine Learning, PMLR, pp 10347-10357, 2021.Google ScholarGoogle Scholar
  9. W. Wang, E. Xie, X. Li, , “Pyramid vision transformer: A versatile backbone for dense prediction without convolutions,” In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 568-578, 2021.Google ScholarGoogle ScholarCross RefCross Ref
  10. M Farooq and A. Hafeez, “COVID-ResNet: A Deep Learning Framework for Screening of COVID19 from Radiographs,” 2020, arXiv:2003.14395v1.Google ScholarGoogle Scholar
  11. T. Ozturk, M. Talo, E. A. Yildirim, U. B. Baloglu, O. Yildirim, U. R. Acharya, “Automated detection of COVID-19 cases using deep neural networks with X-ray images,” Computers in Biology and Medicine, vol. 121, pp. 103792, 2020.Google ScholarGoogle ScholarCross RefCross Ref
  12. M. M. Taresh, N. Zhu, T. A. A. Ali, A. S. Hameed and M. L. Mutar, “Transfer Learning to Detect COVID-19 Automatically from X-Ray Images Using Convolutional Neural Networks,” International Journal of Biomedical Imaging, pp. 8828404, 2021.Google ScholarGoogle Scholar
  13. S. Hira, A. Bai, and S. Hira, “An automatic approach based on CNN architecture to detect Covid-19 disease from chest X-ray images,” Applied Intelligence, vol. 51, pp. 2864–2889, 2021.Google ScholarGoogle ScholarCross RefCross Ref
  14. S. Gazzah, R. Bayi, S. Kaloun and O. Bencharef, “A deep learning to distinguish covid-19 from others pneumonia cases,” Intelligent Automation & Soft Computing, vol. 31, no.2, pp. 677–692, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  15. V. Madaan, A. Roy, C. Gupta, P. Agrawal, A. Sharma , “XCOVNet: Chest X-ray Image Classification for COVID-19 Early Detection Using Convolutional Neural Networks,” New Generation Computing, vol. 39, pp. 583–597, 2021.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. Keles, M. B. Keles and A. Keles, “COV19-CNNet and COV19-ResNet: Diagnostic Inference Engines for Early Detection of COVID-19,” Cognitive Computation, early access, 2021.Google ScholarGoogle ScholarCross RefCross Ref
  17. A. Basalamah and S. Rahman, “An optimized cnn model architecture for detecting coronavirus (covid-19) with x-ray images,” Computer Systems Science and Engineering, vol. 40, no.1, pp. 375–388, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  18. M. Alajmi, O. A. Elshakankiry, W. El-Shafai, H. S. El-Sayed, A. I. Sallam , “Smart and automated diagnosis of covid-19 using artificial intelligence techniques,” Intelligent Automation & Soft Computing, vol. 32, no.3, pp. 1403–1413, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  19. Md. Nahiduzzaman, Md. Rabiul Islam and Rakibul Hassan, “ChestX-Ray6: Prediction of multiple diseases including COVID-19 from chest X-ray images using convolutional neural network,” Expert Systems with Applications, vol. 211, pp. 118576, 2023.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. N. Muralidharan, S. Gupta, M. R. Prusty and R. K. Tripathy, “Detection of COVID19 from X-ray images using multiscale Deep Convolutional Neural Network,” Applied Soft Computing, vol. 119, pp. 108610, 2022.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. D. Deb, R. K. Jha, K. Jha and P. S. Tripathi, “A multi model ensemble based deep convolution neural network structure for detection of COVID19,” Biomedical Signal Processing and Control, vol. 71, pp. 103126, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  22. S. Suganyadevi, V. Seethalakshmi, “CVD-HNet: Classifying Pneumonia and COVID-19 in Chest X-ray Images Using Deep Network,” Wireless Personal Communications, vol. 126, pp. 3279–3303, 2022.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. H. Yang, L. Wang, Y. Xu, , “CovidViT: a novel neural network with self-attention mechanism to detect Covid-19 through X-ray images,” International Journal of Machine Learning and Cybernetics, 2022.Google ScholarGoogle Scholar
  24. A. Abugabah, A. Mehmood, A. Ali AL Zubi and L. Sanzogni, "Smart covid-3d-scnn: a novel method to classify x-ray images of covid-19, " Computer Systems Science and Engineering, vol. 41, no.3, pp. 997–1008, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  25. https://data.mendeley.com/datasets/9xkhgts2s6/3 (accessed on May 19, 2022)Google ScholarGoogle Scholar
  26. https://www.kaggle.com/datasets/tawsifurrahman/covid19-radiography-database (accessed on May30, 2022)Google ScholarGoogle Scholar
  27. A. Dosovitskiy , L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, and N. Houlsby, “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,” In Proceedings of the International Conference on Learning Representations (ICLR), 2021, arXiv:2010.11929.Google ScholarGoogle Scholar
  28. Z. Tu, H. Talebi, H Zhnag, F. Yang, P. Milanfar, A. Bovik and Y. Li, “MaxViT: Multi-Axis Vision Transformer,” 2022, arXiv:2204.01697.Google ScholarGoogle Scholar
  29. T. He, Z. Zhang, H. Zhang, Z. Zhang, J. Xie and M. Li, “Bag of Tricks for Image Classification with Convolutional Neural Networks,” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 558-567.Google ScholarGoogle ScholarCross RefCross Ref
  30. A. Howard, M. Sandler, B. Chen, W. Wang , “Searching for MobileNetV3,” In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 1314-1324.Google ScholarGoogle ScholarCross RefCross Ref
  31. S. Mehta and M. Rastegari,“Separable Self-attention for Mobile Vision Transformers,” 2022, arXiv:2206.02680.Google ScholarGoogle Scholar

Index Terms

  1. COViT: Convolutions and ViT based Deep Learning Model for Covid19 and Viral Pneumonia Classification using X-ray Datasets
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            ICCBB '23: Proceedings of the 2023 7th International Conference on Computational Biology and Bioinformatics
            December 2023
            108 pages
            ISBN:9798400716331
            DOI:10.1145/3638569

            Copyright © 2023 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 7 March 2024

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited
          • Article Metrics

            • Downloads (Last 12 months)11
            • Downloads (Last 6 weeks)5

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format