Skip to main content

PVT-COV19D: COVID-19 Detection Through Medical Image Classification Based on Pyramid Vision Transformer

  • Conference paper
  • First Online:
Computer Vision – ECCV 2022 Workshops (ECCV 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13807))

Included in the following conference series:

  • 2237 Accesses

Abstract

With the outbreak of COVID-19, a large number of relevant studies have emerged in recent years. We propose an automatic COVID-19 diagnosis model based on PVTv2 and the multiple voting mechanism. To accommodate the different dimensions of the image input, we classified the images using the Transformer model, sampled the images in the dataset according to the normal distribution, and fed the sampling results into the PVTv2 model for training. A large number of experiments on the COV19-CT-DB dataset demonstrate the effectiveness of the proposed method. Our method won the sixth place in the (2nd) COVID19 Detection Challenge of ECCV 2022 Workshop: AI-enabled Medical Image Analysis - Digital Pathology & Radiology/COVID19. Our code is publicly available at https://github.com/MenSan233/Team-Dslab-Solution.

L. Zheng and J. Fang—Equal contribution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Ahuja, S., Panigrahi, B.K., Dey, N., Rajinikanth, V., Gandhi, T.K.: Deep transfer learning-based automated detection of Covid-19 from lung CT scan slices. Appl. Intell. 51(1), 571–585 (2021)

    Article  Google Scholar 

  2. Arsenos, A., Kollias, D., Kollias, S.: A large imaging database and novel deep neural architecture for Covid-19 diagnosis. In: 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP), pp. 1–5. IEEE (2022)

    Google Scholar 

  3. Bello, I., Zoph, B., Le, Q., Vaswani, A., Shlens, J.: Attention augmented convolutional networks. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (2020)

    Google Scholar 

  4. Cai, L., Gao, J., Zhao, D.: A review of the application of deep learning in medical image classification and segmentation. Ann. Transl. Med. 8(11) (2020)

    Google Scholar 

  5. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: End-to-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 213–229. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_13

    Chapter  Google Scholar 

  6. Chaudhary, S., Sadbhawna, S., Jakhetiya, V., Subudhi, B.N., Baid, U., Guntuku, S.C.: Detecting Covid-19 and community acquired pneumonia using chest CT scan images with deep learning. In: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2021 (2021)

    Google Scholar 

  7. Chung, M., et al.: CT imaging features of 2019 novel coronavirus (2019-nCoV). Radiology (2020)

    Google Scholar 

  8. Dosovitskiy, A., et al.: An image is worth \(16 \times 16\) words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)

  9. Garg, P., Ranjan, R., Upadhyay, K., Agrawal, M., Deepak, D.: Multi-scale residual network for Covid-19 diagnosis using CT-scans. In: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2021 (2021)

    Google Scholar 

  10. Giri, B., Pandey, S., Shrestha, R., Pokharel, K., Ligler, F.S., Neupane, B.B.: Review of analytical performance of Covid-19 detection methods. Anal. Bioanal. Chem. 413(1), 35–48 (2021)

    Article  Google Scholar 

  11. Han, K., Xiao, A., Wu, E., Guo, J., Xu, C., Wang, Y.: Transformer in transformer. In: Advances in Neural Information Processing Systems, vol. 34, pp. 15908–15919 (2021)

    Google Scholar 

  12. He, X., et al.: Automated model design and benchmarking of 3D deep learning models for Covid-19 detection with chest CT scans. arXiv preprint arXiv:2101.05442 (2021)

  13. Heidarian, S., et al.: Covid-fact: a fully-automated capsule network-based framework for identification of Covid-19 cases from chest CT scans. Front. Artif. Intell. 4, 598932 (2021)

    Article  Google Scholar 

  14. Huang, Z., Wang, X., Huang, L., Huang, C., Wei, Y., Liu, W.: CCNet: criss-cross attention for semantic segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 603–612 (2019)

    Google Scholar 

  15. Khadidos, A., Khadidos, A.O., Kannan, S., Natarajan, Y., Mohanty, S.N., Tsaramirsis, G.: Analysis of Covid-19 infections on a CT image using deepsense model. Front. Public Health 8, 599550 (2020)

    Article  Google Scholar 

  16. Kollias, D., Arsenos, A., Kollias, S.: AI-MIA: Covid-19 detection & severity analysis through medical imaging. arXiv preprint arXiv:2206.04732 (2022)

  17. Kollias, D., Arsenos, A., Soukissian, L., Kollias, S.: MIA-COV19D: Covid-19 detection through 3-D chest CT image analysis. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 537–544 (2021)

    Google Scholar 

  18. Kollias, D., et al.: Deep transparent prediction through latent representation analysis. arXiv preprint arXiv:2009.07044 (2020)

  19. Kollias, D., Tagaris, A., Stafylopatis, A., Kollias, S., Tagaris, G.: Deep neural architectures for prediction in healthcare. Complex Intell. Syst. 4(2), 119–131 (2018)

    Article  Google Scholar 

  20. Kollias, D., et al.: Transparent adaptation in deep medical image diagnosis. In: Heintz, F., Milano, M., O’Sullivan, B. (eds.) TAILOR 2020. LNCS (LNAI), vol. 12641, pp. 251–267. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-73959-1_22

    Chapter  Google Scholar 

  21. Li, L., et al.: Artificial intelligence distinguishes Covid-19 from community acquired pneumonia on chest CT. Radiology (2020)

    Google Scholar 

  22. Li, Y., Xia, L.: Coronavirus disease 2019 (Covid-19): role of chest CT in diagnosis and management. Am. J. Roentgenol. 214(6), 1280–1286 (2020)

    Article  Google Scholar 

  23. Nair, R., Alhudhaif, A., Koundal, D., Doewes, R.I., Sharma, P.: Deep learning-based Covid-19 detection system using pulmonary CT scans. Turk. J. Electr. Eng. Comput. Sci. 29(8), 2716–2727 (2021)

    Article  Google Scholar 

  24. Panahi, A.H., Rafiei, A., Rezaee, A.: FCOD: fast Covid-19 detector based on deep learning techniques. Inform. Med. Unlocked 22, 100506 (2021)

    Article  Google Scholar 

  25. Rahimzadeh, M., Attar, A., Sakhaei, S.M.: A fully automated deep learning-based network for detecting Covid-19 from a new and large lung CT scan dataset. Biomed. Signal Process. Control 68, 102588 (2021)

    Article  Google Scholar 

  26. Ramachandran, P., Parmar, N., Vaswani, A., Bello, I., Levskaya, A., Shlens, J.: Stand-alone self-attention in vision models. In: Advances in Neural Information Processing Systems, vol. 32 (2019)

    Google Scholar 

  27. Rehman, A., Saba, T., Tariq, U., Ayesha, N.: Deep learning-based Covid-19 detection using CT and X-ray images: current analytics and comparisons. IT Prof. 23(3), 63–68 (2021)

    Article  Google Scholar 

  28. Son, T., Kang, J., Kim, N., Cho, S., Kwak, S.: URIE: universal image enhancement for visual recognition in the wild. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12354, pp. 749–765. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58545-7_43

    Chapter  Google Scholar 

  29. Song, Y., et al.: Deep learning enables accurate diagnosis of novel coronavirus (Covid-19) with CT images. IEEE/ACM Trans. Comput. Biol. Bioinf. 18(6), 2775–2780 (2021)

    Article  Google Scholar 

  30. Touvron, H., Cord, M., Douze, M., Massa, F., Sablayrolles, A., Jégou, H.: Training data-efficient image transformers & distillation through attention. In: International Conference on Machine Learning, pp. 10347–10357. PMLR (2021)

    Google Scholar 

  31. Wang, W., et al.: Pyramid vision transformer: a versatile backbone for dense prediction without convolutions. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 568–578 (2021)

    Google Scholar 

  32. Wang, W., et al.: PVT v2: improved baselines with pyramid vision transformer. Comput. Vis. Media 8(3), 415–424 (2022)

    Article  MathSciNet  Google Scholar 

  33. Wang, X., Girshick, R., Gupta, A., He, K.: Non-local neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7794–7803 (2018)

    Google Scholar 

  34. Wang, X., et al.: A weakly-supervised framework for Covid-19 classification and lesion localization from chest CT. IEEE Trans. Med. Imaging 39(8), 2615–2625 (2020)

    Article  Google Scholar 

  35. Yuan, L., et al.: Tokens-to-token ViT: training vision transformers from scratch on imagenet. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 558–567 (2021)

    Google Scholar 

  36. Zhu, X., Su, W., Lu, L., Li, B., Wang, X., Dai, J.: Deformable DETR: deformable transformers for end-to-end object detection. arXiv preprint arXiv:2010.04159 (2020)

Download references

Acknowledgements

This work was partially supported by the National Key R &D Program of China under Grant No. 2020YFC0832500, National Natural Science Foundation of China under Grant No. 61402210, Ministry of Education-China Mobile Research Foundation under Grant No. MCM20170206, Fundamental Research Funds for the Central Universities under Grant No. lzujbky-2021-sp43, lzujbky-2019-kb51 and lzujbky-2018-k12, Science and Technology Plan of Qinghai Province under Grant No.2020-GX-164. We also acknowledge Mr. Rui Zhao for his contribution to this paper.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Rui Zhou or Zhaoyan Yan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zheng, L. et al. (2023). PVT-COV19D: COVID-19 Detection Through Medical Image Classification Based on Pyramid Vision Transformer. In: Karlinsky, L., Michaeli, T., Nishino, K. (eds) Computer Vision – ECCV 2022 Workshops. ECCV 2022. Lecture Notes in Computer Science, vol 13807. Springer, Cham. https://doi.org/10.1007/978-3-031-25082-8_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-25082-8_35

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-25081-1

  • Online ISBN: 978-3-031-25082-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics