Abstract
With the outbreak of COVID-19, a large number of relevant studies have emerged in recent years. We propose an automatic COVID-19 diagnosis model based on PVTv2 and the multiple voting mechanism. To accommodate the different dimensions of the image input, we classified the images using the Transformer model, sampled the images in the dataset according to the normal distribution, and fed the sampling results into the PVTv2 model for training. A large number of experiments on the COV19-CT-DB dataset demonstrate the effectiveness of the proposed method. Our method won the sixth place in the (2nd) COVID19 Detection Challenge of ECCV 2022 Workshop: AI-enabled Medical Image Analysis - Digital Pathology & Radiology/COVID19. Our code is publicly available at https://github.com/MenSan233/Team-Dslab-Solution.
L. Zheng and J. Fang—Equal contribution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ahuja, S., Panigrahi, B.K., Dey, N., Rajinikanth, V., Gandhi, T.K.: Deep transfer learning-based automated detection of Covid-19 from lung CT scan slices. Appl. Intell. 51(1), 571–585 (2021)
Arsenos, A., Kollias, D., Kollias, S.: A large imaging database and novel deep neural architecture for Covid-19 diagnosis. In: 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP), pp. 1–5. IEEE (2022)
Bello, I., Zoph, B., Le, Q., Vaswani, A., Shlens, J.: Attention augmented convolutional networks. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (2020)
Cai, L., Gao, J., Zhao, D.: A review of the application of deep learning in medical image classification and segmentation. Ann. Transl. Med. 8(11) (2020)
Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: End-to-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 213–229. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_13
Chaudhary, S., Sadbhawna, S., Jakhetiya, V., Subudhi, B.N., Baid, U., Guntuku, S.C.: Detecting Covid-19 and community acquired pneumonia using chest CT scan images with deep learning. In: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2021 (2021)
Chung, M., et al.: CT imaging features of 2019 novel coronavirus (2019-nCoV). Radiology (2020)
Dosovitskiy, A., et al.: An image is worth \(16 \times 16\) words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
Garg, P., Ranjan, R., Upadhyay, K., Agrawal, M., Deepak, D.: Multi-scale residual network for Covid-19 diagnosis using CT-scans. In: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2021 (2021)
Giri, B., Pandey, S., Shrestha, R., Pokharel, K., Ligler, F.S., Neupane, B.B.: Review of analytical performance of Covid-19 detection methods. Anal. Bioanal. Chem. 413(1), 35–48 (2021)
Han, K., Xiao, A., Wu, E., Guo, J., Xu, C., Wang, Y.: Transformer in transformer. In: Advances in Neural Information Processing Systems, vol. 34, pp. 15908–15919 (2021)
He, X., et al.: Automated model design and benchmarking of 3D deep learning models for Covid-19 detection with chest CT scans. arXiv preprint arXiv:2101.05442 (2021)
Heidarian, S., et al.: Covid-fact: a fully-automated capsule network-based framework for identification of Covid-19 cases from chest CT scans. Front. Artif. Intell. 4, 598932 (2021)
Huang, Z., Wang, X., Huang, L., Huang, C., Wei, Y., Liu, W.: CCNet: criss-cross attention for semantic segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 603–612 (2019)
Khadidos, A., Khadidos, A.O., Kannan, S., Natarajan, Y., Mohanty, S.N., Tsaramirsis, G.: Analysis of Covid-19 infections on a CT image using deepsense model. Front. Public Health 8, 599550 (2020)
Kollias, D., Arsenos, A., Kollias, S.: AI-MIA: Covid-19 detection & severity analysis through medical imaging. arXiv preprint arXiv:2206.04732 (2022)
Kollias, D., Arsenos, A., Soukissian, L., Kollias, S.: MIA-COV19D: Covid-19 detection through 3-D chest CT image analysis. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 537–544 (2021)
Kollias, D., et al.: Deep transparent prediction through latent representation analysis. arXiv preprint arXiv:2009.07044 (2020)
Kollias, D., Tagaris, A., Stafylopatis, A., Kollias, S., Tagaris, G.: Deep neural architectures for prediction in healthcare. Complex Intell. Syst. 4(2), 119–131 (2018)
Kollias, D., et al.: Transparent adaptation in deep medical image diagnosis. In: Heintz, F., Milano, M., O’Sullivan, B. (eds.) TAILOR 2020. LNCS (LNAI), vol. 12641, pp. 251–267. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-73959-1_22
Li, L., et al.: Artificial intelligence distinguishes Covid-19 from community acquired pneumonia on chest CT. Radiology (2020)
Li, Y., Xia, L.: Coronavirus disease 2019 (Covid-19): role of chest CT in diagnosis and management. Am. J. Roentgenol. 214(6), 1280–1286 (2020)
Nair, R., Alhudhaif, A., Koundal, D., Doewes, R.I., Sharma, P.: Deep learning-based Covid-19 detection system using pulmonary CT scans. Turk. J. Electr. Eng. Comput. Sci. 29(8), 2716–2727 (2021)
Panahi, A.H., Rafiei, A., Rezaee, A.: FCOD: fast Covid-19 detector based on deep learning techniques. Inform. Med. Unlocked 22, 100506 (2021)
Rahimzadeh, M., Attar, A., Sakhaei, S.M.: A fully automated deep learning-based network for detecting Covid-19 from a new and large lung CT scan dataset. Biomed. Signal Process. Control 68, 102588 (2021)
Ramachandran, P., Parmar, N., Vaswani, A., Bello, I., Levskaya, A., Shlens, J.: Stand-alone self-attention in vision models. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
Rehman, A., Saba, T., Tariq, U., Ayesha, N.: Deep learning-based Covid-19 detection using CT and X-ray images: current analytics and comparisons. IT Prof. 23(3), 63–68 (2021)
Son, T., Kang, J., Kim, N., Cho, S., Kwak, S.: URIE: universal image enhancement for visual recognition in the wild. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12354, pp. 749–765. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58545-7_43
Song, Y., et al.: Deep learning enables accurate diagnosis of novel coronavirus (Covid-19) with CT images. IEEE/ACM Trans. Comput. Biol. Bioinf. 18(6), 2775–2780 (2021)
Touvron, H., Cord, M., Douze, M., Massa, F., Sablayrolles, A., Jégou, H.: Training data-efficient image transformers & distillation through attention. In: International Conference on Machine Learning, pp. 10347–10357. PMLR (2021)
Wang, W., et al.: Pyramid vision transformer: a versatile backbone for dense prediction without convolutions. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 568–578 (2021)
Wang, W., et al.: PVT v2: improved baselines with pyramid vision transformer. Comput. Vis. Media 8(3), 415–424 (2022)
Wang, X., Girshick, R., Gupta, A., He, K.: Non-local neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7794–7803 (2018)
Wang, X., et al.: A weakly-supervised framework for Covid-19 classification and lesion localization from chest CT. IEEE Trans. Med. Imaging 39(8), 2615–2625 (2020)
Yuan, L., et al.: Tokens-to-token ViT: training vision transformers from scratch on imagenet. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 558–567 (2021)
Zhu, X., Su, W., Lu, L., Li, B., Wang, X., Dai, J.: Deformable DETR: deformable transformers for end-to-end object detection. arXiv preprint arXiv:2010.04159 (2020)
Acknowledgements
This work was partially supported by the National Key R &D Program of China under Grant No. 2020YFC0832500, National Natural Science Foundation of China under Grant No. 61402210, Ministry of Education-China Mobile Research Foundation under Grant No. MCM20170206, Fundamental Research Funds for the Central Universities under Grant No. lzujbky-2021-sp43, lzujbky-2019-kb51 and lzujbky-2018-k12, Science and Technology Plan of Qinghai Province under Grant No.2020-GX-164. We also acknowledge Mr. Rui Zhao for his contribution to this paper.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zheng, L. et al. (2023). PVT-COV19D: COVID-19 Detection Through Medical Image Classification Based on Pyramid Vision Transformer. In: Karlinsky, L., Michaeli, T., Nishino, K. (eds) Computer Vision – ECCV 2022 Workshops. ECCV 2022. Lecture Notes in Computer Science, vol 13807. Springer, Cham. https://doi.org/10.1007/978-3-031-25082-8_35
Download citation
DOI: https://doi.org/10.1007/978-3-031-25082-8_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-25081-1
Online ISBN: 978-3-031-25082-8
eBook Packages: Computer ScienceComputer Science (R0)