Skip to main content

Spatial-Slice Feature Learning Using Visual Transformer and Essential Slices Selection Module for COVID-19 Detection of CT Scans in the Wild

  • Conference paper
  • First Online:
Computer Vision – ECCV 2022 Workshops (ECCV 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13807))

Included in the following conference series:

  • 2227 Accesses

Abstract

Computed tomography (CT) imaging could be convenient for diagnosing various diseases. However, the CT images could be diverse since their resolution and number of slices are determined by the machine and its settings. Conventional deep learning models are hard to tickle such diverse data since the essential requirement of the deep neural network is the consistent shape of the input data in each dimension. A way to overcome this issue is based on the slice-level classifier and aggregating the predictions for each slice to make the final result. However, it lacks slice-wise feature learning, leading to suppressed performance. This paper proposes an effective spatial-slice feature learning (SSFL) to tickle this issue for COVID-19 symptom classification. First, the semantic feature embedding of each slice for a CT scan is extracted by a conventional 2D convolutional neural network (CNN) and followed by using the visual Transformer-based sub-network to deal with feature learning between slices, leading to joint feature representation. Then, an essential slices set algorithm is proposed to automatically select a subset of the CT scan, which could effectively remove the uncertain slices as well as improve the performance of our SSFL. Comprehensive experiments reveal that the proposed SSFL method shows not only excellent performance but also achieves stable detection results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Abbas, A., Abdelsamea, M.M., Gaber, M.M.: Classification of Covid-19 in chest x-ray images using Detrac deep convolutional neural network. Appl. Intell. 51(2), 854–864 (2021)

    Article  Google Scholar 

  2. Arsenos, A., Kollias, D., Kollias, S.: A large imaging database and novel deep neural architecture for Covid-19 diagnosis. In: 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP), pp. 1–5. IEEE (2022)

    Google Scholar 

  3. Chen, G.L., Hsu, C.C., Wu, M.H.: Adaptive distribution learning with statistical hypothesis testing for Covid-19 CT scan classification. In: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pp. 471–479 (2021). https://doi.org/10.1109/ICCVW54120.2021.00057

  4. Chen, J.: Design of accurate classification of Covid-19 disease in x-ray images using deep learning approach. J. ISMAC 2, 132–148 (2021). https://doi.org/10.36548/jismac.2021.2.006

  5. Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)

    Google Scholar 

  6. Singh, D., Kumar, V., Kaur, M.: Classification of Covid-19 patients from chest CT images using multi-objective differential evolution-based convolutional neural networks. Eur. J. Clin. Microbiol. Infect. Diseases (2020)

    Google Scholar 

  7. Fang, L., Wang, X.: Covid-19 deep classification network based on convolution and deconvolution local enhancement. Comput. Biol. Med. 135, 104588 (2021)

    Article  Google Scholar 

  8. Foret, P., Kleiner, A., Mobahi, H., Neyshabur, B.: Sharpness-aware minimization for efficiently improving generalization. arXiv preprint arXiv:2010.01412 (2021)

  9. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

    Google Scholar 

  10. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  11. Hou, J., Xu, J., Feng, R., Zhang, Y., Shan, F., Shi, W.: CMC-Cov19d: contrastive mixup classification for Covid-19 diagnosis, pp. 454–461 (2021). https://doi.org/10.1109/ICCVW54120.2021.00055

  12. Hussain, E., Hasan, M., Rahman, M.A., Lee, I., Tamanna, T., Parvez, M.Z.: Corodet: a deep learning based classification for Covid-19 detection using chest x-ray images. Chaos Solit. Fract. 142, 110495 (2021)

    Google Scholar 

  13. Ilya, L., Frank, H.: Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)

  14. Ismael, A.M., Şengür, A.: Deep learning approaches for Covid-19 detection based on chest x-ray images. Expert Syst. Appl. 164, 114054 (2021)

    Article  Google Scholar 

  15. Jiang, J., Lin, S.: Covid-19 detection in chest x-ray images using swin-transformer and transformer in transformer. arXiv preprint arXiv:2110.08427 (2021)

  16. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

  17. Kollias, D., Arsenos, A., Kollias, S.: AI-MIA: Covid-19 detection and severity analysis through medical imaging. arXiv preprint arXiv:2206.04732 (2022)

  18. Kollias, D., Arsenos, A., Soukissian, L., Kollias, S.: MIA-Cov19d: Covid-19 detection through 3-D chest CT image analysis. arXiv preprint arXiv:2106.07524 (2021)

  19. Kollias, D., et al.: Deep transparent prediction through latent representation analysis. arXiv preprint arXiv:2009.07044 (2020)

  20. Kollias, D., Tagaris, A., Stafylopatis, A., Kollias, S., Tagaris, G.: Deep neural architectures for prediction in healthcare. Complex Intell. Syst. 4(2), 119–131 (2018)

    Article  Google Scholar 

  21. Kollias, D., et al.: Transparent adaptation in deep medical image diagnosis. In: TAILOR, pp. 251–267 (2020)

    Google Scholar 

  22. Le Dinh, T., Lee, S.H., Kwon, S.G., Kwon, K.R.: Covid-19 chest x-ray classification and severity assessment using convolutional and transformer neural networks. Appl. Sci. 12(10) (2022). https://doi.org/10.3390/app12104861, https://www.mdpi.com/2076-3417/12/10/4861

  23. Liu, Z., et al.: Swin transformer v2: scaling up capacity and resolution. arXiv preprint arXiv:2111.09883 (2022)

  24. Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. arXiv preprint arXiv:2103.14030 (2021)

  25. Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11976–11986 (2022)

    Google Scholar 

  26. Miron, R., Moisii, C., Dinu, S., Breaban, M.: Covid detection in chest CTs: improving the baseline on Cov19-CT-DB. arXiv preprint arXiv:2107.04808 (2021)

  27. Müller, R., Kornblith, S., Hinton, G.: When does label smoothing help? arXiv preprint arXiv:1906.02629 (2020)

  28. Pathak, Y., Shukla, P.K., Tiwari, A., Stalin, S., Singh, S., Shukla, P.: Deep transfer learning based classification model for Covid-19 disease. IRBM 43(2) (2020)

    Google Scholar 

  29. Tan, M., Le, Q.: Efficientnet: rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, pp. 6105–6114. PMLR (2019)

    Google Scholar 

  30. Tan, W., Liu, J.: A 3D CNN network with BERT for automatic Covid-19 diagnosis from CT-scan images. arXiv preprint arXiv:2106.14403 (2021)

  31. Wightman, R.: PyTorch image models (2019). https://github.com/rwightman/pytorch-image-models. https://doi.org/10.5281/zenodo.4414861

  32. Wikipedia contributors: Mathematical morphology—Wikipedia, the free encyclopedia (2022). https://en.wikipedia.org/w/index.php?title=Mathematical_morphology &oldid=1082436538. Accessed 2 July 2022

Download references

Acknowledgement

This study was supported in part by the National Science and Technology Council, Taiwan, under Grants 110-2222-E-006 -012, 111-2221-E-006 -210, 111-2221-E-001-002, 111-2634-F-007-002. We thank to National Center for High-performance Computing (NCHC) for providing computational and storage resources.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chih-Chung Hsu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hsu, CC., Tsai, CH., Chen, GL., Ma, SD., Tai, SC. (2023). Spatial-Slice Feature Learning Using Visual Transformer and Essential Slices Selection Module for COVID-19 Detection of CT Scans in the Wild. In: Karlinsky, L., Michaeli, T., Nishino, K. (eds) Computer Vision – ECCV 2022 Workshops. ECCV 2022. Lecture Notes in Computer Science, vol 13807. Springer, Cham. https://doi.org/10.1007/978-3-031-25082-8_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-25082-8_42

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-25081-1

  • Online ISBN: 978-3-031-25082-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics