Spatial-Slice Feature Learning Using Visual Transformer and Essential Slices Selection Module for COVID-19 Detection of CT Scans in the Wild

Hsu, Chih-Chung; Tsai, Chi-Han; Chen, Guan-Lin; Ma, Sin-Di; Tai, Shen-Chieh

doi:10.1007/978-3-031-25082-8_42

Chih-Chung Hsu ORCID: orcid.org/0000-0002-2083-4438¹⁰,
Chi-Han Tsai¹⁰,
Guan-Lin Chen¹⁰,
Sin-Di Ma¹⁰ &
…
Shen-Chieh Tai¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13807))

Included in the following conference series:

European Conference on Computer Vision

2227 Accesses

Abstract

Computed tomography (CT) imaging could be convenient for diagnosing various diseases. However, the CT images could be diverse since their resolution and number of slices are determined by the machine and its settings. Conventional deep learning models are hard to tickle such diverse data since the essential requirement of the deep neural network is the consistent shape of the input data in each dimension. A way to overcome this issue is based on the slice-level classifier and aggregating the predictions for each slice to make the final result. However, it lacks slice-wise feature learning, leading to suppressed performance. This paper proposes an effective spatial-slice feature learning (SSFL) to tickle this issue for COVID-19 symptom classification. First, the semantic feature embedding of each slice for a CT scan is extracted by a conventional 2D convolutional neural network (CNN) and followed by using the visual Transformer-based sub-network to deal with feature learning between slices, leading to joint feature representation. Then, an essential slices set algorithm is proposed to automatically select a subset of the CT scan, which could effectively remove the uncertain slices as well as improve the performance of our SSFL. Comprehensive experiments reveal that the proposed SSFL method shows not only excellent performance but also achieves stable detection results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Automated Diagnosis of COVID-19 from CT Scans Based on Concatenation of Mobilenetv2 and ResNet50 Features

Multi-Feature Vision Transformer via Self-Supervised Representation Learning for Improvement of COVID-19 Diagnosis

A multi-label classification model for full slice brain computerised tomography image

Article Open access 18 November 2020

References

Abbas, A., Abdelsamea, M.M., Gaber, M.M.: Classification of Covid-19 in chest x-ray images using Detrac deep convolutional neural network. Appl. Intell. 51(2), 854–864 (2021)
Article Google Scholar
Arsenos, A., Kollias, D., Kollias, S.: A large imaging database and novel deep neural architecture for Covid-19 diagnosis. In: 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP), pp. 1–5. IEEE (2022)
Google Scholar
Chen, G.L., Hsu, C.C., Wu, M.H.: Adaptive distribution learning with statistical hypothesis testing for Covid-19 CT scan classification. In: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pp. 471–479 (2021). https://doi.org/10.1109/ICCVW54120.2021.00057
Chen, J.: Design of accurate classification of Covid-19 disease in x-ray images using deep learning approach. J. ISMAC 2, 132–148 (2021). https://doi.org/10.36548/jismac.2021.2.006
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
Google Scholar
Singh, D., Kumar, V., Kaur, M.: Classification of Covid-19 patients from chest CT images using multi-objective differential evolution-based convolutional neural networks. Eur. J. Clin. Microbiol. Infect. Diseases (2020)
Google Scholar
Fang, L., Wang, X.: Covid-19 deep classification network based on convolution and deconvolution local enhancement. Comput. Biol. Med. 135, 104588 (2021)
Article Google Scholar
Foret, P., Kleiner, A., Mobahi, H., Neyshabur, B.: Sharpness-aware minimization for efficiently improving generalization. arXiv preprint arXiv:2010.01412 (2021)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Hou, J., Xu, J., Feng, R., Zhang, Y., Shan, F., Shi, W.: CMC-Cov19d: contrastive mixup classification for Covid-19 diagnosis, pp. 454–461 (2021). https://doi.org/10.1109/ICCVW54120.2021.00055
Hussain, E., Hasan, M., Rahman, M.A., Lee, I., Tamanna, T., Parvez, M.Z.: Corodet: a deep learning based classification for Covid-19 detection using chest x-ray images. Chaos Solit. Fract. 142, 110495 (2021)
Google Scholar
Ilya, L., Frank, H.: Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)
Ismael, A.M., Şengür, A.: Deep learning approaches for Covid-19 detection based on chest x-ray images. Expert Syst. Appl. 164, 114054 (2021)
Article Google Scholar
Jiang, J., Lin, S.: Covid-19 detection in chest x-ray images using swin-transformer and transformer in transformer. arXiv preprint arXiv:2110.08427 (2021)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Kollias, D., Arsenos, A., Kollias, S.: AI-MIA: Covid-19 detection and severity analysis through medical imaging. arXiv preprint arXiv:2206.04732 (2022)
Kollias, D., Arsenos, A., Soukissian, L., Kollias, S.: MIA-Cov19d: Covid-19 detection through 3-D chest CT image analysis. arXiv preprint arXiv:2106.07524 (2021)
Kollias, D., et al.: Deep transparent prediction through latent representation analysis. arXiv preprint arXiv:2009.07044 (2020)
Kollias, D., Tagaris, A., Stafylopatis, A., Kollias, S., Tagaris, G.: Deep neural architectures for prediction in healthcare. Complex Intell. Syst. 4(2), 119–131 (2018)
Article Google Scholar
Kollias, D., et al.: Transparent adaptation in deep medical image diagnosis. In: TAILOR, pp. 251–267 (2020)
Google Scholar
Le Dinh, T., Lee, S.H., Kwon, S.G., Kwon, K.R.: Covid-19 chest x-ray classification and severity assessment using convolutional and transformer neural networks. Appl. Sci. 12(10) (2022). https://doi.org/10.3390/app12104861, https://www.mdpi.com/2076-3417/12/10/4861
Liu, Z., et al.: Swin transformer v2: scaling up capacity and resolution. arXiv preprint arXiv:2111.09883 (2022)
Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. arXiv preprint arXiv:2103.14030 (2021)
Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11976–11986 (2022)
Google Scholar
Miron, R., Moisii, C., Dinu, S., Breaban, M.: Covid detection in chest CTs: improving the baseline on Cov19-CT-DB. arXiv preprint arXiv:2107.04808 (2021)
Müller, R., Kornblith, S., Hinton, G.: When does label smoothing help? arXiv preprint arXiv:1906.02629 (2020)
Pathak, Y., Shukla, P.K., Tiwari, A., Stalin, S., Singh, S., Shukla, P.: Deep transfer learning based classification model for Covid-19 disease. IRBM 43(2) (2020)
Google Scholar
Tan, M., Le, Q.: Efficientnet: rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, pp. 6105–6114. PMLR (2019)
Google Scholar
Tan, W., Liu, J.: A 3D CNN network with BERT for automatic Covid-19 diagnosis from CT-scan images. arXiv preprint arXiv:2106.14403 (2021)
Wightman, R.: PyTorch image models (2019). https://github.com/rwightman/pytorch-image-models. https://doi.org/10.5281/zenodo.4414861
Wikipedia contributors: Mathematical morphology—Wikipedia, the free encyclopedia (2022). https://en.wikipedia.org/w/index.php?title=Mathematical_morphology &oldid=1082436538. Accessed 2 July 2022

Download references

Acknowledgement

This study was supported in part by the National Science and Technology Council, Taiwan, under Grants 110-2222-E-006 -012, 111-2221-E-006 -210, 111-2221-E-001-002, 111-2634-F-007-002. We thank to National Center for High-performance Computing (NCHC) for providing computational and storage resources.

Author information

Authors and Affiliations

Institute of Data Science, National Cheng Kung University, No. 1, University Road, Tainan City, Taiwan
Chih-Chung Hsu, Chi-Han Tsai, Guan-Lin Chen, Sin-Di Ma & Shen-Chieh Tai

Authors

Chih-Chung Hsu
View author publications
You can also search for this author in PubMed Google Scholar
Chi-Han Tsai
View author publications
You can also search for this author in PubMed Google Scholar
Guan-Lin Chen
View author publications
You can also search for this author in PubMed Google Scholar
Sin-Di Ma
View author publications
You can also search for this author in PubMed Google Scholar
Shen-Chieh Tai
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chih-Chung Hsu .

Editor information

Editors and Affiliations

IBM Research - MIT-IBM Watson AI Lab, Massachusetts, USA
Leonid Karlinsky
Technion – Israel Institute of Technology, Haifa, Israel
Tomer Michaeli
Kyoto University, Kyoto, Japan
Ko Nishino

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hsu, CC., Tsai, CH., Chen, GL., Ma, SD., Tai, SC. (2023). Spatial-Slice Feature Learning Using Visual Transformer and Essential Slices Selection Module for COVID-19 Detection of CT Scans in the Wild. In: Karlinsky, L., Michaeli, T., Nishino, K. (eds) Computer Vision – ECCV 2022 Workshops. ECCV 2022. Lecture Notes in Computer Science, vol 13807. Springer, Cham. https://doi.org/10.1007/978-3-031-25082-8_42

Download citation

DOI: https://doi.org/10.1007/978-3-031-25082-8_42
Published: 12 February 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-25081-1
Online ISBN: 978-3-031-25082-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Spatial-Slice Feature Learning Using Visual Transformer and Essential Slices Selection Module for COVID-19 Detection of CT Scans in the Wild