Abstract
Computed tomography (CT) imaging could be convenient for diagnosing various diseases. However, the CT images could be diverse since their resolution and number of slices are determined by the machine and its settings. Conventional deep learning models are hard to tickle such diverse data since the essential requirement of the deep neural network is the consistent shape of the input data in each dimension. A way to overcome this issue is based on the slice-level classifier and aggregating the predictions for each slice to make the final result. However, it lacks slice-wise feature learning, leading to suppressed performance. This paper proposes an effective spatial-slice feature learning (SSFL) to tickle this issue for COVID-19 symptom classification. First, the semantic feature embedding of each slice for a CT scan is extracted by a conventional 2D convolutional neural network (CNN) and followed by using the visual Transformer-based sub-network to deal with feature learning between slices, leading to joint feature representation. Then, an essential slices set algorithm is proposed to automatically select a subset of the CT scan, which could effectively remove the uncertain slices as well as improve the performance of our SSFL. Comprehensive experiments reveal that the proposed SSFL method shows not only excellent performance but also achieves stable detection results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Abbas, A., Abdelsamea, M.M., Gaber, M.M.: Classification of Covid-19 in chest x-ray images using Detrac deep convolutional neural network. Appl. Intell. 51(2), 854–864 (2021)
Arsenos, A., Kollias, D., Kollias, S.: A large imaging database and novel deep neural architecture for Covid-19 diagnosis. In: 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP), pp. 1–5. IEEE (2022)
Chen, G.L., Hsu, C.C., Wu, M.H.: Adaptive distribution learning with statistical hypothesis testing for Covid-19 CT scan classification. In: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pp. 471–479 (2021). https://doi.org/10.1109/ICCVW54120.2021.00057
Chen, J.: Design of accurate classification of Covid-19 disease in x-ray images using deep learning approach. J. ISMAC 2, 132–148 (2021). https://doi.org/10.36548/jismac.2021.2.006
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
Singh, D., Kumar, V., Kaur, M.: Classification of Covid-19 patients from chest CT images using multi-objective differential evolution-based convolutional neural networks. Eur. J. Clin. Microbiol. Infect. Diseases (2020)
Fang, L., Wang, X.: Covid-19 deep classification network based on convolution and deconvolution local enhancement. Comput. Biol. Med. 135, 104588 (2021)
Foret, P., Kleiner, A., Mobahi, H., Neyshabur, B.: Sharpness-aware minimization for efficiently improving generalization. arXiv preprint arXiv:2010.01412 (2021)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Hou, J., Xu, J., Feng, R., Zhang, Y., Shan, F., Shi, W.: CMC-Cov19d: contrastive mixup classification for Covid-19 diagnosis, pp. 454–461 (2021). https://doi.org/10.1109/ICCVW54120.2021.00055
Hussain, E., Hasan, M., Rahman, M.A., Lee, I., Tamanna, T., Parvez, M.Z.: Corodet: a deep learning based classification for Covid-19 detection using chest x-ray images. Chaos Solit. Fract. 142, 110495 (2021)
Ilya, L., Frank, H.: Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)
Ismael, A.M., Şengür, A.: Deep learning approaches for Covid-19 detection based on chest x-ray images. Expert Syst. Appl. 164, 114054 (2021)
Jiang, J., Lin, S.: Covid-19 detection in chest x-ray images using swin-transformer and transformer in transformer. arXiv preprint arXiv:2110.08427 (2021)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Kollias, D., Arsenos, A., Kollias, S.: AI-MIA: Covid-19 detection and severity analysis through medical imaging. arXiv preprint arXiv:2206.04732 (2022)
Kollias, D., Arsenos, A., Soukissian, L., Kollias, S.: MIA-Cov19d: Covid-19 detection through 3-D chest CT image analysis. arXiv preprint arXiv:2106.07524 (2021)
Kollias, D., et al.: Deep transparent prediction through latent representation analysis. arXiv preprint arXiv:2009.07044 (2020)
Kollias, D., Tagaris, A., Stafylopatis, A., Kollias, S., Tagaris, G.: Deep neural architectures for prediction in healthcare. Complex Intell. Syst. 4(2), 119–131 (2018)
Kollias, D., et al.: Transparent adaptation in deep medical image diagnosis. In: TAILOR, pp. 251–267 (2020)
Le Dinh, T., Lee, S.H., Kwon, S.G., Kwon, K.R.: Covid-19 chest x-ray classification and severity assessment using convolutional and transformer neural networks. Appl. Sci. 12(10) (2022). https://doi.org/10.3390/app12104861, https://www.mdpi.com/2076-3417/12/10/4861
Liu, Z., et al.: Swin transformer v2: scaling up capacity and resolution. arXiv preprint arXiv:2111.09883 (2022)
Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. arXiv preprint arXiv:2103.14030 (2021)
Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11976–11986 (2022)
Miron, R., Moisii, C., Dinu, S., Breaban, M.: Covid detection in chest CTs: improving the baseline on Cov19-CT-DB. arXiv preprint arXiv:2107.04808 (2021)
Müller, R., Kornblith, S., Hinton, G.: When does label smoothing help? arXiv preprint arXiv:1906.02629 (2020)
Pathak, Y., Shukla, P.K., Tiwari, A., Stalin, S., Singh, S., Shukla, P.: Deep transfer learning based classification model for Covid-19 disease. IRBM 43(2) (2020)
Tan, M., Le, Q.: Efficientnet: rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, pp. 6105–6114. PMLR (2019)
Tan, W., Liu, J.: A 3D CNN network with BERT for automatic Covid-19 diagnosis from CT-scan images. arXiv preprint arXiv:2106.14403 (2021)
Wightman, R.: PyTorch image models (2019). https://github.com/rwightman/pytorch-image-models. https://doi.org/10.5281/zenodo.4414861
Wikipedia contributors: Mathematical morphology—Wikipedia, the free encyclopedia (2022). https://en.wikipedia.org/w/index.php?title=Mathematical_morphology &oldid=1082436538. Accessed 2 July 2022
Acknowledgement
This study was supported in part by the National Science and Technology Council, Taiwan, under Grants 110-2222-E-006 -012, 111-2221-E-006 -210, 111-2221-E-001-002, 111-2634-F-007-002. We thank to National Center for High-performance Computing (NCHC) for providing computational and storage resources.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Hsu, CC., Tsai, CH., Chen, GL., Ma, SD., Tai, SC. (2023). Spatial-Slice Feature Learning Using Visual Transformer and Essential Slices Selection Module for COVID-19 Detection of CT Scans in the Wild. In: Karlinsky, L., Michaeli, T., Nishino, K. (eds) Computer Vision – ECCV 2022 Workshops. ECCV 2022. Lecture Notes in Computer Science, vol 13807. Springer, Cham. https://doi.org/10.1007/978-3-031-25082-8_42
Download citation
DOI: https://doi.org/10.1007/978-3-031-25082-8_42
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-25081-1
Online ISBN: 978-3-031-25082-8
eBook Packages: Computer ScienceComputer Science (R0)