Abstract
Accurate and early differential diagnosis of parkinsonism (idiopathic Parkinson’s disease, multiple system atrophy, and progressive supranuclear palsy) is crucial for informing prognosis and determining treatment strategies. Current automated differential diagnosis methods for \(^{18}\)F-fluorodeoxyglucose (\(^{18}\)F-FDG) positron emission tomography (PET) scans, such as convolutional neural networks (CNNs), often focus on local brain regions and do not explicitly model the complex metabolic interactions between distinct brain regions. These interactions, as reflected in FDG PET images, are keys for the differential diagnosis of parkinsonism. Vision transformer (ViT) models are promising in modeling such long-range dependencies, but they may overlook the local metabolic alternations and have not been widely adapted for 3D medical image classification due to data limitations. Therefore, we propose a novel metabolism-aware vision transformer (MetaViT), which uses self-attention and convolution to explicitly characterize both global and local metabolic interactions between interrelated brain regions. A masked image reconstruction task is introduced to guide the MetaViT model to focus on disease-related brain regions, addressing the scarcity of 3D medical imaging data and improving the trustworthiness and interpretability of the resulting model. The proposed framework is evaluated on a 3D FDG PET imaging dataset with 902 subjects, achieving a high accuracy of 97.7% in the differential diagnosis of parkinsonism and outperforming several state-of-the-art CNN and ViT-based approaches.
This work was supported by Alibaba Group through Alibaba Research Intern Program.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Braak, H., Rüb, U., Gai, W., Del Tredici, K.: Idiopathic Parkinson’s disease: possible routes by which vulnerable neuronal types may be subject to neuroinvasion by an unknown pathogen. J. Neural Transm. 110(5), 517–536 (2003)
Dosovitskiy, A., et al.: An image is worth 16\(\times \)16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
Eckert, T., et al.: Differentiation of idiopathic Parkinson’s disease, multiple system atrophy, progressive supranuclear palsy, and healthy controls using magnetization transfer imaging. Neuroimage 21(1), 229–235 (2004)
Guo, M.H., Lu, C.Z., Liu, Z.N., Cheng, M.M., Hu, S.M.: Visual attention network. arXiv preprint arXiv:2202.09741 (2022)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Hughes, A.J., Daniel, S.E., Ben-Shlomo, Y., Lees, A.J.: The accuracy of diagnosis of parkinsonian syndromes in a specialist movement disorder service. Brain 125(4), 861–870 (2002)
Hughes, A.J., Daniel, S.E., Kilford, L., Lees, A.J.: Accuracy of clinical diagnosis of idiopathic Parkinson’s disease: a clinico-pathological study of 100 cases. J. Neurol. Neurosurg. Psychiatry 55(3), 181–184 (1992)
Jumper, J., et al.: Highly accurate protein structure prediction with AlphaFold. Nature 596(7873), 583–589 (2021)
Kish, S.J., Shannak, K., Hornykiewicz, O.: Uneven pattern of dopamine loss in the striatum of patients with idiopathic Parkinson’s disease. N. Engl. J. Med. 318(14), 876–880 (1988)
Lee, S.H., Lee, S., Song, B.C.: Vision transformer for small-size datasets. arXiv preprint arXiv:2112.13492 (2021)
Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012–10022 (2021)
Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)
Matthews, D.C., et al.: FDG pet Parkinson’s disease-related pattern as a biomarker for clinical trials in early stage disease. NeuroImage: Clin. 20, 572–579 (2018)
Pagan, F.L.: Improving outcomes through early diagnosis of Parkinson’s disease. Am. J. Manag. Care 18(7), S176 (2012)
Raghu, M., Unterthiner, T., Kornblith, S., Zhang, C., Dosovitskiy, A.: Do vision transformers see like convolutional neural networks? Adv. Neural. Inf. Process. Syst. 34, 12116–12128 (2021)
Spetsieris, P.G., Ma, Y., Dhawan, V., Eidelberg, D.: Differential diagnosis of parkinsonian syndromes using PCA-based functional imaging features. Neuroimage 45(4), 1241–1252 (2009)
Srinivas, S., Fleuret, F.: Full-gradient representation for neural network visualization. In: Advances in Neural Information Processing Systems (NeurIPS) (2019)
Tang, C.C., et al.: Differential diagnosis of parkinsonism: a metabolic imaging study using pattern analysis. Lancet Neurol. 9(2), 149–158 (2010)
Wu, P., et al.: Differential diagnosis of parkinsonism based on deep metabolic imaging indices. J. Nucl. Med. 63, 1741–1747 (2022)
Xie, Q., Luong, M.T., Hovy, E., Le, Q.V.: Self-training with noisy student improves ImageNet classification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10687–10698 (2020)
Yuan, L., et al.: Tokens-to-token ViT: training vision transformers from scratch on imagenet. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 558–567 (2021)
Zhang, L., Wen, Y.: A transformer-based framework for automatic COVID19 diagnosis in chest CTs. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 513–518 (2021)
Zhao, Y., et al.: A 3D deep residual convolutional neural network for differential diagnosis of parkinsonian syndromes on 18 F-FDG pet images. In: 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 3531–3534 (2019)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhao, L. et al. (2023). MetaViT: Metabolism-Aware Vision Transformer for Differential Diagnosis of Parkinsonism with \(^{18}\)F-FDG PET. In: Frangi, A., de Bruijne, M., Wassermann, D., Navab, N. (eds) Information Processing in Medical Imaging. IPMI 2023. Lecture Notes in Computer Science, vol 13939. Springer, Cham. https://doi.org/10.1007/978-3-031-34048-2_11
Download citation
DOI: https://doi.org/10.1007/978-3-031-34048-2_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34047-5
Online ISBN: 978-3-031-34048-2
eBook Packages: Computer ScienceComputer Science (R0)