MultiViT: Multimodal Vision Transformer for Schizophrenia Prediction using Structural MRI and Functional Network Connectivity Data | IEEE Conference Publication | IEEE Xplore