Self-Supervised Vision-Language Pretraining for Medial Visual Question Answering | IEEE Conference Publication | IEEE Xplore