Representation Learning Through Cross-Modality Supervision | IEEE Conference Publication