Visual Grounding With Joint Multimodal Representation and Interaction | IEEE Journals & Magazine | IEEE Xplore