Learning to Follow Verbal Instructions with Visual Grounding | IEEE Conference Publication | IEEE Xplore