Teaching Structured Vision & Language Concepts to Vision & Language Models | IEEE Conference Publication | IEEE Xplore