A Multi-Layer Attention Network for Visual Commonsense Reasoning | IEEE Conference Publication