Convolutional neural networks and multimodal fusion for text aided image classification | IEEE Conference Publication | IEEE Xplore