Glove-Ing Attention: A Multi-Modal Neural Learning Approach to Image Captioning | IEEE Conference Publication | IEEE Xplore