CSTNET: Enhancing Global-To-Local Interactions for Image Captioning | IEEE Conference Publication | IEEE Xplore