Cross on Cross Attention: Deep Fusion Transformer for Image Captioning | IEEE Journals & Magazine | IEEE Xplore