Multi-modal Fine-grained Retrieval with Local and Global Cross-Attention | IEEE Conference Publication | IEEE Xplore