Context-Aware Hierarchical Transformer for Fine-Grained Video-Text Retrieval | IEEE Conference Publication | IEEE Xplore