Vision-and-Dialog Navigation by Fusing Cross-modal features | IEEE Conference Publication