CoD: Coherent Detection of Entities from Images with Multiple Modalities | IEEE Conference Publication | IEEE Xplore