A Translation Framework for Visually Grounded Spoken Unit Discovery | IEEE Conference Publication | IEEE Xplore