Align or attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval | IEEE Conference Publication | IEEE Xplore