Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal | IEEE Conference Publication | IEEE Xplore