Multi-modal Representation Learning for Short Video Understanding and Recommendation | IEEE Conference Publication | IEEE Xplore