UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding | IEEE Conference Publication | IEEE Xplore