Understanding and Optimizing INT4 Convolution for Accelerated DNN Inference on Tensor Cores | IEEE Conference Publication | IEEE Xplore