Understanding and Reducing Weight-Load Overhead of Systolic Deep Learning Accelerators | IEEE Conference Publication | IEEE Xplore