A Design of 16TOPS Efficient GEMM Module in Deep Learning Accelerator | IEEE Conference Publication | IEEE Xplore