Fast Blockwise Matrix-Matrix Multiplication Using AVX and Prefetching on Shared Memory | IEEE Conference Publication