A Highly Efficient SGEMM Implementation using DMA on the Intel/Movidius Myriad-2 | IEEE Conference Publication | IEEE Xplore