LU Factorization of Small Matrices: Accelerating Batched DGETRF on the GPU | IEEE Conference Publication | IEEE Xplore