CUDA M3: Designing Efficient CUDA Managed Memory-Aware MPI by Exploiting GDR and IPC | IEEE Conference Publication | IEEE Xplore