BlockMaestro: Enabling Programmer-Transparent Task-based Execution in GPU Systems | IEEE Conference Publication | IEEE Xplore