Efficient Fork-Join on GPUs Through Warp Specialization | IEEE Conference Publication | IEEE Xplore