Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters | IEEE Conference Publication | IEEE Xplore