A Practical Performance Model for Compute and Memory Bound GPU Kernels | IEEE Conference Publication | IEEE Xplore