Efficient Performance Estimation and Work-Group Size Pruning for OpenCL Kernels on GPUs | IEEE Journals & Magazine | IEEE Xplore