Hierarchically characterizing CUDA program behavior | IEEE Conference Publication | IEEE Xplore