Locality-aware Thread Block Design in Single and Multi-GPU Graph Processing | IEEE Conference Publication | IEEE Xplore