Hardware-Based and Hybrid L1 Data Cache Bypassing to Improve GPU Performance | IEEE Conference Publication | IEEE Xplore