Highly Efficient and GPU-Friendly Implementation of BFS on Single-node System | IEEE Conference Publication | IEEE Xplore