Designing efficient small message transfer mechanism for inter-node MPI communication on InfiniBand GPU clusters (IEEE Conference Publication)