Performance Optimization of Allreduce Operation for Multi-GPU Systems | IEEE Conference Publication | IEEE Xplore