An efficient implementation for broadcasting data in parallel applications over Ethernet clusters | IEEE Conference Publication