Abstract
The multicast operation is a useful operation in parallel applications. It is therefore important to ensure that for a given architecture, the parallel application runs efficiently. With the hardware-supported multicast of the InfiniBand Architecture (IBA), we propose a multicast scheme for m×n mesh InfiniBand networks based on XY routing. The basic concept of the proposed multicast scheme is to find the union sets of the output ports of switches, which are in the paths between the source node and each destination node in a multicast group. Furthermore, in the proposed scheme, we consider the usage of virtual lanes and evaluate their performance. We implement the proposed multicast scheme on a 2-D mesh InfiniBand network simulator. Several multicast configurations consisting of different message size, different traffic workload and different number of virtual lanes are simulated. The simulation results show that the proposed multicast scheme outperforms its corresponding unicast scheme for all simulation cases. The larger the message size, the larger the number of multicast source nodes, and the larger the size of the multicast group, the better the speedup that can be expected from the proposed multicast scheme. The usage of virtual lanes is also shown to improve the speed of the multicast operations.
Similar content being viewed by others
References
Alfaro FJ, Sanchez JL, Duato J (2004) QoS in InfiniBand subnetworks. IEEE Trans Parallel Distrib Syst 15(9):810–823
Chiang C-M, Ni LM (1995) Deadlock-free multi-head wormhole routing. In: Proceedings of the first high performance computing—Asia
Chien A, Kim JH (1992) Planar-adaptive routing: low-cost adaptive networks for multiprocessors. In: Proceedings of the 19th international symposium on computer architecture, May, pp 268–277
Duan X, Zhang D, Sun X (2009) Fault-tolerant routing schemes for wormhole mesh. In: Proceedings of international symposium on parallel and distributed processing with applications, August, pp 298–301
Duato J (1993) A new theory of deadlock-free adaptive multicast routing in wormhole networks. In: Proceedings of the 5th IEEE symposium on parallel and distributed processing, December, pp 64–71
Duato J (1994) A theory of fault-tolerant routing in wormhole networks. In: Proceedings of the international conference on parallel and distributed systems, December, pp 600–607
Duato J, Yalamanchili S, Ni L (1997) Interconnection networks—an engineering approach. IEEE CS Press, Los Alamitos
Glass J, Ni LM (1992) The turn model for adaptive routing. In: Proceedings of the 19th international symposium on computer architecture, May, pp 278–287
InfiniBand™ (2008) Trade Association, InfiniBand™ architecture specification vol 1, Release 1.2.1, January
Jesshope R, Miller PR, Yantchev JT (1989) High performance communications in processor networks. In: Proceedings of the 16th international symposium on computer architecture, May–June, pp 150–157
Jiang Z, Wu J, Wang D (2005) A new fault information model for fault-tolerant adaptive and minimal routing in 3-D meshes. In: Proceedings of international conference on parallel processing, June, pp 500–507
Kessler RE, Schwarzmeire JL (1993) CRAY T3D: a new dimension for Cray research. In: Proceedings of compcon, pp 176–182
Lenoski D et al (1992) The Stanford DASH multiprocessor. IEEE Comput 25(3):63–79
Lin X, Ni LM (1991) Deadlock-free multicast wormhole routing in multicomputer networks. In: Proceedings of the 18th international symposium on computer architecture, May, pp 116–125
Lin X, McKinley PK, Ni LM (1991) Performance evaluation of multicast wormhole routing in 2-D-mesh multicomputers. In: Proceedings of the international conference on parallel processing, August, vol I, pp 435–442
Lin X, Mckinley PK, Esfahanian AH (1993) Adaptive multicast wormhole routing in 2-D mesh multicomputers. In: Proceedings of parallel architectures and languages Europe 93, June, pp 228–241
Linder H, Harden JC (1991) An adaptive and fault-tolerant wormhole routing strategy for k-ary n-cubes. IEEE Trans Comput C-40(1):2–22
Littlefield RJ (1992) Characterizing and tuning communications performance for real applications. In: Proceedings of the first international DELTA applications workshop, February
Mejia A, Flich J, Duato J, Reinemo S-A, Skeie T (2006) Segment-based routing: an efficient fault-tolerant routing algorithm for meshes and tori. In: Proceedings of international symposium on parallel and distributed processing, April
Mohapatra P, Varavithya V (1996) A hardware multicast routing algorithm for two-dimensional meshes. In: IEEE symposium on parallel and distributed processing, October, pp 198–205
Nickmanesh S, Movaghar A, Rookhosh F (2008) Performance modeling of fault-tolerant fully adaptive wormhole switching 2-D meshes in presence of virtual channels. In: Proceedings of international conference on systems and networks communications, October, pp 109–114
Nienaber W, Yuan X, Duan Z (2009) LID assignment in InfiniBand networks. IEEE Trans Parallel Distrib Syst 20(4):484–497
Sullivan H, Bashhow TR (1977) A large scale, homogeneous, fully, distributed parallel machine. In: Proceedings of the 4th international symposium on computer architecture, March
Theiss I, Lysne O (2006) FRoots: a fault-tolerant and topology-flexible routing technique. IEEE Trans Parallel Distrib Syst 17(10):1136–1150
Vishnu A, Krishnan M, Panda DK (2009) An efficient hardware-software approach to network fault-tolerance with InfiniBand. In: Proceedings of international conference on cluster computing and workshops, August, pp 1–9
Zhou J, Lin X-Y, Chung Y-C (2007) Hardware supported multicast in fat-tree-based InfiniBand networks. J Supercomput 40(3):333–352
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhou, J., Liu, SE. & Chung, YC. A hardware supported multicast scheme based on XY routing for 2-D mesh InfiniBand networks. J Supercomput 59, 901–919 (2012). https://doi.org/10.1007/s11227-010-0478-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-010-0478-z