TTPM – An efficient deadlock-free algorithm for multicast communication in 2D torus networks

https://doi.org/10.1016/j.sysarc.2008.03.004Get rights and content

Abstract

A torus network has become increasingly important to multicomputer design because of its many features including scalability, low bandwidth and fixed degree of nodes. A multicast communication is a significant operation in multicomputer systems and can be used to support several other collective communication operations. This paper presents an efficient algorithm, TTPM, to find a deadlock-free multicast wormhole routing in two-dimensional torus parallel machines. The introduced algorithm is designed such that messages can be sent to any number of destinations within two start-up communication phases; hence the name Torus Two Phase Multicast (TTPM) algorithm. An efficient routing function is developed and used as a basis for the introduced algorithm. Also, TTPM allows some intermediate nodes that are not in the destination set to perform multicast functions. This feature allows flexibility in multicast path selection and therefore improves the performance. Performance results of a simulation study on torus networks are discussed to compare TTPM algorithm with a previous algorithm.

Introduction

In high performance multicomputers, processors communicate between each other by passing messages via the communication network. Torus networks are widely used in high performance parallel computing systems and have been implemented in many research and commercial multicomputer systems such as Cray T3D, the Torus Routing Chip [1], and AP1000 [2]. They over useful edge connectivety and they can be partitioned into meshes. The two dimension (2D) torus topology is important in multicomputer design because of its many properties including constant node degree, constant length channel wires, high channel bandwidth, and low contention latency. In torus network topologies, wraparound channels have been added to connect each edge node to the corresponding node on the opposite edge. Under random traffic, the symmetry of torus networks leads to a more balanced utilization of communication channels than in mesh networks. With wormhole switching techniques and lower dimensional networks, performance analysis of direct networks offers improved latency and throughput results for the same network bandwidth [3].

The performance of multicast communication is measured in terms of its latency in delivering a message to all destinations. In wormhole-routed networks, the communication latency consists of two parts, start-up latency and network latency. The start-up latency is the time required to start a message, which involves operation system overheads. The network latency is a combination of propagation delay, router delay, and contention delay. The contention latency is computed for all delays associated with contention for routing resources among the various worms in the network. In multicast communication where an operation might consist of several communication phases, the start-up latency has a significant impact on the performance [4]. Deadlock in the interconnection network occurs when a set of messages is blocked forever because each message in the set holds one or more resources needed by another message in this set [5]. Collective communication, which involves a group of intercommunicating nodes, is useful in several applications. Some parallel programming languages provide efficient support for these applications. The importance of collective communication is further demonstrated by their inclusion in the message passing interface (MPI) standards [6]. An important primitive among collective communication operations is multicast communication, which is defined as sending a single message from a source node to a set of destination nodes.

The multicast communication can be used as a basis for many collective operations, such as barrier synchronization and global reduction, as well as cache invalidation in shared-memory multiprocessor systems [7]. The design and performance of multicast communication depends on several characteristics of the network architecture, including the switching strategy and network topology. The four main switching techniques are circuit switching, store-and-forward, virtual-cut-through, and wormhole switching [8]. The popular switching technique is wormhole routing in which a message is decomposed into a number of packets, which are further subdivided into flits. Flits are pipelined among several nodes by way of routers at the nodes [9]. A flit is the smallest unit of information that a channel can accept or refuse. The flit length is often affected by the network size. A 256-node network requires 8 bits per flit [5], [8]. Routing algorithms have been developed for wormhole communications networks [8], [10].

The basic node architecture of a 2D torus is illustrated in Fig. 1. The router provides communication services to the host processor. The incoming/outgoing internal channels to/from the router are usually referred as injection/consumption channels. The number of injection/consumption channels implies the number of messages that can be sent/received concurrently by the processor. To reduce the latency, multicast communication needs to be supported by the hardware. Hardware multicasting requires some additional functionality implemented along with the abilities associated with unicast communication. The communication services as forward and absorb, etc., which are required in wormhole-routed networks for multicasting are defined in the next Section 3.2. The importance of hardware supported multicasting was reported by Ni [11]. Also, in order to reduce the complexity of the communication hardware, many wormhole-routed systems support a so-called one-port communications architecture, in which each node can access the network through a single communication channel. One-port architectures occur frequently in massively parallel computers. Most wormhole-routed, torus networks use one-port architectures [6], [12].

An n × m 2D torus, denoted by Tn×m, is a 2D torus network computer system, where n, m represent the numbers of columns, and rows respectively. For each node pi = (xi, yi)  Tn×m, 0  xi  n  1, 0  yi  m  1, and deg [pi] = 4 is the degree of node pi. The absorb and the forward capability introduces additional resource dependencies that can lead to deadlock situations. The upper bound of the number of consumption channels required to avoid such deadlocks is equal to nv where n is the network dimension ofTn×n and v is the number of virtual channels per direction [6].

In this paper, torus networks with bi-directional channels are used. The neighbor nodes in a bi-directional torus may be connected by either single, bi-directional physical channels, or by pairs of unidirectional physical channels in opposite directions. For simplicity, the torus networks will be drawn without channels.

This paper is organized as follows. The next section summarizes related research. The types of multicast phases and some properties of the introduced algorithm are discussed in Section 3. A deterministic TTPM algorithm based on the dimension order routings is reported in Sections 4 Torus Two Phase Multicast (TTPM) algorithm, 5 Performance evaluation. Finally, results and discussion are given in Section 6.

Section snippets

Related research

The algorithms for multicast communication can be categorized into three types, unicast-based, path-based, and tree-based [7], [13], [14]. Among the unicast-based multicast methods, separate addressing algorithm is the simplest one, in which the source node iteratively unicasts the message to each destination node one after another. Unicast-based multicasting can also be performed in a multi-phase communication structure, in which the destination nodes are organized in some sort of a binomial

The two phase multicast algorithm

In order to perform a multicast operation using multi-destination messages, one or more communication steps may be used. Methods that reach all destination nodes in one communication step are termed single-phase [10], while those that require more than one step are called multi-phase [18]. During the first phase of a multi-phase multicast, the source node sends a multi-destination message to a subset of the destination nodes. During subsequent phases, some (perhaps all) of the nodes that have

Torus Two Phase Multicast (TTPM) algorithm

In this section, the proposed multicast routing algorithm is introduced. It uses at most two start-up phases to communicate with any number of destinations.

Performance evaluation

The time needed to send a message M between two arbitrary nodes is given by: T (M = Ts + Tn + Tb, where Ts is the start-up time, Tn is the network time, and Tb is the blocking time [10]. As long as there is no congestion in the network, T (M) is well approximated by: T (M) = Ts + Tn [15]. The time Ts consists of two parts, the first one at the source node in the beginning of the first phase, denoted by TS1. After the first phase is completed, some of the intermediate nodes along the main path need a

Results and discussions

To compare the performance of the two algorithms, TDP and TTPM, it is assumed that the network latency time between any two nodes is 30 ns, which is directly mapped to one time unit. For TTPM algorithm, TS1 is set to 1 μs (33 time units), and TS2 is set to 240 ns (8 time units) because TTPM algorithm takes more time in the first start-up time. For TDP algorithm, there are two start-up times nearly equal, so anyone is set to 450 ns (15 time units). Up to 100 random 2D torus networks that contain two

Conclusions

In this paper, a deadlock-free two phase multicast wormhole routing in 2D torus parallel systems, TTPM, was presented. TTPM algorithm is a path-based technique, which uses only the vertical boundary links of the torus network. It requires at most two communication start-up steps to multicast to any member of destinations. During the first step, the message is sent to a node using a main path such that the nodes covered during the first step can send the message to the remaining destinations in

Mohamed. G. Darwish is the Professor of Information Technology at the Faculty of Computers and Information, Cairo University, and Dean, Faculty of Computer Science and Information Technology – Ahram Canadian University Egypt. He received B.Sc. 1971 and M.Sc. 1974 degrees from Cairo University in Computers and Communications Engineering and Ph.D. degree from University of Toulouse France 1978 in Systems Engineering. His main interests are Computer Networks; Parallel and Distributed Systems;

References (19)

There are more references available in the full text version of this article.

Cited by (6)

  • A tree-based algorithm for multicasting in 2D torus networks

    2015, Egyptian Informatics Journal
    Citation Excerpt :

    Finally, conclusion is given in Section 5. Various multicast routing algorithms have been proposed for torus network topologies [2,6,8,11,16–23]. Some multicast routing algorithms for 2D torus networks [18–23] are described in this section.

  • Two Modified Multicast Algorithms for Two Dimensional Mesh and Torus Networks

    2016, Proceedings - International Symposium on Parallel Architectures, Algorithms and Programming, PAAP
  • 3-additive approximation algorithm for multicast time in 2D torus networks

    2016, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
  • Ready groups: A path-based multicast algorithm for 2D torus networks

    2010, INFOS2010 - 2010 7th International Conference on Informatics and Systems
  • An effective approach for multicast on multi-core architecture

    2009, International Conference on Scalable Computing and Communications - The 8th International Conference on Embedded Computing, ScalCom-EmbeddedCom 2009

Mohamed. G. Darwish is the Professor of Information Technology at the Faculty of Computers and Information, Cairo University, and Dean, Faculty of Computer Science and Information Technology – Ahram Canadian University Egypt. He received B.Sc. 1971 and M.Sc. 1974 degrees from Cairo University in Computers and Communications Engineering and Ph.D. degree from University of Toulouse France 1978 in Systems Engineering. His main interests are Computer Networks; Parallel and Distributed Systems; Software Process Improvement; Informatics Strategic Planning. Dr. Darwish is recipient of National Prize for outstanding research in Computer and Systems Engineering; Holding the National Medal of Science and Arts, first class; Listed in the National Encyclopedia for Distinguished Egyptian Personalities “who’s who in Egypt” and in the Arab International Encyclopedia, “who’s who in Arab and Islamic Countries”.

Ahmed A.A. Radwan is Professor of Computer Science, Head of Computer Science Department, Faculty of Science, Minia University, Egypt, and the Head of Quality Assurance and Accreditation Project unit – Faculty of Computer & Information Science-Minia University He received his B.Sc. 1975 and M.Sc. 1981 Degrees from Assiut University, Faculty of science, Egypt in mathematics and Ph.D. degree from Liverpool University, England in Computer Science. His main areas of interests are parallel processing, multicast communication, Algorithm design and analysis, Optimal Routing Algorithms for Computer Networks, Parallel Systems, Graph Drawing and Mobile Networks.

Mohamed A. Abd El-Baky received his B.Sc. Degree in Computational Sciences from Faculty of Science, Cairo University, Egypt in 1991. He received his M.Sc. Degree in Computer Science from Cairo University, in 1996 and Ph.D. degree in Computer Science from Cairo University, in 2000. From 2000, he is an assistant professor in Mathematics and Computer Science Division, at Cairo University. His research interests include parallel processing, message-passing systems, and multicast communication.

Kadry Hamed received his B.Sc. Degree in Mathematics from Faculty of Science, South Valley University, Egypt in 1998. He received his M.Sc. Degree in Computer Science from Fayoum University, Egypt in 2007. His research interests include parallel and distributed systems, multicast communication, and high performance networks.

View full text