Elsevier

Parallel Computing

Volume 24, Issue 14, December 1998, Pages 2053-2075
Parallel Computing

Performance evaluation of deterministic wormhole routing in k-ary n-cubes

https://doi.org/10.1016/S0167-8191(98)00094-5Get rights and content

Abstract

We present a new analytical approach for the performance evaluation of deterministic wormhole routing in k-ary n-cubes. Our methodology achieves closed formulas for average time values through the analysis of network flows. The comparison with simulation models demonstrates that our methodology gives accurate results for both low and high traffic conditions. Another important quality is the flexibility of our approach. We demonstrate that it can be used to model dimension-ordered-routing in several k-ary n-cubes such as hypercubes, 3D symmetric and asymmetric tori, architectures with uni- and bi-directional channels.

Introduction

Wormhole routing is an efficient switching technique which is adopted by a variety of parallel machines with distributed memory architectures, such as Intel Paragon, nCUBE-2, MIT J-machine, Cray T3D. Most multicomputers employ wormhole in combination with deterministic routing policies: the route between two nodes is determined in a static way and, in case of link conflict, the transmission waits until the link becomes free. Although the deterministic policies do not have the ability to respond to dynamic network conditions, their implementation is quite simple. Conversely, adaptive wormhole routing requires more sophisticated hardware and more complex algorithms for preventing deadlocks and livelocks. An interesting survey of wormhole routing is contained in [14]. Commercial multicomputers adopted deterministic routing in three types of k-ary n-cubes: uni-directional hypercube, uni-directional and bi-directional torus, bi-directional mesh [13]. In uni-directional systems, there is only one link between adjacent nodes, while in a bi-directional system, each connection between two nodes consists of one link for each direction. For bi-directional networks, the wrap-around links between the edge nodes are not essential, while in uni-directional systems without wrap-around, a communication may require k-1 hops even between adjacent nodes.

The goal of this paper is to propose an approach that achieves accurate closed formulas for the mean latency time of dimension-ordered wormhole routing in k-ary n-cubes. The error is below 10% in most instances, and much less for low and average traffic conditions. An important contribution of our results is that the same approach is immediately applicable to any k-ary n-cube with wrap-around connections for k < 5. For higher values of k, it is impossible to construct a deadlock-free minimal deterministic routing algorithm [5]. We present the methodology for hypercubes, an asymmetric (k0×k1×k2) and symmetric (k×k×k) torus with uni-directional and bi-directional links, respectively.

Performance analysis of wormhole routing in k-ary n-cubes topologies has been investigated by several authors. The majority of them adopted simulation models 2, 3, 4, 7, 8, 12, 15, 16, while few results were obtained analytically 6, 11, 1. Adve and Vernon [1] use approximate Mean Value Analysis to analyze the performance of wormhole routing in a k-ary n-cube, where the system is modeled as a closed queuing network. This choice makes their model realistic, although complex to the extent that it can be solved only through iterative algorithms. Dally [6] proposes a general model for k-ary n-cubes, that he adopts to demonstrate that low k-dimensional topologies provide best performance. The results of this paper are a very important contribution in this area, however the Dally model solves only symmetric topologies with uni-directional links. Kim and Das [11] specialize this model for the hypercube topology. They obtain accurate results, although limited to the hypercube that is a perfect symmetric and balanced topology. Our approach is more general than the models proposed in 6, 11, because it is applicable to several topologies such as hypercubes, symmetric and asymmetric tori with both uni-directional and bi-directional links. The contribution of our model is outlined in Table 1.

The paper is organized as follows. In Section 2, we describe the operational features of the interconnection network and motivate the assumptions that make the model analytically tractable. In Section 3, we present the approach for the message latency time evaluation focusing on a 3D asymmetric torus with uni-directional links. In 4 The model for torus with bi-directional links, 5 The model for hypercube, we demonstrate the flexibility of our approach by extending the analysis to a 3D symmetric torus with bi-directional links and a hypercube, respectively. In Section 6, we validate all models by comparing the analytical results against time values which are obtained from our simulations and other published results. In Section 7, we use our models to evaluate the performance of wormhole routing in several network dimensions and traffic loads. Section 8concludes the paper with some final remarks.

Section snippets

Deterministic wormhole routing

There are three main routing strategies that transmit data while building the path from the source to the destination node: store-and-forward, virtual-cut-through, and wormhole. The store-and-forward technique collects the entire message at each intermediate node of the path before requesting the next link. A first reduction of the transmission time was achieved by the virtual-cut-through proposed by Kermani and Kleinrock [10]. Unlike store-and-forward, this technique divides the message in

The model for torus with uni-directional links

In this section we consider XYZ dimension-ordered routing in a k0×k1×k2 torus with uni-directional links. The proposed methodology allows us to evaluate the average message transmission time through a closed formula. Moreover, we demonstrate the flexibility of the model by analyzing the routing in 3D symmetric torus with bi-directional links in Section 4, and hypercube in Section 5.

The model for torus with bi-directional links

The approach presented in Section 3is highly flexible. We demonstrate how it can be extended to other architectures by analyzing the performance of dimension-ordered routing in a 3D symmetric torus with bi-directional links. The same methodology can be easily applied even to other torus and hypercube topologies. Because of space limitations, we only point out the main differences between the model for bi-directional links and that for asymmetric torus with uni-directional channels. The basic

The model for hypercube

The model for deterministic wormhole routing in a hypercube is for certain aspects a simplification of the model for the 3D torus. The hypercube is, in fact, a perfect symmetric topology in any dimension, that is, for any n of the 2-ary n-cube. As routing policy, we consider the E-cube algorithm [17] which is the dimension-ordered policy applied to hypercube topologies. Since the hypercube has more links than a torus and the E-cube algorithm allows each message to use only a small subset of the

Validation of the models

In this section we validate the analytical models with uni-directional and bi-directional links, and use them to obtain some interesting performance parameters. We compare the results of the former model against the analytical and simulation results presented by Dally in [6].

Table 2 shows the average latency time as a function of the traffic 1/τ (expressed in bit/cycle) in a 16-ary 3-cube. Our analytical values are obtained by setting k0=k1=k2=16 and τE=τ/L in Eq. (15). These results are very

Performance results

We can use the analytical models to evaluate the performance of deterministic wormhole routing in various network topologies and scenarios. The values of this section are obtained by considering 1/τE as the message generation rate.

Fig. 6 shows the sensitivity of the latency time to the distribution of the links in each of the three dimensions of a torus with 103 nodes and uni-directional links. The symmetric topologies achieve best performance and this occurs even in the (not shown) case of

Conclusions

We propose a new approach for modeling deterministic (dimension-ordered) wormhole routing in symmetric and asymmetric k-ary n-cubes. The methodology is based on the backward flow analysis and can be considered a unifying approach to achieve closed formulas for the mean latency time in any k-ary n-cube with wrap-around connections and uni-directional or bi-directional links.

In this paper we apply this methodology to the hypercube and two topologies that were not yet solved through closed

References (18)

  • P. Kermani et al.

    Virtual cut-through: A new computer communication switching technique

    Computer Networks

    (1979)
  • V.S. Adve et al.

    Performance analysis of mesh interconnection networks with deterministic routing

    IEEE Trans. Parallel and Distributed Systems

    (1994)
  • R.V. Boppana, S. Chalasani, A comparison of adaptive wormhole routing algorithms, in: Proceedings of 20th Annual...
  • A.A. Chien et al.

    Planar-adaptive routing: low-cost adaptive networks for multiprocessors

    J. ACM

    (1995)
  • M. Colajanni, A. Dell'Arte, B. Ciciani, Communication switching techniques and link-conflict resolution strategies: A...
  • W.J. Dally et al.

    Deadlock-free message routing in multicomputer networks

    IEEE Trans. Comput.

    (1987)
  • W.J. Dally

    Performance analysis of k-ary n-cube interconnection networks

    IEEE Trans. Comput.

    (1990)
  • L. Gravano et al.

    Adaptive deadlock- and livelock-free routing with all minimal paths in torus networks

    IEEE Trans. Parallel and Distributed Systems

    (1994)
  • D.C. Grunwald, D.A. Reed, Networks for parallel processors: measurements and prognostications, in: Proceedings of 3rd...
There are more references available in the full text version of this article.

Cited by (0)

View full text