Performance analysis of distributed memory computers with parallel node architecture

doi:10.1016/0164-1212(94)00069-Y

Journal of Systems and Software

Volume 29, Issue 2, May 1995, Pages 107-120

https://doi.org/10.1016/0164-1212(94)00069-Y Get rights and content

Abstract

In a distributed memory computer (DMC), parallelism at node level can be achieved by use of pipelined arithmetic units or communication processors that allow overlap between processing and communication activities. We derive a performance model for distributed memory computers whose nodes are vector-processing elements (VPEs), i.e., nodes with a parallel internal architecture. The model is based on the one introduced by Hockney for SIMD computers and shared-memory MIMD architectures. The approach has been extended to distributed-memory architectures, including VPE networks, to achieve a powerful characterization of these systems in terms of a few performance parameters. The discussion points out how vector capabilities can be effectively exploited in DMCs and identifies the parameters of the concurrent system (hardware and software) that most significantly affect the overall performance. Finally, the generality of the model is discussed with respect to different kinds of VPE architectures proposed in the literature or available on the market.

References (9)

G. Iannello et al.
Communication Workload Analysis in Symmetric Concurrent Systems
J. Parallel Dist. Comp.
(1994)
S.J. Bradshaw
TTM100: An i860 Processing Unit for Multi-Processing Systems
Transtech Tech. Note
(1990)
W.J. Daily
The Message-Driven Processor: A Multicomputer Processing Node with Efficient Mechanisms
IEEE Micro
(1992)
G.C. Fox

There are more references available in the full text version of this article.

Cited by (0)

View full text

Performance analysis of distributed memory computers with parallel node architecture

Abstract

J. Parallel Dist. Comp.

TTM100: An i860 Processing Unit for Multi-Processing Systems

Transtech Tech. Note

The Message-Driven Processor: A Multicomputer Processing Node with Efficient Mechanisms

IEEE Micro