Modeling Parallel Bandwidth: Local versus Global Restrictions

Adler, M.; Gibbons, P. B.; Matias, Y.; Ramachandran, V.

doi:10.1007/PL00008269

Modeling Parallel Bandwidth: Local versus Global Restrictions

Published: July 1999

Volume 24, pages 381–404, (1999)
Cite this article

Algorithmica Aims and scope Submit manuscript

M. Adler¹,
P. B. Gibbons²,
Y. Matias³ &
…
V. Ramachandran⁵

68 Accesses
3 Citations
Explore all metrics

Abstract.

Recently there has been an increasing interest in models of parallel computation that account for the bandwidth limitations in communication networks. Some models (e.g., bsp, logp, and qsm) account for bandwidth limitations using a per-processor parameter g > 1 , such that each processor can send/receive at most h messages in g . . . h time. Other models (e.g., pram(m )) account for bandwidth limitations as an aggregate parameter m < p , such that the p processors can send at most m messages in total at each step.

This paper provides the first detailed study of the algorithmic implications of modeling parallel bandwidth as a per-processor (local) limitation versus an aggregate (global) limitation. We consider a number of basic problems such as broadcasting, parity, summation, and sorting, and give several new upper and lower time bounds that demonstrate the advantage of globally limited models over locally limited models given the same aggregate bandwidth (i.e., p . . . 1/g = m ). In general, globally limited models have a possible advantage whenever there is an imbalance in the number of messages sent/received by the processors. To exploit this advantage, the processors must schedule the sending of messages so as to respect the aggregate bandwidth limit. We present a new parallel scheduling algorithm for globally limited models that enable an unknown, arbitrarily unbalanced set of messages to be sent through the limited bandwidth within a (1 + ε) factor of the optimal off-line schedule with high probability, even if the penalty for overloading the network is an exponential function of the overload. We also present a near-optimal algorithm for the case where long messages must be sent as flits in consecutive time steps, as well as for the case where new messages to be sent arrive dynamically over an infinite time line. These results consider both message passing (distributed memory) and shared memory scenarios, and improve upon the best results for the locally limited model by a factor of Θ(g) . Finally, we present results quantifying the power of concurrent reads in a globally limited bandwidth setting, including showing an Ω(p lg m/m lg p) time separation between the exclusive-read and the concurrent-read pram(m ) models, which, when m << p , greatly improves upon the \(2^{\Omega(\sqrt{\lg p})}\) separation known previously.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Author information

Authors and Affiliations

Department of Computer Science, University of Toronto, 10 King's College Road, Toronto, Ontario, Canada M5S 3G4. micah@cs.toronto.edu., , , , , , CA
M. Adler
Bell Laboratories (Lucent Technologies), 600 Mountain Avenue, Murray Hill, NJ 07974, USA. gibbons@research.bell-labs.com., , , , , , US
P. B. Gibbons
Department of Computer Science, Tel Aviv University, Tel Aviv, 69978, Israel, matias@math.tau.ac.il, , , , , , IL
Y. Matias
Department of Computer Sciences, University of Texas at Austin, Austin, TX 78712, USA. vlr@cs.utexas.edu, , , , , , US
V. Ramachandran

Authors

M. Adler
View author publications
You can also search for this author in PubMed Google Scholar
P. B. Gibbons
View author publications
You can also search for this author in PubMed Google Scholar
Y. Matias
View author publications
You can also search for this author in PubMed Google Scholar
V. Ramachandran
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Received June 1, 1997; revised March 10,1998.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Adler, M., Gibbons, P., Matias, Y. et al. Modeling Parallel Bandwidth: Local versus Global Restrictions . Algorithmica 24, 381–404 (1999). https://doi.org/10.1007/PL00008269

Download citation

Issue Date: July 1999
DOI: https://doi.org/10.1007/PL00008269

Key words. Limited bandwidth, Parallel computation, Modeling.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Modeling Parallel Bandwidth: Local versus Global Restrictions

Abstract.

Access this article

Similar content being viewed by others

Scalability in Parallel Processing

A Unified Framework for Designing EPTAS’s for Load Balancing on Parallel Machines

Modeling Contention and Mapping Effects in Multi-core Clusters

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Navigation

Modeling Parallel Bandwidth: Local versus Global Restrictions

Abstract.

Access this article

Similar content being viewed by others

Scalability in Parallel Processing

A Unified Framework for Designing EPTAS’s for Load Balancing on Parallel Machines

Modeling Contention and Mapping Effects in Multi-core Clusters

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation