On “Exploiting” Node-Heterogeneous Clusters Optimally

Adler, Micah; Gong, Ying; Rosenberg, Arnold L.

doi:10.1007/s00224-007-9001-1

On “Exploiting” Node-Heterogeneous Clusters Optimally

Published: 04 July 2007

Volume 42, pages 465–487, (2008)
Cite this article

Theory of Computing Systems Aims and scope Submit manuscript

Micah Adler¹,
Ying Gong¹ &
Arnold L. Rosenberg¹

40 Accesses
Explore all metrics

Abstract

It is proved that “FIFO” worksharing protocols provide asymptotically optimal solutions to two problems related to sharing large collections of independent tasks in a heterogeneous network of workstations (HNOW) $\mathcal{N}$ . In the $\mathsf{HNOW-Exploitation Problem}$ , one seeks to accomplish as much work as possible on $\mathcal{N}$ during a prespecified fixed period of L time units. In the $\mathsf{HNOW-Rental Problem}$ , one seeks to complete W units of work by “renting” $\mathcal{N}$ for as short a time as necessary. The worksharing protocols we study are crafted within an architectural model that characterizes $\mathcal {N}$ via parameters that measure $\mathcal{N}$ ’s workstations’ computational and communicational powers. All valid protocols are self-scheduling, in the sense that they determine completely both an amount of work to allocate to each of $\mathcal{N}$ ’s workstations and a schedule for all related interworkstation communications. The schedules provide either a value for W given L, or a value for L given W, hence solve both of the motivating problems. A protocol observes a FIFO regimen if it has $\mathcal{N}$ ’s workstations finish their assigned work, and return their results, in the same order in which they are supplied with their workloads. The proven optimality of FIFO protocols resides in the fact that they accomplish at least as much work as any other protocol during all sufficiently long worksharing episodes, and that they complete sufficiently large given collections of tasks at least as fast as any other protocol. Simulation experiments illustrate that the superiority of FIFO protocols is often observed during worksharing episodes of only a few minutes’ duration.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Energy-efficient scheduling and routing via randomized rounding

Article 28 October 2016

Optimizing Egalitarian Performance in the Side-Effects Model of Colocation for Data Center Resource Management

Resource Allocation for Multi-tasking Optimization: Explanation of an Empirical Formula

References

Adler, M., Gong, Y., Rosenberg, A.L.: Asymptotically optimal worksharing in HNOWs: how long is “sufficiently long?” In: 36th Ann. Simulation Symp., pp. 39–46 (2003)
Alexandrov, A., Ionescu, M.I., Schauser, K.E., Scheiman, C.: LogGP: incorporating long messages into the LogP model for parallel computation. J. Parallel Distrib. Comput. 44, 71–79 (1997)
Article Google Scholar
Anderson, T.E., Culler, D.E., Patterson, D.A., the HNOW Team: A case for NOW (networks of workstations). IEEE Micro 15, 54–64 (1995)
Article Google Scholar
Banikazemi, M., Moorthy, V., Panda, D.K.: Efficient collective communication on heterogeneous networks of workstations. In: Intl. Conf. on Parallel Processing, pp. 460–467 (1998)
Banino, C., Beaumont, O., Carter, L., Ferrante, J., Legrand, A., Robert, Y.: Scheduling strategies for master-slave tasking on heterogeneous processor grids. IEEE Trans. Parallel Distrib. Syst. 15, 319–330 (2004)
Article Google Scholar
Barlas, G.D.: Collection-aware optimum sequencing of operations and closed-form solutions for the distribution of a divisible load on arbitrary processor trees. IEEE Trans. Parallel Distrib. Syst. 9, 429–441 (1998)
Article Google Scholar
Beaumont, O., Carter, L., Ferrante, J., Legrand, A., Robert, Y.: Bandwidth-centric allocation of independent tasks on heterogeneous platforms. In: Int. Parallel and Distrib. Process. Symp. (2002)
Beaumont, O., Legrand, A., Robert, Y.: The master-slave paradigm with heterogeneous processors. IEEE Trans. Parallel Distrib. Syst. 14, 897–908 (2003)
Article Google Scholar
Beaumont, O., Marchal, L., Robert, Y.: Scheduling divisible loads with return messages on heterogeneous master-worker platforms. In: High-Performance Computing: The 12th Int. Conf. Lecture Notes in Computer Science, vol. 3769, pp. 498–507. Springer, Berlin (2005)
Google Scholar
Bharadwaj, V., Ghose, D., Mani, V.: Optimal sequencing and arrangement in distributed single-level tree networks. IEEE Trans. Parallel Distrib. Syst. 5, 968–976 (1994)
Article Google Scholar
Bharadwaj, V., Ghose, D., Mani, V.: Multi-installment load distribution in tree networks with delays. IEEE Trans. Aerosp. Electron. Syst. 31, 555–567 (1995)
Article Google Scholar
Bharadwaj, V., Ghose, D., Mani, V., Robertazzi, T.G.: Scheduling Divisible Loads in Parallel and Distributed Systems. Wiley, New York (1996)
Google Scholar
Bhat, P.B., Prasanna, V.K., Raghavendra, C.S.: Adaptive communication algorithms for distributed heterogeneous systems. In: 7th IEEE Int. Symp. on High Performance Distributed Computing (1998)
Bhat, P.B., Raghavendra, C.S., Prasanna, V.K.: Efficient collective communication in distributed heterogeneous systems. In: 19th IEEE Int. Conf. on Distributed Computing Systems (1999)
Cappello, F., Fraigniaud, P., Mans, B., Rosenberg, A.L.: An algorithmic model for heterogeneous clusters: rationale and experience. Int. J. Found. Comput. Sci. 16, 195–216 (2005)
Article MATH Google Scholar
Cheng, Y.C., Robertazzi, T.G.: Distributed computation for tree networks with communication delays. IEEE Trans. Aerosp. Electron. Syst. 26, 511–516 (1990)
Article Google Scholar
Culler, D.E., Karp, R.M., Patterson, D., Sahay, A., Schauser, K.E., Santos, E., Subramonian, R., von Eicken, T.: LogP: towards a realistic model of parallel computation. Commun. ACM 39, 78–85 (1996)
Article Google Scholar
Dutot, P.-F.: Master-slave tasking on heterogeneous processors. In: 17th Int. Parallel and Distributed Processing Symp. (2003)
Fraigniaud, P., Mans, B., Rosenberg, A.L.: Efficient trigger-broadcasting in heterogeneous clusters. J. Parallel Distrib. Comput. 65, 628–642 (2005)
Article MATH Google Scholar
Hsu, T.-S., Lee, J.C., Lopez, D.R., Royce, W.A.: Task allocation on a network of processors. IEEE Trans. Comput. 49, 1339–1353 (2000)
Article MathSciNet Google Scholar
Karp, R.M., Sahay, A., Santos, E., Schauser, K.E.: Optimal broadcast and summation in the logP model. In: 5th ACM Symp. on Parallel Algorithms and Architectures, pp. 142–153 (1993)
Kesavan, R., Bondalapati, K., Panda, D.K.: Multicast on irregular switch-based networks with wormhole routing. In: 3rd Int. Symp. on High-Performance Computer Architecture (1996)
Pfister, G.F.: In Search of Clusters. Prentice-Hall, Englewood Cliffs (1995)
MATH Google Scholar
Reimann, D.A., Chaudhary, V., Sethi, I.K.: Modeling cone-beam tomographic reconstruction using LogSMP: an extended LogP model for clusters of SMPs. In: 6th Int. Conf. on High-Performance Computing. Lecture Notes in Computer Science, vol. 1745, pp. 77–83. Springer, Berlin (1999)
Google Scholar
Rosenberg, A.L.: On sharing bag-of-tasks workloads in heterogeneous networks of workstations: greedier is not better. In: 3rd IEEE Int. Conf. on Cluster Computing, pp. 124–131 (2001)
Tosun, A.S., Agarwal, A.: Efficient broadcast algorithms for heterogeneous networks of workstations. In: 13th Int. Conf. on Parallel and Distributed Computing Systs. (2000)
White, S.W., Torney, D.C.: Use of a workstation cluster for the physical mapping of chromosomes. SIAM News, 14–17 (March 1993)
Yang, Y., Casanova, H.: UMR: A multi-round algorithm for scheduling divisible workloads. In: 17th Int. Parallel and Distributed Processing Symp. (2003)

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Massachusetts Amherst, Amherst, MA, 01003, USA
Micah Adler, Ying Gong & Arnold L. Rosenberg

Authors

Micah Adler
View author publications
You can also search for this author inPubMed Google Scholar
Ying Gong
View author publications
You can also search for this author inPubMed Google Scholar
Arnold L. Rosenberg
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Micah Adler.

Additional information

A portion of this research was presented at the 15th ACM Symp. on Parallelism in Algorithms and Architectures (2003).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Adler, M., Gong, Y. & Rosenberg, A.L. On “Exploiting” Node-Heterogeneous Clusters Optimally. Theory Comput Syst 42, 465–487 (2008). https://doi.org/10.1007/s00224-007-9001-1

Download citation

Published: 04 July 2007
Issue Date: May 2008
DOI: https://doi.org/10.1007/s00224-007-9001-1

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On “Exploiting” Node-Heterogeneous Clusters Optimally

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Energy-efficient scheduling and routing via randomized rounding

Optimizing Egalitarian Performance in the Side-Effects Model of Colocation for Data Center Resource Management

Resource Allocation for Multi-tasking Optimization: Explanation of an Empirical Formula

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now