Skip to main content
Log in

On “Exploiting” Node-Heterogeneous Clusters Optimally

  • Published:
Theory of Computing Systems Aims and scope Submit manuscript

Abstract

It is proved that “FIFO” worksharing protocols provide asymptotically optimal solutions to two problems related to sharing large collections of independent tasks in a heterogeneous network of workstations (HNOW) \(\mathcal{N}\) . In the \(\mathsf{HNOW-Exploitation Problem}\) , one seeks to accomplish as much work as possible on \(\mathcal{N}\) during a prespecified fixed period of L time units. In the \(\mathsf{HNOW-Rental Problem}\) , one seeks to complete W units of work by “renting” \(\mathcal{N}\) for as short a time as necessary. The worksharing protocols we study are crafted within an architectural model that characterizes \(\mathcal {N}\) via parameters that measure \(\mathcal{N}\) ’s workstations’ computational and communicational powers. All valid protocols are self-scheduling, in the sense that they determine completely both an amount of work to allocate to each of \(\mathcal{N}\) ’s workstations and a schedule for all related interworkstation communications. The schedules provide either a value for W given L, or a value for L given W, hence solve both of the motivating problems. A protocol observes a FIFO regimen if it has \(\mathcal{N}\) ’s workstations finish their assigned work, and return their results, in the same order in which they are supplied with their workloads. The proven optimality of FIFO protocols resides in the fact that they accomplish at least as much work as any other protocol during all sufficiently long worksharing episodes, and that they complete sufficiently large given collections of tasks at least as fast as any other protocol. Simulation experiments illustrate that the superiority of FIFO protocols is often observed during worksharing episodes of only a few minutes’ duration.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Adler, M., Gong, Y., Rosenberg, A.L.: Asymptotically optimal worksharing in HNOWs: how long is “sufficiently long?” In: 36th Ann. Simulation Symp., pp. 39–46 (2003)

  2. Alexandrov, A., Ionescu, M.I., Schauser, K.E., Scheiman, C.: LogGP: incorporating long messages into the LogP model for parallel computation. J. Parallel Distrib. Comput. 44, 71–79 (1997)

    Article  Google Scholar 

  3. Anderson, T.E., Culler, D.E., Patterson, D.A., the HNOW Team: A case for NOW (networks of workstations). IEEE Micro 15, 54–64 (1995)

    Article  Google Scholar 

  4. Banikazemi, M., Moorthy, V., Panda, D.K.: Efficient collective communication on heterogeneous networks of workstations. In: Intl. Conf. on Parallel Processing, pp. 460–467 (1998)

  5. Banino, C., Beaumont, O., Carter, L., Ferrante, J., Legrand, A., Robert, Y.: Scheduling strategies for master-slave tasking on heterogeneous processor grids. IEEE Trans. Parallel Distrib. Syst. 15, 319–330 (2004)

    Article  Google Scholar 

  6. Barlas, G.D.: Collection-aware optimum sequencing of operations and closed-form solutions for the distribution of a divisible load on arbitrary processor trees. IEEE Trans. Parallel Distrib. Syst. 9, 429–441 (1998)

    Article  Google Scholar 

  7. Beaumont, O., Carter, L., Ferrante, J., Legrand, A., Robert, Y.: Bandwidth-centric allocation of independent tasks on heterogeneous platforms. In: Int. Parallel and Distrib. Process. Symp. (2002)

  8. Beaumont, O., Legrand, A., Robert, Y.: The master-slave paradigm with heterogeneous processors. IEEE Trans. Parallel Distrib. Syst. 14, 897–908 (2003)

    Article  Google Scholar 

  9. Beaumont, O., Marchal, L., Robert, Y.: Scheduling divisible loads with return messages on heterogeneous master-worker platforms. In: High-Performance Computing: The 12th Int. Conf. Lecture Notes in Computer Science, vol. 3769, pp. 498–507. Springer, Berlin (2005)

    Google Scholar 

  10. Bharadwaj, V., Ghose, D., Mani, V.: Optimal sequencing and arrangement in distributed single-level tree networks. IEEE Trans. Parallel Distrib. Syst. 5, 968–976 (1994)

    Article  Google Scholar 

  11. Bharadwaj, V., Ghose, D., Mani, V.: Multi-installment load distribution in tree networks with delays. IEEE Trans. Aerosp. Electron. Syst. 31, 555–567 (1995)

    Article  Google Scholar 

  12. Bharadwaj, V., Ghose, D., Mani, V., Robertazzi, T.G.: Scheduling Divisible Loads in Parallel and Distributed Systems. Wiley, New York (1996)

    Google Scholar 

  13. Bhat, P.B., Prasanna, V.K., Raghavendra, C.S.: Adaptive communication algorithms for distributed heterogeneous systems. In: 7th IEEE Int. Symp. on High Performance Distributed Computing (1998)

  14. Bhat, P.B., Raghavendra, C.S., Prasanna, V.K.: Efficient collective communication in distributed heterogeneous systems. In: 19th IEEE Int. Conf. on Distributed Computing Systems (1999)

  15. Cappello, F., Fraigniaud, P., Mans, B., Rosenberg, A.L.: An algorithmic model for heterogeneous clusters: rationale and experience. Int. J. Found. Comput. Sci. 16, 195–216 (2005)

    Article  MATH  Google Scholar 

  16. Cheng, Y.C., Robertazzi, T.G.: Distributed computation for tree networks with communication delays. IEEE Trans. Aerosp. Electron. Syst. 26, 511–516 (1990)

    Article  Google Scholar 

  17. Culler, D.E., Karp, R.M., Patterson, D., Sahay, A., Schauser, K.E., Santos, E., Subramonian, R., von Eicken, T.: LogP: towards a realistic model of parallel computation. Commun. ACM 39, 78–85 (1996)

    Article  Google Scholar 

  18. Dutot, P.-F.: Master-slave tasking on heterogeneous processors. In: 17th Int. Parallel and Distributed Processing Symp. (2003)

  19. Fraigniaud, P., Mans, B., Rosenberg, A.L.: Efficient trigger-broadcasting in heterogeneous clusters. J. Parallel Distrib. Comput. 65, 628–642 (2005)

    Article  MATH  Google Scholar 

  20. Hsu, T.-S., Lee, J.C., Lopez, D.R., Royce, W.A.: Task allocation on a network of processors. IEEE Trans. Comput. 49, 1339–1353 (2000)

    Article  MathSciNet  Google Scholar 

  21. Karp, R.M., Sahay, A., Santos, E., Schauser, K.E.: Optimal broadcast and summation in the logP model. In: 5th ACM Symp. on Parallel Algorithms and Architectures, pp. 142–153 (1993)

  22. Kesavan, R., Bondalapati, K., Panda, D.K.: Multicast on irregular switch-based networks with wormhole routing. In: 3rd Int. Symp. on High-Performance Computer Architecture (1996)

  23. Pfister, G.F.: In Search of Clusters. Prentice-Hall, Englewood Cliffs (1995)

    MATH  Google Scholar 

  24. Reimann, D.A., Chaudhary, V., Sethi, I.K.: Modeling cone-beam tomographic reconstruction using LogSMP: an extended LogP model for clusters of SMPs. In: 6th Int. Conf. on High-Performance Computing. Lecture Notes in Computer Science, vol. 1745, pp. 77–83. Springer, Berlin (1999)

    Google Scholar 

  25. Rosenberg, A.L.: On sharing bag-of-tasks workloads in heterogeneous networks of workstations: greedier is not better. In: 3rd IEEE Int. Conf. on Cluster Computing, pp. 124–131 (2001)

  26. Tosun, A.S., Agarwal, A.: Efficient broadcast algorithms for heterogeneous networks of workstations. In: 13th Int. Conf. on Parallel and Distributed Computing Systs. (2000)

  27. White, S.W., Torney, D.C.: Use of a workstation cluster for the physical mapping of chromosomes. SIAM News, 14–17 (March 1993)

  28. Yang, Y., Casanova, H.: UMR: A multi-round algorithm for scheduling divisible workloads. In: 17th Int. Parallel and Distributed Processing Symp. (2003)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Micah Adler.

Additional information

A portion of this research was presented at the 15th ACM Symp. on Parallelism in Algorithms and Architectures (2003).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Adler, M., Gong, Y. & Rosenberg, A.L. On “Exploiting” Node-Heterogeneous Clusters Optimally. Theory Comput Syst 42, 465–487 (2008). https://doi.org/10.1007/s00224-007-9001-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00224-007-9001-1

Keywords