Abstract
It is proved that “FIFO” worksharing protocols provide asymptotically optimal solutions to two problems related to sharing large collections of independent tasks in a heterogeneous network of workstations (HNOW) \(\mathcal{N}\). In the HNOW-Exploitation Problem, one seeks to accomplish as much work as possible on \(\mathcal{N}\) during a prespecified fixed period of \(L\) time units. In the HNOW-Rental Problem, one seeks to complete \(W\) units of work by “renting” \(\mathcal{N}\) for as short a time as necessary. The worksharing protocols we study are crafted within an architectural model that characterizes \(\mathcal{N}\) via parameters that measure the computational and communicational powers of \(\mathcal{N}\)’s workstations. All valid protocols are self-scheduling, in the sense that they determine completely both an amount of work to allocate to each of \(\mathcal{N}\)’s workstations and a schedule for all related interworkstation communications. The schedules provide either a value for \(W\) given \(L\), or a value for \(L\) given \(W\), and hence solve both of the motivating problems. A protocol observes a FIFO regimen if it has \(\mathcal{N}\)’s workstations finish their assigned work, and return their results, in the same order in which they are supplied with their workloads. The proven optimality of FIFO protocols resides in the fact that they accomplish at least as much work as any other protocol during all sufficiently long worksharing episodes, and that they complete sufficiently large given collections of tasks at least as fast as any other protocol. Simulation experiments illustrate that the superiority of FIFO protocols is often observed during worksharing episodes of only a few minutes’ duration.
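To make the FIFO regimen concrete, the following is a minimal sketch under a deliberately simplified model that is an assumption of this example, not the architectural model of the paper. A single master sends work to workstations one after another over a shared channel and receives results back in the same order, with no idle gaps. The parameters `rho` (time per unit of work on each workstation), `tau` (channel time per unit of work sent), and `delta` (size of results relative to the work sent), as well as the function names, are all hypothetical. Under these assumptions each workstation's timeline (wait for and receive work, compute, wait for and return results) must fill exactly the lifespan, which gives a linear system that reduces to a simple recurrence.

```python
def fifo_allocations(rho, tau, delta, lifespan):
    """Work allocations for a FIFO schedule in the simplified model.

    Assumed model (hypothetical, for illustration only):
      rho[i]   : time workstation i needs per unit of work
      tau      : channel time to send one unit of work
      delta    : result size as a fraction of the work size
      lifespan : length L of the worksharing episode
    Workstation i's timeline must satisfy
      tau*sum(w[:i+1]) + rho[i]*w[i] + tau*delta*sum(w[i:]) == lifespan.
    """
    # Subtracting consecutive timeline equations gives
    # (rho[i-1] + tau*delta) * w[i-1] == (rho[i] + tau) * w[i].
    ratios = [1.0]
    for i in range(1, len(rho)):
        ratios.append(ratios[-1] * (rho[i - 1] + tau * delta) / (rho[i] + tau))

    # Fix the overall scale using the first workstation's timeline.
    scale = lifespan / ((tau + rho[0]) * ratios[0] + tau * delta * sum(ratios))
    work = [scale * r for r in ratios]

    # The shared channel must fit all sends followed by all returns.
    if tau * (1 + delta) * sum(work) > lifespan * (1 + 1e-12):
        raise ValueError("no gap-free FIFO schedule for these parameters")
    return work


def rental_time(rho, tau, delta, total_work):
    """Lifespan needed to complete total_work (the 'rental' direction).

    Allocations scale linearly with the lifespan in this model, so the
    work done in one time unit determines the answer.
    """
    work_per_time_unit = sum(fifo_allocations(rho, tau, delta, 1.0))
    return total_work / work_per_time_unit


if __name__ == "__main__":
    rho = [1.0, 2.0, 4.0]  # hypothetical workstation speeds
    work = fifo_allocations(rho, tau=0.1, delta=0.5, lifespan=100.0)
    print("allocations:", work)
    print("total work W for L = 100:", sum(work))
    print("L needed for that W:", rental_time(rho, 0.1, 0.5, sum(work)))
```

The two functions mirror the two directions described in the abstract: `fifo_allocations` yields a total amount of work for a given lifespan, and `rental_time` yields a lifespan for a given amount of work. The sketch says nothing about optimality; it only shows how a FIFO schedule can be computed once a model is fixed.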
Additional information
A portion of this research was presented at the 15th ACM Symp. on Parallelism in Algorithms and Architectures (2003).
About this article
Cite this article
Adler, M., Gong, Y. & Rosenberg, A.L. On “Exploiting” Node-Heterogeneous Clusters Optimally. Theory Comput Syst 42, 465–487 (2008). https://doi.org/10.1007/s00224-007-9001-1
DOI: https://doi.org/10.1007/s00224-007-9001-1