Abstract
We provide optimal within a constant explicit upper bounds on the makespan of schedules for tree-structured programs on mesh arrays of processors, and provide polynomial-time algorithms to find schedules with makespan matching these bounds. In particular, we show how to find, in polynomial time, a (nonpreemptive) schedule for a binary tree dag withn unit execution time tasks and heighth on ad-dimensional mesh array withm processors and links of unit bandwidth and unit propagation delay whose makespan isO(n/m+n 1/(d+1)+h), i.e., optimal within a constant factor. Further, we extend these schedules to bounded degree forest dags with arbitrary positive integer execution time tasks and to meshes when the propagation delay of all the links is an arbitrary positive integer. Thus, we provide a polynomial-time approximation algorithm for an NP-hard problem, with a performance ratio that is a constant.
We also show how to schedule tree dags on any parallel architecture that satisfies certain natural, not very restrictive, conditions that are satisfied by most parallel architectures used in practice. Letε be a fixed positive real number. We provide polynomial time computable schedules for binary tree dags withn unit execution time tasks and heighth ∉ (g(n)n −ɛ,g(n) logn) on any parallel architecture satisfying those conditions, with unit bandwidth and unit propagation delay links, with optimal up to a constant makespanO(g(n)+ft), whereg is a function that depends only on that architecture. The number of processors used is optimal within a constant factor ifh ≤g(n)n −ɛ, and is optimal within anO(logn) factor ifh≥g(n)logn. As an example, for hypercube and complete binary tree architectures, we achieve optimal within a constant makespanO(h) whenh=Ω(log2 n), using an optimal within anO(logn) factor number of processors. Further, we extend these schedules to the case of bounded-degree forest dags with tasks of arbitrary positive integer execution times and architectures when the propagation delay of all the links is a given arbitrary positive integer.
Similar content being viewed by others
References
A. Aggarwal, A. K. Chandra, and M. Snir. Communication complexity of PRAMs.Theoretical Computer Science,71 (1990), 3–28.
S. Bhatt, F. Chung, T. Leighton, and A. Rosenberg. Optimal simulations of tree machines.Proc. 27th Annual Symposium on Foundations of Computer Science, 1986, pp. 274–282.
G. Frederickson. Updating of minimum spanning trees, with applications.SIAM Journal on Computing,14(4) (1985), 781–798.
M. R. Garey and D. S. Johnson.Computers and Intractability, A Guide to the Theory of NP-Completeness. Freeman, San Francisco, CA, 1979.
D. Ghosal, A. Mukherjee, R. Thurimella, and Y. Yesha. Mapping task trees onto a linear array.Proc. 1991International Conference on Parallel Processing, Vol. I, 1991, pp. 629–633.
D. Ghosal, A. Mukherjee, R. Thurimella, and Y. Yesha. Scheduling task-trees onto a linear array. Manuscript, 1992.
H. Jung, L. Kirousis, and P. Spirakis. Lower bounds and efficient algorithms for multiprocessor scheduling of dags with communication delays.Proc. ACM Symposium on Parallel Algorithms and Architectures, 1989, pp. 254–264.
K. Kalpakis and Y. Yesha. On the Power of the Linear Array Architecture for Performing Tree-Structured Computations. UMIACS-TR-92-75 CS-TR-2924, University of Maryland at College Park, Institute for Advanced Computer Studies, July 1992.
K. Kalpakis and Y. Yesha. Optimal Within a Constant Schedules for Forest Dags on Parallel Architectures. UMIACS-TR-92-110.1 CS-TR-2974.1, University of Maryland at College Park, Institute for Advanced Computer Studies, October 1992.
S. R. Kosaraju. Parallel evaluation of division-free arithmetic expressions.Proc. 18th Annual ACM Symposium on Theory of Computing, 1986, pp. 231–239.
C. H. Papadimitriou and J. D. Ullman. A communication-time tradeoff.SIAM Journal on Computing,16(4) (1987), 639–646.
C. H. Papadimitriou and M. Yannakakis. Towards an architecture-independent analysis of parallel algorithms.SIAM Journal on Computing,19(2) (1990), 322–328.
R. Thurimella and Y. Yesha. A scheduling principle for precedence graphs with communication delay.Proc. 1992International Conference on Parallel Processing, Vol. III, 1992, pp. 229–236.
Author information
Authors and Affiliations
Additional information
Communicated by C. H. Papadimitriou.
The second author was supported in part by the National Science Foundation under Grant CCR-9106062, and in part by the University of Maryland at College Park, Institute for Advanced Computer Studies.
Rights and permissions
About this article
Cite this article
Kalpakis, K., Yesha, Y. Scheduling tree dags on parallel architectures. Algorithmica 15, 373–396 (1996). https://doi.org/10.1007/BF01961545
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF01961545