Skip to main content
Log in

Scheduling tree dags on parallel architectures

  • Published:
Algorithmica Aims and scope Submit manuscript

Abstract

We provide optimal within a constant explicit upper bounds on the makespan of schedules for tree-structured programs on mesh arrays of processors, and provide polynomial-time algorithms to find schedules with makespan matching these bounds. In particular, we show how to find, in polynomial time, a (nonpreemptive) schedule for a binary tree dag withn unit execution time tasks and heighth on ad-dimensional mesh array withm processors and links of unit bandwidth and unit propagation delay whose makespan isO(n/m+n 1/(d+1)+h), i.e., optimal within a constant factor. Further, we extend these schedules to bounded degree forest dags with arbitrary positive integer execution time tasks and to meshes when the propagation delay of all the links is an arbitrary positive integer. Thus, we provide a polynomial-time approximation algorithm for an NP-hard problem, with a performance ratio that is a constant.

We also show how to schedule tree dags on any parallel architecture that satisfies certain natural, not very restrictive, conditions that are satisfied by most parallel architectures used in practice. Letε be a fixed positive real number. We provide polynomial time computable schedules for binary tree dags withn unit execution time tasks and heighth ∉ (g(n)n −ɛ,g(n) logn) on any parallel architecture satisfying those conditions, with unit bandwidth and unit propagation delay links, with optimal up to a constant makespanO(g(n)+ft), whereg is a function that depends only on that architecture. The number of processors used is optimal within a constant factor ifhg(n)n −ɛ, and is optimal within anO(logn) factor ifh≥g(n)logn. As an example, for hypercube and complete binary tree architectures, we achieve optimal within a constant makespanO(h) whenh=Ω(log2 n), using an optimal within anO(logn) factor number of processors. Further, we extend these schedules to the case of bounded-degree forest dags with tasks of arbitrary positive integer execution times and architectures when the propagation delay of all the links is a given arbitrary positive integer.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. A. Aggarwal, A. K. Chandra, and M. Snir. Communication complexity of PRAMs.Theoretical Computer Science,71 (1990), 3–28.

    Article  Google Scholar 

  2. S. Bhatt, F. Chung, T. Leighton, and A. Rosenberg. Optimal simulations of tree machines.Proc. 27th Annual Symposium on Foundations of Computer Science, 1986, pp. 274–282.

  3. G. Frederickson. Updating of minimum spanning trees, with applications.SIAM Journal on Computing,14(4) (1985), 781–798.

    Article  Google Scholar 

  4. M. R. Garey and D. S. Johnson.Computers and Intractability, A Guide to the Theory of NP-Completeness. Freeman, San Francisco, CA, 1979.

    Google Scholar 

  5. D. Ghosal, A. Mukherjee, R. Thurimella, and Y. Yesha. Mapping task trees onto a linear array.Proc. 1991International Conference on Parallel Processing, Vol. I, 1991, pp. 629–633.

    Google Scholar 

  6. D. Ghosal, A. Mukherjee, R. Thurimella, and Y. Yesha. Scheduling task-trees onto a linear array. Manuscript, 1992.

  7. H. Jung, L. Kirousis, and P. Spirakis. Lower bounds and efficient algorithms for multiprocessor scheduling of dags with communication delays.Proc. ACM Symposium on Parallel Algorithms and Architectures, 1989, pp. 254–264.

  8. K. Kalpakis and Y. Yesha. On the Power of the Linear Array Architecture for Performing Tree-Structured Computations. UMIACS-TR-92-75 CS-TR-2924, University of Maryland at College Park, Institute for Advanced Computer Studies, July 1992.

    Google Scholar 

  9. K. Kalpakis and Y. Yesha. Optimal Within a Constant Schedules for Forest Dags on Parallel Architectures. UMIACS-TR-92-110.1 CS-TR-2974.1, University of Maryland at College Park, Institute for Advanced Computer Studies, October 1992.

    Google Scholar 

  10. S. R. Kosaraju. Parallel evaluation of division-free arithmetic expressions.Proc. 18th Annual ACM Symposium on Theory of Computing, 1986, pp. 231–239.

  11. C. H. Papadimitriou and J. D. Ullman. A communication-time tradeoff.SIAM Journal on Computing,16(4) (1987), 639–646.

    Article  Google Scholar 

  12. C. H. Papadimitriou and M. Yannakakis. Towards an architecture-independent analysis of parallel algorithms.SIAM Journal on Computing,19(2) (1990), 322–328.

    Article  Google Scholar 

  13. R. Thurimella and Y. Yesha. A scheduling principle for precedence graphs with communication delay.Proc. 1992International Conference on Parallel Processing, Vol. III, 1992, pp. 229–236.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Additional information

Communicated by C. H. Papadimitriou.

The second author was supported in part by the National Science Foundation under Grant CCR-9106062, and in part by the University of Maryland at College Park, Institute for Advanced Computer Studies.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kalpakis, K., Yesha, Y. Scheduling tree dags on parallel architectures. Algorithmica 15, 373–396 (1996). https://doi.org/10.1007/BF01961545

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01961545

Key words

Navigation