Skip to main content

A compilation technique for varying communication cost NUMA architectures

  • Conference paper
  • First Online:
PARLE'94 Parallel Architectures and Languages Europe (PARLE 1994)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 817))

  • 133 Accesses

Abstract

In an earlier work, a Threshold Scheduling Algorithm was proposed to schedule the functional parallelism in a program on distributed memory systems. In this work, we address the issue of regeneration of the schedule for a set of distributed memory architectures with different communication costs. A new concept of dominant edges of a schedule is introduced to denote those edges which dictate schedule regeneration due to the changes in their communication costs. It is shown that under certain conditions, schedule on the whole or at least part of the graph can be reused for a different architecture reducing the cost of program re-partitioning and re-scheduling. The usefulness of this method is demonstrated by incorporating it in the scheduler of the compiler backend for targeting Sisal (Streams and Iterations in a Single Assignment Language) on a family of Intel i860 architectures: Gamma, Delta and Paragon which vary in their communication costs. It is shown that almost 30 to 65 % of the schedule can be reused, thereby, avoiding program re-partitioning to a large degree.

The authors gratefully acknowledge the support of Lawrence Livermore National Laboratory for allowing the use of Sisal compiler source code

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. F. Berman and L. Snyder, “On Mapping Parallel Algorithms To Parallel Architectures”, Journal of Parallel and Distributed Computing, vol. 4, 1987, pp. 439–458.

    Article  Google Scholar 

  2. D. C. Cann, “Retire Fortran? A Debate Rekindled”, Communications of ACM, August 1992, Vol. 35, No. 8, pp. 81–89.

    Article  Google Scholar 

  3. T. Casavant and J. A. Kuhl, “Taxonomy Of Scheduling In General Purpose Distributed Computing Systems”, IEEE Transactions on Software Engineering, vol. 14, no. 2, February 1988, pp. 141–154.

    Article  Google Scholar 

  4. R. Cole, “Approximate Parallel Scheduling: Applications To Logarithmic-time Optimal Parallel Graph Algorithms”, Information and Computation, Vol. 92, No. 1, May 1991.

    Google Scholar 

  5. H. El-Rewini and T. G. Lewis, “Scheduling Parallel Program Tasks Onto Arbitrary Target Machines”, Journal of Parallel and Distributed Computing, vol. 9, no. 2, June 1990, pp. 138–153.

    Article  Google Scholar 

  6. J. T. Feo, D. C. Cann and R. R. Oldehoeft, “A Report on Sisal Language Project”, Journal of Parallel and Distributed Computing, Vol. 10, no. 4, October 1990, pp. 349–366.

    Article  Google Scholar 

  7. A. Gerasoulis and T. Yang, “A Comparison Of Clustering Heuristics For Scheduling Directed Acyclic Graphs On Multiprocessors”, Journal Of Parallel And Distributed Computing, Vol. 16, No. 4, December 1992, pp. 276–291.

    Google Scholar 

  8. R. L. Graham, E. L. Lawler, J. K. Lenstra and A. H. G. Rinnooy Kan, “Optimization And Approximation In Deterministic Sequencing And Scheduling: A Survey”, Annals of Discrete Mathematics, pages 287–326, North-Holland Publishing Company, 1979.

    Google Scholar 

  9. C. H. Papadimitriou and M. Yannakakis, “Towards An Architecture Independent Analysis of Parallel Algorithms”, SIAM Journal of Computing, Vol. 19, No. 2, April 1990, pp. 322–328.

    Article  Google Scholar 

  10. S. S. Pande, D. P. Agrawal and J. Mauney, “A Fully Automatic Compiler For Distributed Memory Machines”, Proceedings Of The 26th Hawaii International Conference On System Sciences, January 1993, pp. 536–545.

    Google Scholar 

  11. V. Sarkar, “Partitioning And Scheduling Of Parallel Programs For Execution On Multiprocessors”, Ph. D. Thesis, Stanford University, 1987.

    Google Scholar 

  12. B. Shirazi, M. Wang and G. Pathak, “Analysis And Evaluation Of Heuristic Methods For Static Task Scheduling”, Journal of Parallel and Distributed Computing, vol. 10, no. 3, November 1990, pp. 222–232.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Costas Halatsis Dimitrios Maritsas George Philokyprou Sergios Theodoridis

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pande, S., Psarris, K. (1994). A compilation technique for varying communication cost NUMA architectures. In: Halatsis, C., Maritsas, D., Philokyprou, G., Theodoridis, S. (eds) PARLE'94 Parallel Architectures and Languages Europe. PARLE 1994. Lecture Notes in Computer Science, vol 817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58184-7_89

Download citation

  • DOI: https://doi.org/10.1007/3-540-58184-7_89

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-58184-0

  • Online ISBN: 978-3-540-48477-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics