Abstract
This paper addresses the exploitation of overlapping communication with calculation within parallel FORTRAN 77 codes for Computational Fluid Dynamics (CFD) and Computational Structured Dynamics (CSD). The obvious objective is to overlap interprocessor communication with calculation on each processor in a distributed memory parallel system and so improve the efficiency of the parallel implementation. A general strategy for converting synchronous to overlapped communication is presented together with tools to enable its automatic implementation in FORTRAN 77 codes. This strategy is then implemented within the parallelisation toolkit, CAPTools, to facilitate the automatic generation of parallel code with overlapped communications. The success of these tools are demonstrated on two codes from the NAS-PAR and PERFECT benchmark suites. In each case, the tools produce parallel code with overlapped communications which is as good as that which could be generated manually. The parallel performance of the codes also improve in line with expectation.
Preview
Unable to display preview. Download preview PDF.
References
Berry M. et al, The PERFECT Club Benchmarks: Effective Performance Evaluation of Supercomputers. CSRD Technical Report 827, Center for Supercomputing Research and Development, University of Illinois. May 1989.
Bertsekas D.P. and Tsitsiklis J.N., Parallel and Distributed Computation: Numerical Methods, Prentice-Hall, Englewood Cliffs, NJ. 1989.
Evans E.W., Strategies and Tools for the Exploitation of Massively Parallel Computer Systems. PhD Thesis. University of Greenwich. In Preparation.
Holm J., Lain A. and Banerjee P., Compilation of Scientific Programs into Multithreaded and Message Driven Computation, pp 518–525, In Proceedings of the 1994 Scalable High Performance Computing Conference, Knoxville, TN. May 1994.
Ierotheou C.S., Johnson S.P., Cross M. and Leggett P.F., Computer Aided Parallelisation Tools (CAPTools) — Conceptual Overview and Performance On The Parallelisation of Structured Mesh Codes, Parallel Computing 22:163–195. March 1996.
Johnson S.P., Cross M. and Everett M., Exploitation Of Symbolic Information In Interprocedural Dependence Analysis, Parallel Computing 22:197–226. March 1996.
Johnson S.P., Ierotheou C.S. and Cross M., Automatic Parallel Code Generation For Message Passing On Distributed Memory Systems, Parallel Computing 22:227–258. March 1996.
Kennedy K. and Nedeljković N., Combining Dependence and Data-Flow Analyses to Optimize Communication, To appear in the Proceedings of the 9th International Parallel Processing Symposium, IPPS'95. September 1994.
Leggett P.F., Marsh A.T.J., Johnson S.P and Cross M., Integrating User Knowledge With Information From Parallelisation Tools To Facilitate The Automatic Generation Of Efficient Parallel FORTRAN Code, Parallel Computing 22:259–288. March 1996.
McManus K., A Strategy for Mapping Unstructured Mesh Computational Mechanics Programs Onto Distributed Memory Parallel Architectures. PhD Thesis. University of Greenwich. 1996.
Padua D.A. and Wolfe M.J., Advanced Compiler Optimisations For Supercomputers, pp 1184–1201, Communications Of The ACM. 1986.
Saphir W., Woo A. and Yarrow M., NAS Parallel Benchmarks 2.1 Results. Technical Report NAS-96-010, NASA Ames Research Center. August 1996.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Evans, E.W., Johnson, S.P., Leggett, P.F., Cross, M. (1997). Overlapped communications automatically generated in a parallelisation tool. In: Hertzberger, B., Sloot, P. (eds) High-Performance Computing and Networking. HPCN-Europe 1997. Lecture Notes in Computer Science, vol 1225. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0031651
Download citation
DOI: https://doi.org/10.1007/BFb0031651
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62898-9
Online ISBN: 978-3-540-69041-2
eBook Packages: Springer Book Archive