Abstract
This paper describes user-level optimisations for virtual shared memory (VSM) systems and demonstrates performance improvements for three scientific kernel codes written in Fortran-S and running on a 30-node prototype distributed-memory architecture. These optimisations can be applied to any consistency model and directory scheme, whether implemented in hardware or software, that employs an invalidation-based protocol. The semantics of these optimisations are carefully stated. Currently the optimisations are applied by the programmer, but there is much scope for automating this process within a compiler.
This work was funded by the U.K. Meteorological Office, the ESPRIT SODA and the ESPRIT APPARC projects.
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ford, R.W., Nisbet, A.P., Bull, J.M. (1996). User-level VSM optimization and its application. In: Dongarra, J., Madsen, K., Waśniewski, J. (eds) Applied Parallel Computing. Computations in Physics, Chemistry and Engineering Science. PARA 1995. Lecture Notes in Computer Science, vol 1041. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60902-4_26
DOI: https://doi.org/10.1007/3-540-60902-4_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60902-5
Online ISBN: 978-3-540-49670-0
eBook Packages: Springer Book Archive