skip to main content
10.1145/2597917.2597945acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
research-article

Hardware support for address mapping in PGAS languages: a UPC case study

Published: 20 May 2014 Publication History

Abstract

The Partitioned Global Address Space (PGAS) programming model strikes a balance between the explicit, locality-aware, message-passing model and locality-agnostic, but easy-to-use, shared memory model (e.g. OpenMP). However, the PGAS memory model comes at a performance cost which limits both scalability and performance. Compiler optimizations are often not sufficient and manual optimizations are needed which considerably limit the productivity advantage. This paper proposes a hardware architectural support for PGAS, which allows the processor to efficiently handle shared addresses through new instructions. A prototype compiler is realized allowing to use the support with unmodified code, preserving the PGAS productivity advantage. Speedups of up to 5.5x are demonstrated on the unmodified NAS Parallel Benchmarks using the Gem5 full system simulator.

References

[1]
D. H. Bailey, T. Harris, W. Saphir, R. van der Wijngaart, A. Woo, and M. Yarrow. The NAS parallel benchmarks 2.0", nas-95-020. Technical report, Moffett Field, CA, USA, 1995.
[2]
N. Binkert, B. Beckmann, G. Black, S. K. Reinhardt, A. Saidi, A. Basu, J. Hestness, D. R. Hower, T. Krishna, S. Sardashti, R. Sen, K. Sewell, M. Shoaib, N. Vaish, M. D. Hill, and D. A. Wood. The gem5 simulator. SIGARCH Comput. Archit. News, 39(2):1--7, August 2011.
[3]
T. El-Ghazawi and F. Cantonnet. UPC performance and potential: A NPB experimental study. In Proceedings of the ACM/IEEE conference on Supercomputing, pages 1--26. IEEE Computer Society Press Los Alamitos, CA, USA, 2002.
[4]
M. M. Mueller. Efficient address translation. Interner Bericht. Universität Karlsruhe, Fakultät für Informatik; 2000, 12, 2000.
[5]
UPC Consortium. UPC language specifications v1.2, May 2005.
[6]
UPC NAS Parallel Benchmarks. threads.seas.gwu.edu/sites/npb-upc.

Cited By

View all
  • (2017)HPC-Oriented Toolchain for Hardware Simulators2017 IEEE International Conference on Cluster Computing (CLUSTER)10.1109/CLUSTER.2017.108(653-654)Online publication date: Sep-2017
  • (2015)Assessing memory access performance of chapel through synthetic benchmarksProceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing10.1109/CCGrid.2015.157(1147-1150)Online publication date: 4-May-2015

Index Terms

  1. Hardware support for address mapping in PGAS languages: a UPC case study

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        CF '14: Proceedings of the 11th ACM Conference on Computing Frontiers
        May 2014
        305 pages
        ISBN:9781450328708
        DOI:10.1145/2597917
        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 20 May 2014

        Check for updates

        Qualifiers

        • Research-article

        Conference

        CF'14
        Sponsor:
        CF'14: Computing Frontiers Conference
        May 20 - 22, 2014
        Cagliari, Italy

        Acceptance Rates

        CF '14 Paper Acceptance Rate 28 of 62 submissions, 45%;
        Overall Acceptance Rate 273 of 785 submissions, 35%

        Upcoming Conference

        CF '25

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)3
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 01 Mar 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2017)HPC-Oriented Toolchain for Hardware Simulators2017 IEEE International Conference on Cluster Computing (CLUSTER)10.1109/CLUSTER.2017.108(653-654)Online publication date: Sep-2017
        • (2015)Assessing memory access performance of chapel through synthetic benchmarksProceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing10.1109/CCGrid.2015.157(1147-1150)Online publication date: 4-May-2015

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media