skip to main content
10.1145/3649169acmconferencesBook PagePublication PagesppoppConference Proceedingsconference-collections
PMAM '24: Proceedings of the 15th International Workshop on Programming Models and Applications for Multicores and Manycores
ACM2024 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
PPoPP '24: The 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming Edinburgh United Kingdom 3 March 2024
ISBN:
979-8-4007-0599-1
Published:
06 March 2024
Sponsors:
Recommend ACM DL
ALREADY A SUBSCRIBER?SIGN IN

Reflects downloads up to 03 Mar 2025Bibliometrics
Abstract

No abstract available.

Skip Table Of Content Section
research-article
Acceleration of the Pre-processing Stage of the MVS Workflow using Graphics Processors

Migrating CPU code to the CUDA programming language has been a challenge for some time. While the code for many high-performance and massively data-parallel applications has been successfully ported to GPUs, this task has received comparatively less ...

research-article
Open Access
Automatic Static Analysis-Guided Optimization of CUDA Kernels

We propose a framework for using static resource analysis to guide the automatic optimization of general-purpose GPU (GPGPU) kernels written in CUDA, NVIDIA's framework for GPGPU programming. In our proposed framework, optimizations are applied to the ...

research-article
Open Access
MUPPET: Optimizing Performance in OpenMP via Mutation Testing

Performance optimization continues to be a challenge in modern HPC software. Existing performance optimization techniques, including profiling-based and auto-tuning techniques, fail to indicate program modifications at the source level thus preventing ...

research-article
Parallel Pattern Language Code Generation

Memory and power constraints limit the current landscape of high-performance computing. Hardware specializations in clusters lead to heterogeneity, Non-Uniform Memory Architecture (NUMA) effects, and accelerator offloading. These increase the complexity ...

research-article
Open Access
Pure C++ Approach to Optimized Parallel Traversal of Regular Data Structures

Many computational problems consider memory throughput a performance bottleneck. The problem becomes even more pronounced in the case of parallel platforms, where the ratio between computing elements and memory bandwidth shifts towards computing. ...

research-article
Open Access
Zero-Overhead Parallel Scans for Multi-Core CPUs

We present three novel parallel scan algorithms for multi-core CPUs which do not need to fix the number of available cores at the start, and have zero overhead compared to sequential scans when executed on a single core. These two properties are in ...

Index terms have been assigned to the content through auto-classification.

Recommendations

Acceptance Rates

Overall Acceptance Rate 53 of 97 submissions, 55%
YearSubmittedAcceptedRate
PMAM '2015853%
PMAM'19171059%
PMAM'1817953%
PMAM'1714750%
PMAM '15341956%
Overall975355%