No abstract available.
Proceeding Downloads
Introducing software pipelining for the A64FX processor into LLVM
Software pipelining is an essential optimization for accelerating High-Performance Computing(HPC) applications on CPUs. Modern CPUs achieve high performance through many-core and wide SIMD instructions. Software pipelining is an optimization that ...
An Overview on Mixing MPI and OpenMP Dependent Tasking on A64FX
- Romain Pereira,
- Adrien Roussel,
- Miwako Tsuji,
- Patrick Carribault,
- Mitsuhisa Sato,
- Hitoshi Murai,
- Thierry Gautier
The adoption of ARM processor architectures is on the rise in the HPC ecosystem. Fugaku supercomputer is a homogeneous ARM-based machine, and is one among the most powerful machine in the world. In the programming world, dependent task-based programming ...
High-throughput drug discovery on the Fujitsu A64FX architecture
High-performance computational kernels that optimally exploit modern vector-capable processors are critical in running large-scale drug discovery campaigns efficiently and promptly compatible with the constraints posed by urgent computing needs. Yet, ...
Impact of Write-Allocate Elimination on Fujitsu A64FX
ARM-based CPU architectures are currently driving massive disruptions in the High Performance Computing (HPC) community. Deployment of the 48-core Fujitsu A64FX ARM architecture based processor in RIKEN “Fugaku” supercomputer (#2 in the June 2023 Top500 ...
First Impressions of the NVIDIA Grace CPU Superchip and NVIDIA Grace Hopper Superchip for Scientific Workloads
The engineering samples of the NVIDIA Grace CPU Superchip and NVIDIA Grace Hopper Superchips were tested using different benchmarks and scientific applications. The benchmarks include HPCC and HPCG. The real application-based benchmark includes AI-...
NVIDIA Grace Superchip Early Evaluation for HPC Applications
- Fabio Banchelli,
- Joan Vinyals-Ylla-Catala,
- Josep Pocurull,
- Marc Clascà,
- Kilian Peiro,
- Filippo Spiga,
- Marta Garcia-Gasulla,
- Filippo Mantovani
Arm-based system in HPC are a reality since more than a decade. However, when a new chip enters the market always implies challenges, not only at ISA level, but also with regards to the SoC integration, the memory subsystem, the board integration, the ...
Performance Evaluation of the Fourth-Generation Xeon with Different Memory Characteristics
At the Supercomputer System of Academic Center for Computing and Media Studies Kyoto University, the fourth-generation Xeon (code-named Sapphire Rapids) is employed. The system consists of two subsystems—one equipped solely with high-bandwidth memory, ...
MPI-Adapter2: An Automatic ABI Translation Library Builder for MPI Application Binary Portability
This paper proposes an automatic MPI ABI (Application Binary Interface) translation library builder named MPI-Adapter2. The container-based job environment is becoming widespread in computer centers. However, when a user uses the container image in ...
Using Intel oneAPI for Multi-hybrid Acceleration Programming with GPU and FPGA Coupling
Intel oneAPI is a programming framework that accepts various accelerators such as GPUs, FPGAs, and multi-core CPUs, with a focus on HPC applications. Users can apply their code written in a single language, DPC++, to this heterogeneous programming ...
Optimize Efficiency of Utilizing Systems by Dynamic Core Binding
Load balancing at both the process and thread levels is imperative for minimizing application computation time in the context of MPI/OpenMP hybrid parallelization. This necessity arises from the constraint that, within a typical hybrid parallel ...
HPCnix: make HPC Apps more easier like shell script
In the area of high-performance computing (HPC), it is expected to extract extreme computing performance using a highly optimized framework without even common OS APIs and frameworks for personal desktops. However, this makes the development cost higher ...
Parallel Multi-Physics Coupled Simulation of a Midrex Blast Furnace
Traditional steelmaking is a major source of carbon dioxide emissions, but green steel production offers a sustainable alternative. Green steel is produced using hydrogen as a reducing agent instead of carbon monoxide, which results in only water vapour ...
The Implementation of Gas-liquid Two-phase Flow Simulations with Surfactant Transport Based on GPU Computing and Adaptive Mesh Refinement
We proposed an implementation for surfactant transport simulations in gas-liquid two-phase flows. This implementation employs a tree-based interface-adapted adaptive mesh refinement (AMR) method, assigning a high-resolution mesh around the interface ...
The Error-Energy Tradeoff in Molecular and Molecular-Continuum Fluid Simulations
Energy consumption plays a crucial role when designing simulation studies. In this work, we take a step towards modelling the relationship between statistical error and energy consumption for molecular and molecular-continuum flow simulations. After ...
Index Terms
- Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops
Recommendations
Acceptance Rates
Year | Submitted | Accepted | Rate |
---|---|---|---|
HPCAsia '23 | 34 | 15 | 44% |
HPCAsia '23 Workshops | 10 | 9 | 90% |
HPCAsia '19 | 32 | 15 | 47% |
HPCAsia '18 | 67 | 30 | 45% |
Overall | 143 | 69 | 48% |