Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region

HPCAsia '23: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region

February 2023

2023 Proceeding

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

HPC ASIA 2023: International Conference on High Performance Computing in Asia-Pacific Region Singapore Singapore 27 February 2023- 2 March 2023

ISBN:

978-1-4503-9805-3

Published:

27 February 2023

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Get Alerts for this ConferenceAlerts Save to BinderBinder

Save to Binder

Create a New Binder

Name

Export CitationCitation

Share on

Bibliometrics

Citation count

Downloads (6 weeks)

263

Downloads (12 months)

2,229

Downloads (cumulative)

3,175

Sections

HPCAsia '23: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region

2023

Previous Next

Abstract

No abstract available.

Proceeding Downloads

PDFFront matter (Welcome Message from Co-Chairs of the Organizing Committee, Message from the Program Chair, Organization, Sponsors)

Skip Table Of Content Section

Select All

Export Citations Save to Binder

SESSION: Session: Best Paper Finalist

research-article

Reducing shared memory footprint to leverage high throughput on Tensor Cores and its flexible API extension library

Hiroyuki Ootomo,
Rio Yokota

pp 1–8https://doi.org/10.1145/3578178.3578238

Matrix-matrix multiplication is used for various linear algebra algorithms such as matrix decomposition and tensor contraction. NVIDIA Tensor Core is a mixed-precision matrix-matrix multiplication and addition computing unit, where the theoretical peak ...

- 2
- 215
Metrics
Total Citations2
Total Downloads215
Last 12 Months125
Last 6 weeks11

Abstract
Get Access

research-article

Efficient Large Integer Multiplication with Arm SVE Instructions

Takuya Edamatsu,
Daisuke Takahashi

pp 9–17https://doi.org/10.1145/3578178.3578193

In this study, we implement large integer multiplication with the Arm Scalable Vector Extension (SVE) instructions. SVE is a single instruction, multiple data (SIMD) instruction set for the Arm AArch64 architecture. We use a reduced-radix representation ...

- 0
- 211
Metrics
Total Citations0
Total Downloads211
Last 12 Months97
Last 6 weeks13

Abstract
Get Access

research-article

Effectiveness of the Oversubscribing Scheduling on Supercomputer Systems

Shohei Minami,
Toshio Endo,
Akihiro Nomura

pp 18–28https://doi.org/10.1145/3578178.3578221

High responsiveness is substantial for users’ satisfaction in supercomputer systems. Recently, the use of interactive jobs in addition to traditional batch jobs is attracting attention. It is getting important to handle those jobs consolidated for ...

- 1
- 100
Metrics
Total Citations1
Total Downloads100
Last 12 Months43
Last 6 weeks2

Abstract
Get Access

research-article

A new data conversion method for mixed precision Krylov solvers with FP16/BF16 Jacobi preconditioners

Takuya Ina,
Yasuhiro Idomura,
Toshiyuki Imamura,
Naoyuki Onodera

pp 29–34https://doi.org/10.1145/3578178.3578222

Mixed precision Krylov solvers with the Jacobi preconditioner often show significant convergence degradation when the Jacobi preconditioner is computed in low precision such as FP16 and BF16. It is found that this convergence degradation is attributed ...

- 0
- 86
Metrics
Total Citations0
Total Downloads86
Last 12 Months41
Last 6 weeks2

Abstract
Get Access

SESSION: Session: Programming Models and Systems

research-article

Fault Tolerance for Ensemble-based Molecular-Continuum Flow Simulations

Vahid Jafari,
Philipp Neumann

pp 35–45https://doi.org/10.1145/3578178.3578220

Molecular dynamics (MD) simulations exhibit big computational efforts, which makes them very time-consuming. This particularly holds for molecular-continuum simulations in fluid dynamics, which rely on the simulation of MD ensembles that are coupled to ...

- 1
- 72
Metrics
Total Citations1
Total Downloads72
Last 12 Months32
Last 6 weeks2

Abstract
Get Access

research-article

Comparison of Reproducible Parallel Preconditioned BiCGSTAB Algorithm Based on ExBLAS and ReproBLAS

Xiaojun Lei,
Tongxiang Gu,
Stef Graillat,
Xiaowen Xu,
Jing Meng

pp 46–54https://doi.org/10.1145/3578178.3578234

Krylov subspace algorithms are important methods for solving linear systems. In order to efficiently solve large-scale linear systems, parallelism techniques are often applied. However, parallelism often enlarge the non-associativity of floating-point ...

- 1
- 67
Metrics
Total Citations1
Total Downloads67
Last 12 Months33
Last 6 weeks1

Abstract
Get Access

research-article

Open Access

A Case Study on DaCe Portability & Performance for Batched Discrete Fourier Transforms

Måns Ivar Andersson,
Stefano Markidis

pp 55–63https://doi.org/10.1145/3578178.3578239

With the emergence of new computer architectures, portability and performance-portability become significant concerns for developing HPC applications. This work reports our experience and lessons learned using DaCe to create and optimize batched ...

- 1
- 237
Metrics
Total Citations1
Total Downloads237
Last 12 Months177
Last 6 weeks19

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Memory Usage Prediction of HPC Workloads Using Feature Engineering and Machine Learning

Md Nahid Newaz,
Md Atiqul Mollah

pp 64–74https://doi.org/10.1145/3578178.3578241

In High Performance Computing (HPC) systems, numerous applications of varying scale and domain are scheduled to run concurrently, and share the available CPU and memory capacities among themselves. Applications whose run-time memory usage are not known ...

- 1
- 139
Metrics
Total Citations1
Total Downloads139
Last 12 Months89
Last 6 weeks9

Abstract
Get Access

SESSION: Session: Data Storage, Applications, and Algorithms

research-article

Open Access

Associative Operator Precedence Parsing: A Method To Increase Data Parsing Parallelism

Le Li,
Kenjiro Taura

pp 75–87https://doi.org/10.1145/3578178.3578233

Many data often come with a high volume in textual format (JSON, XML, CSV). Because parsing can easily dominate data analysis time, researchers have been working on parallelizing parsing. Operator Precedence Parsing (OPP), among candidate parsing methods,...

- 0
- 634
Metrics
Total Citations0
Total Downloads634
Last 12 Months557
Last 6 weeks49

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Public Access

Fault-Tolerant LOBPCG for Nuclear CI Calculations

Meiyue Shao,
Dossay Oryspayev,
Chao Yang,
Pieter Maris,
Brandon Cook

pp 88–95https://doi.org/10.1145/3578178.3578240

Exascale computing platforms with millions of compute units and with thousands of nodes are predicted to experience frequent faults which interrupt applications’ execution. In this context resilience against faults becomes important. We examine user and ...

- 1
- 84
Metrics
Total Citations1
Total Downloads84
Last 12 Months29
Last 6 weeks7

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Parallelization of Automatic Tuning for Hyperparameter Optimization of Pedestrian Route Prediction Applications using Machine Learning

Sorataro Fujika,
Yuga Yajima,
Teruo Tanaka,
Akihiro Fujii,
Yuka Kato,
Satoshi Ohshima,
Takahiro Katagiri

pp 96–105https://doi.org/10.1145/3578178.3578235

We study software automatic tuning. Automatic tuning tools using iterative one-dimensional search estimate hyperparameters of machine learning programs. Iterative one-dimensional search searches the parameter space consisting of possible values of the ...

- 0
- 67
Metrics
Total Citations0
Total Downloads67
Last 12 Months24
Last 6 weeks0

Abstract
Get Access

SESSION: Session: Architectures and Networks

research-article

Open Access

LibCOS: Enabling Converged HPC and Cloud Data Stores with MPI

Daniel Araújo De Medeiros,
Stefano Markidis,
Ivy Bo Peng

pp 106–116https://doi.org/10.1145/3578178.3578236

Recently, federated HPC and cloud resources are becoming increasingly strategic for providing diversified and geographically available computing resources. However, accessing data stores across HPC and cloud storage systems is challenging. Many cloud ...

- 0
- 401
Metrics
Total Citations0
Total Downloads401
Last 12 Months330
Last 6 weeks79

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Open Access

GPU–FPGA-accelerated Radiative Transfer Simulation with Inter-FPGA Communication

Ryohei Kobayashi,
Norihisa Fujita,
Yoshiki Yamaguchi,
Taisuke Boku,
Kohji Yoshikawa,
Makito Abe,
Masayuki Umemura

pp 117–125https://doi.org/10.1145/3578178.3578231

The complementary use of graphics processing units (GPUs) and field programmable gate arrays (FPGAs) is a major topic of interest in the high-performance computing (HPC) field. GPU–FPGA-accelerated computing is an effective tool for multiphysics ...

- 2
- 271
Metrics
Total Citations2
Total Downloads271
Last 12 Months199
Last 6 weeks34

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Exploiting Data Parallelism in Graph-Based Simultaneous Localization and Mapping: A Case Study with GPU Accelerations

Junyuan Zheng,
Yuan He,
Masaaki Kondo

pp 126–139https://doi.org/10.1145/3578178.3578237

Graph-based simultaneous localization and mapping (G-SLAM) is an intuitive SLAM implementation where graphs are used to represent poses, landmarks and sensor measurements when a mobile robot builds a map of the environment and locates itself in it. ...

- 1
- 67
Metrics
Total Citations1
Total Downloads67
Last 12 Months28
Last 6 weeks4

Abstract
Get Access

research-article

Open Access

ESSPER: Elastic and Scalable FPGA-Cluster System for High-Performance Reconfigurable Computing with Supercomputer Fugaku

Kentaro Sano,
Atsushi Koshiba,
Takaaki Miyajima,
Tomohiro Ueno

pp 140–150https://doi.org/10.1145/3578178.3579341

FPGA clusters have yet to be a mainstream of HPC, even for accelerators, and several challenges exist in their architecture and system organization. This work presents ESSPER, a flexible and scalable FPGA cluster prototype system for reconfigurable HPC ...

- 5
- 464
Metrics
Total Citations5
Total Downloads464
Last 12 Months396
Last 6 weeks29

More
- View online with eReader
- Abstract
HTML
PDF

Save to Binder

Create a New Binder

Name

Index Terms

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region
1. General and reference
2. Software and its engineering
  1. Software organization and properties
    1. Contextual software domains
      1. Operating systems

Index terms have been assigned to the content through auto-classification.

Recommendations

UbiMob '05: Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing
Read More
UbiMob '08: Proceedings of the 4th French-speaking conference on Mobility and ubiquity computing
Read More
UbiMob '09: Proceedings of the 5th French-Speaking Conference on Mobility and Ubiquity Computing
Read More

Acceptance Rates

HPCAsia '23 Paper Acceptance Rate15of34submissions,44%Overall Acceptance Rate69of143submissions,48%

Year	Submitted	Accepted	Rate
HPCAsia '23	34	15	44%
HPCAsia '23 Workshops	10	9	90%
HPCAsia '19	32	15	47%
HPCAsia '18	67	30	45%
Overall	143	69	48%

Comments

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation

Save to Binder

Sections

Proceeding Downloads

Save to Binder

Index Terms

Recommendations

UbiMob '05: Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing

UbiMob '08: Proceedings of the 4th French-speaking conference on Mobility and ubiquity computing

UbiMob '09: Proceedings of the 5th French-Speaking Conference on Mobility and Ubiquity Computing

Acceptance Rates