Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region

HPCAsia '24: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region

January 2024

2024 Proceeding

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

HPCAsia 2024: International Conference on High Performance Computing in Asia-Pacific Region Nagoya Japan January 25 - 27, 2024

ISBN:

979-8-4007-0889-3

Published:

19 January 2024

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Get Alerts for this ConferenceAlerts Save to BinderBinder

Save to Binder

Create a New Binder

Name

Export CitationCitation

Share on

Bibliometrics

Citation count

Downloads (6 weeks)

567

Downloads (12 months)

2,613

Downloads (cumulative)

2,613

Sections

HPCAsia '24: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region

2024

Previous Next

Abstract

No abstract available.

Proceeding Downloads

PDFFront matter (Welcome Message from Workshop Chair, Messages from the Organizer, Organization, Sponsors)

Skip Table Of Content Section

Select All

Export Citations Save to Binder

SESSION: Session: Best Paper Finalists – 1 Programming Models and System Software

research-article

Open Access

Non-Blocking GPU-CPU Notifications to Enable More GPU-CPU Parallelism

Bengisu Elis,
Olga Pearce,
David Boehme,
Jason Burmark,
Martin Schulz

pp 1–11https://doi.org/10.1145/3635035.3635036

GPUs are increasingly popular in HPC systems, and more applications are adopting GPUs each day. However, the control synchronization of GPUs with CPUs is suboptimal and only possible after GPU kernel termination points, resulting in serialized host and ...

- 0
- 201
Metrics
Total Citations0
Total Downloads201
Last 12 Months201
Last 6 weeks35

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Open Access

Portable Implementations of Work Stealing

Masahiro Yasugi,
Tasuku Hiraishi,
Chihiro Takeuchi

pp 12–22https://doi.org/10.1145/3635035.3635041

Work stealing is a well-known technique for dynamic load balancing; however, manually writing work-stealing protocols is error-prone. We can use the Tascell parallel programming language for the correct and portable implementation of work stealing; the ...

- 0
- 108
Metrics
Total Citations0
Total Downloads108
Last 12 Months108
Last 6 weeks16

More
- View online with eReader
- Abstract
HTML
PDF

research-article

sKokkos: Enabling Kokkos with Transparent Device Selection on Heterogeneous Systems using OpenACC

Pedro Valero-Lara,
Seyong Lee,
Joel Denny,
Keita Teranishi,
Jeffrey Vetter,
Marc Gonzalez-Tallada

pp 23–34https://doi.org/10.1145/3635035.3635043

This paper presents a new feature to enable Kokkos with transparent device selection. For application developers, it is not easy to identify which device is the most appropriate to use in a heterogeneous system, since this depends on the characteristics ...

- 0
- 91
Metrics
Total Citations0
Total Downloads91
Last 12 Months91
Last 6 weeks9

Abstract
Get Access

SESSION: Session: Best Paper Finalists – 2 Application and Algorithms

research-article

Open Access

Parallelized Remapping Algorithms for km-scale Global Weather and Climate Simulations with Icosahedral Grid System

Chihiro Kodama,
Hisashi Yashiro,
Takashi Arakawa,
Daisuke Takasuka,
Shuhei Matsugishi,
Hirofumi Tomita

pp 35–46https://doi.org/10.1145/3635035.3635040

In weather and climate research, latitude–longitude grid data are typically used for analysis and visualization, and remapping from model native grids to latitude–longitude grids typically requires a significant amount of time. Here, we developed a ...

- 0
- 144
Metrics
Total Citations0
Total Downloads144
Last 12 Months144
Last 6 weeks39
- 1
Supplementary Material
3635040-corrigendum.pdf

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Approximate Block Diagonalization of Symmetric Matrices Using Quantum Annealing

Koushi Teramoto,
Masaki Kugaya,
Shuhei Kudo,
Yasuhiko Takenaga,
Yusaku Yamamoto

pp 47–54https://doi.org/10.1145/3635035.3635044

We consider the problem of transforming a given symmetric matrix into a nearly block diagonal form by permutation of its rows and columns. Such a transformation is useful as preconditioning to accelerate the convergence of an eigenvalue solver, but the ...

- 0
- 84
Metrics
Total Citations0
Total Downloads84
Last 12 Months84
Last 6 weeks2

Abstract
Get Access

research-article

QUBO formulation using inequalities for problems with complex constraints

Tomoko Komiyama,
Tomohiro Suzuki

pp 55–61https://doi.org/10.1145/3635035.3635042

Quantum annealing is an optimization technique that uses quantum fluctuation effects to search for solutions and is being applied as a metaheuristic method. Quantum annealing solves a problem expressed as quadratic unconstrained binary optimization (...

- 0
- 94
Metrics
Total Citations0
Total Downloads94
Last 12 Months94
Last 6 weeks13

Abstract
Get Access

SESSION: Session: Research Paper – 1 Architectures and Networks

research-article

Evaluation of POSIT Arithmetic with Accelerators

Naohito Nakasato,
Yuki Murakami,
Fumiya Kono,
Maho Nakata

pp 62–72https://doi.org/10.1145/3635035.3635046

We present an evaluation of 32-bit POSIT arithmetic through its implementation as accelerators on FPGAs and GPUs. POSIT, a floating-point number format, adaptively changes the size of its fractional part. We developed hardware designs for FPGAs and ...

- 0
- 89
Metrics
Total Citations0
Total Downloads89
Last 12 Months89
Last 6 weeks9

Abstract
Get Access

research-article

Open Access

Low-latency Communication in RISC-V Clusters

Michalis Gianioudis,
Pantelis Xirouchakis,
Charisios Loukas,
Evangelos Mageiropoulos,
Orestis Mousouros,
Sokratis Mpartzis,
Aggelos Ioannou,
Vassilis Papaefstathiou,
Manolis Katevenis,
Nikolaos Chrysos

pp 73–83https://doi.org/10.1145/3635035.3635050

Low-latency inter-node communication is important in HPC clusters. In this work, we design and integrate a low-cost interconnect, capable for low-latency user-level communication with open-source RISC-V processors, obviating the need for bulky and ...

- 0
- 337
Metrics
Total Citations0
Total Downloads337
Last 12 Months337
Last 6 weeks103
- 1
Supplementary Material
p73-gianioudis-corrigendum

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Open Access

Flexible Systolic Array Platform on Virtual 2-D Multi-FPGA Plane

Tomohiro Ueno,
Emanuele Del Sozzo,
Kentaro Sano

pp 84–94https://doi.org/10.1145/3635035.3637285

Systolic arrays are a promising approach to achieving high-performance processing based on highly parallelized designs in various fields, such as AI and bioinformatics. Many previous studies have devoted considerable effort to exploring efficient ...

- 0
- 174
Metrics
Total Citations0
Total Downloads174
Last 12 Months174
Last 6 weeks39

More
- View online with eReader
- Abstract
HTML
PDF

SESSION: Session: Research Paper – 2 Parallelism

research-article

Open Access

An Efficient Task-Parallel Pipeline Programming Framework

Cheng-Hsiang Chiu,
Zhicheng Xiong,
Zizheng Guo,
Tsung-Wei Huang,
Yibo Lin

pp 95–106https://doi.org/10.1145/3635035.3635037

The pipeline is a fundamental pattern to parallelize a series of stage tasks over a sequence of data in loops. Mainstream pipeline programming frameworks count on data abstractions to perform pipeline scheduling. Although this design is convenient for ...

- 0
- 151
Metrics
Total Citations0
Total Downloads151
Last 12 Months151
Last 6 weeks47

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Task-based low-rank hybrid parallel Cholesky factorization for distributed memory environment

Han Jiao,
Jilin Zhang,
Tomohiro Suzuki

pp 107–116https://doi.org/10.1145/3635035.3635039

The primary targets for improving efficiency for large-scale matrix factorization are reducing synchronization, addressing the overlap in communication and computation, and improving load balance. In recent years, tiled algorithms with task parallelism ...

- 0
- 78
Metrics
Total Citations0
Total Downloads78
Last 12 Months78
Last 6 weeks7

Abstract
Get Access

research-article

AshPipe: Asynchronous Hybrid Pipeline Parallel for DNN Training

Ryubu Hosoki,
Toshio Endo,
Takahiro Hirofuchi,
Tsutomu Ikegami

pp 117–126https://doi.org/10.1145/3635035.3635045

Deep Neural Networks (DNNs) have become increasingly computationally intensive and have larger parameters, requiring efficient parallelization or distribution using multiple accelerators. Pipeline parallelism has been proposed as an effective way to ...

- 0
- 107
Metrics
Total Citations0
Total Downloads107
Last 12 Months107
Last 6 weeks11

Abstract
Get Access

SESSION: Session: Research Paper – 3 GPU Computing

research-article

Open Access

Bruck Algorithm Performance Analysis for Multi-GPU All-to-All Communication

Andres Sewell,
Ke Fan,
Ahmedur Rahman Shovon,
Landon Dyken,
Sidharth Kumar,
Steve Petruzza

pp 127–133https://doi.org/10.1145/3635035.3635047

In high-performance computing, collective communication is critical for facilitating comprehensive data exchange involving all processes within an MPI communicator. Due to their inherently global nature, many collective operations present scalability ...

- 0
- 391
Metrics
Total Citations0
Total Downloads391
Last 12 Months391
Last 6 weeks192

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Efficient GPU-Implementation of H-P Sort Based on Improved Histogram Computation

Kaito Takase,
Takumi Hagihara,
Noriyuki Fujimoto,
Koichi Wada

pp 134–144https://doi.org/10.1145/3635035.3635051

We present an enhanced GPU implementation of the H-P sort algorithm, which is a widely used method for integer sorting based on histogram computation and prefix sum calculation. This work extends a previous high-performance GPU version of the algorithm, ...

- 0
- 53
Metrics
Total Citations0
Total Downloads53
Last 12 Months53
Last 6 weeks5

Abstract
Get Access

SESSION: Session: Research Paper – 4 Applications

research-article

Eulerian elastoplastic simulation of vehicle structures by building-cube method on supercomputer Fugaku

Koji Nishiguchi,
Shusuke Takeuchi,
Hirofumi Sugiyama,
Shigenobu Okazawa,
Tadasuke Katsuhara,
Keiichi Yonehara,
Shigeki Kojima,
Kosho Kawahara,
Hiroya Hoshiba,
Junji Kato

pp 145–153https://doi.org/10.1145/3635035.3635038

This paper presents a novel numerical method for the elastoplastic simulation of vehicle component structures under large deformation problems, such as crash-worthiness analysis. Elastoplastic simulation of vehicle structures is essential for designing ...

- 0
- 82
Metrics
Total Citations0
Total Downloads82
Last 12 Months82
Last 6 weeks5

Abstract
Get Access

research-article

Open Access

Analysis Towards Energy-Aware Image-based In Situ Visualization on the Fugaku

Razil Tahir,
Jorji Nonaka,
Ken Iwata,
Taisei Matsushima,
Naohisa Sakamoto,
Chongke Bi,
Masahiro Nakao,
Hitoshi Murai

pp 154–163https://doi.org/10.1145/3635035.3635048

Energy efficiency has become a serious concern when running applications on HPC systems. Although these systems were designed to mainly run simulation codes as fast as possible, due to the ever-increasing size of the simulation outputs, the in situ ...

- 0
- 135
Metrics
Total Citations0
Total Downloads135
Last 12 Months135
Last 6 weeks30

More
- View online with eReader
- Abstract
HTML
PDF

research-article

Information Entropy-based Camera Focus Point and Zoom Level Adjustment for Smart In-Situ Visualization

Taisei Matsushima,
Ken Iwata,
Naohisa Sakamoto,
Jorji Nonaka,
Chongke Bi

pp 164–173https://doi.org/10.1145/3635035.3635049

With the recent developments in computational science and HPC technology, large-scale numerical simulations have become common in various scientific and technological fields. The output volume data from these simulations have also become larger and more ...

- 0
- 56
Metrics
Total Citations0
Total Downloads56
Last 12 Months56
Last 6 weeks5

Abstract
Get Access

Save to Binder

Create a New Binder

Name

Index Terms

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region

Index terms have been assigned to the content through auto-classification.

Recommendations

HPCAsia '24 Workshops: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops
Read More
UbiMob '05: Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing
Read More
UbiMob '08: Proceedings of the 4th French-speaking conference on Mobility and ubiquity computing
Read More

Acceptance Rates

Overall Acceptance Rate69of143submissions,48%

Year	Submitted	Accepted	Rate
HPCAsia '23	34	15	44%
HPCAsia '23 Workshops	10	9	90%
HPCAsia '19	32	15	47%
HPCAsia '18	67	30	45%
Overall	143	69	48%

Comments

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation

Save to Binder

Sections

Proceeding Downloads

Save to Binder

Index Terms

Recommendations

HPCAsia '24 Workshops: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops

UbiMob '05: Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing

UbiMob '08: Proceedings of the 4th French-speaking conference on Mobility and ubiquity computing

Acceptance Rates