skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Educating HPC Users in the use of advanced computing technology

Conference · · 2021 IEEE/ACM Ninth Workshop on Education for High Performance Computing (EduHPC)

We examine a multi-modal approach to educating and training users of an advanced computing technology testbed at the Institute for Advanced Computational Science at Stony Brook University. Ookami provides researchers worldwide with access to 176 Fujitsu A64FX compute nodes, this being the same processor technology powering the Japanese Fugaku supercomputer, the fastest computer in the world since June 2020. However, achieving high-performance on this Arm-based, leadership computing technology requires that users be familiar with details of computer architecture, performance analysis and modeling, and high-performance programming models that are commonly omitted in introductory programming courses. Indeed, regardless of their seniority, many of the testbed users are surprisingly unfamiliar with basic concepts such as vectorization, pipelining, latency/bandwidth, roofline models, computing energy/power, threads, and non-uniform memory access. These same concepts also pervade mainstream x86 technologies, so this is of widespread concern. Due to the national/global nature of our user community that is also very diverse in both discipline and experience, the inability to offer formal classes, and our experience that most people do not tend to read online documentation or training materials in sufficient depth, we have consciously employed multiple approaches that heavily emphasize (online) personal interactions and transfer of skills. Online documentation has been organized around best-practices and FAQs; twice-weekly hackathons and office hours via Zoom enable deep dives by both the team and the user community with multiple broad benefits; a Slack channel provides both real time and archived answers and discussions; and workshops, training and webinars target community needs as they arise. Furthermore, the perspective that these tools are being used in an educational setting rather than just for project communication makes them more effective and contributes to community success.

Research Organization:
Stony Brook Univ., NY (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Nuclear Physics (NP)
DOE Contract Number:
FG02-87ER40317
OSTI ID:
1907878
Journal Information:
2021 IEEE/ACM Ninth Workshop on Education for High Performance Computing (EduHPC), Conference: 2021 IEEE/ACM Ninth Workshop on Education for High Performance Computing (EduHPC), St. Louis, MO (United States), 14 Nov 2021
Country of Publication:
United States
Language:
English

References (20)

Pragmatic optimizations for better scientific utilization of large supercomputers journal November 2012
Experience report: refactoring the mesh interface in FLASH, a multiphysics software conference October 2018
FLASH: An Adaptive Mesh Hydrodynamics Code for Modeling Astrophysical Thermonuclear Flashes journal November 2000
Open XDMoD: A Tool for the Comprehensive Management of High-Performance Computing Resources journal July 2015
Comparing the behavior of OpenMP Implementations with various Applications on two different Fujitsu A64FX platforms conference July 2021
On Validating an Astrophysical Simulation Code journal November 2002
The Response of Model and Astrophysical Thermonuclear Flames to Curvature and Stretch journal October 2003
The Tau Parallel Performance System journal May 2006
Helium Detonations on Neutron Stars journal March 2001
Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures conference November 2018
On the nonlinear evolution of wind-driven gravity waves journal July 2004
Terascale turbulence computation using the FLASH3 application framework on the IBM Blue Gene/L system journal January 2008
Evolution of FLASH, a multi-physics scientific simulation code for high-performance computing journal October 2013
On Heavy Element Enrichment in Classical Novae journal February 2004
Automatic Throughput and Critical Path Analysis of x86 and ARM Assembly Kernels conference November 2019
High-Performance Reactive Fluid Flow Simulations Using Adaptive Mesh Refinement on Thousands of Processors conference January 2000
The Software development process of FLASH, a multiphysics simulation code conference May 2013
Flash code: studying astrophysical thermonuclear flashes journal March 2000
FLASH MHD simulations of experiments that study shock-generated magnetic fields journal December 2015
Programming abstractions for orchestration of HPC scientific computing (keynote) conference June 2019

Similar Records

Ookami: Deployment and Initial Experiences
Conference · Sat Jul 17 00:00:00 EDT 2021 · Practice and Experience in Advanced Research Computing · OSTI ID:1907878

Experiences with Porting the FLASH Code to Ookami, an HPE Apollo 80 A64FX Platform
Conference · Thu Jan 13 00:00:00 EST 2022 · International Conference on High Performance Computing in Asia-Pacific Region Workshops · OSTI ID:1907878

Performance of an Astrophysical Radiation Hydrodynamics Code under Scalable Vector Extension Optimization
Journal Article · Tue Oct 18 00:00:00 EDT 2022 · Proceedings - IEEE International Conference on Cluster Computing · OSTI ID:1907878