Data Federation Challenges in Remote Near-Real-Time Fusion Experiment Data Processing
- ORNL
- Princeton Plasma Physics Laboratory (PPPL)
- National Fusion Research Institute (NFRI), Daejon, Korea
- Georgia Institute of Technology, Atlanta
Fusion energy experiments and simulations provide critical information needed to plan future fusion reactors. As next-generation devices like ITER move toward long-pulse experiments, analyses, including AI and ML, should be performed in a wide range of time and computing constraints, from near-real-time constraints, between-shot analysis, and to campaign-wide long-term analysis. However, the data volume, velocity, and variety make it extremely challenging for analyses using only local computational resources. Researchers need the ability to compose and execute workflows spanning edge resources to large-scale high-performance computing facilities.We present Delta, a system to address data analysis challenges, including AI/ML, in fusion science, by leveraging the ADIOS I/O library and middleware, to support executing science workflows over the wide area network for near-real-time streaming. We discuss the data federation challenges in performing remote workflows, focusing on on-going research work in (1) managing, reducing, and streaming data to minimize I/O and data movement overheads, (2) decompressing and reorganizing data for analysis, and (3) executing workflows for automated data analysis. We introduce examples for deep-learning based data analysis for the fusion domain and demonstrate how we use Delta to construct end-to-end workflows for a fusion device in Korea, connecting a remote DOE facility in the USA. The capability demonstrated by this project is the basis for improving the state of the art for near-real-time data federation amongst remote facilities.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1843720
- Resource Relation:
- Journal Volume: 1315; Conference: Smoky Mountains Computational Sciences and Engineering Conference (SMC) - Oak Ridge, Tennessee, United States of America - 8/26/2020 12:00:00 PM-8/28/2020 12:00:00 PM
- Country of Publication:
- United States
- Language:
- English
Similar Records
AI-Science for Performance Optimization and Diagnosis of Science Instrument Federations
Workflows Community Summit 2022: A Roadmap Revolution