Mochi: Composing Data Services for High-Performance Computing Environments
- Argonne National Lab. (ANL), Lemont, IL (United States)
- Carnegie Mellon Univ., Pittsburgh, PA (United States)
- Vector Inst. for Artificial Intelligence, Toronto (Canada)
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- HDF Group Champaign, IL (United States)
Technology enhancements and the growing breadth of application workflows running on high-performance computing (HPC) platforms drive the development of new data services that provide high performance on these new platforms, provide capable and productive interfaces and abstractions for a variety of applications, and are readily adapted when new technologies are deployed. The Mochi framework enables composition of specialized distributed data services from a collection of connectable modules and subservices. Rather than forcing all applications to use a one-size-fits-all data staging and I/O software configuration, Mochi allows each application to use a data service specialized to its needs and access patterns. This paper introduces the Mochi framework and methodology. The Mochi core components and microservices are described. Examples of the application of the Mochi methodology to the development of four specialized services are detailed. Finally, a performance evaluation of a Mochi core component, a Mochi microservice, and a composed service providing an object model is performed. The paper concludes by positioning Mochi relative to related work in the HPC space and indicating directions for future work.
- Research Organization:
- Argonne National Lab. (ANL), Argonne, IL (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- AC02-06CH11357; AC02-05CH11231
- OSTI ID:
- 1596688
- Journal Information:
- Journal of Computer Science and Technology, Vol. 35, Issue 1; ISSN 1000-9000
- Publisher:
- Springer NatureCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Web of Science
Scaling Embedded In-Situ Indexing with DeltaFS
|
conference | November 2018 |
Evaluation of HPC Application I/O on Object Storage Systems
|
conference | November 2018 |
BESPOKV: Application Tailored Scale-Out Key-Value Stores
|
conference | November 2018 |
Toward Scalable and Asynchronous Object-Centric Data Management for HPC
|
conference | May 2018 |
Dragonfly+: Low Cost Topology for Scaling Datacenters
|
conference | February 2017 |
Mercury: Enabling remote procedure call for high-performance computing
|
conference | September 2013 |
LABIOS: A Distributed Label-Based I/O System
|
conference | January 2019 |
Overview of 3D NAND Technologies and Outlook Invited Paper
|
conference | October 2018 |
Slim Fly: A Cost Effective Low-Diameter Network Topology
|
conference | November 2014 |
FusionFS: Toward supporting data-intensive scientific applications on extreme-scale high-performance computing systems
|
conference | October 2014 |
ROOT — An object oriented data analysis framework
|
journal | April 1997 |
Malacology: A Programmable Storage System
|
conference | January 2017 |
Long-Time Dynamics through Parallel Trajectory Splicing
|
journal | December 2015 |
DataSpaces: an interaction and coordination framework for coupled simulation workflows
|
journal | February 2011 |
Technology-Driven, Highly-Scalable Dragonfly Topology
|
journal | June 2008 |
PapyrusKV: a high-performance parallel key-value store for distributed NVM architectures
|
conference | January 2017 |
The design and implementation of a log-structured file system
|
journal | February 1992 |
Argobots: A Lightweight Low-Level Threading and Tasking Framework
|
journal | March 2018 |
Methodology for the Rapid Development of Scalable HPC Data Services
|
conference | November 2018 |
Programmable Caches with a Data Management Language and Policy Engine
|
conference | May 2018 |
RADOS: a scalable, reliable storage service for petabyte-scale storage clusters
|
conference | January 2007 |
Platform Storage Performance With 3D XPoint Technology
|
journal | September 2017 |
GekkoFS - A Temporary Distributed File System for HPC Applications
|
conference | September 2018 |
Massively parallel loading
|
conference | January 2013 |
The NumPy Array: A Structure for Efficient Numerical Computation
|
journal | March 2011 |
An Ephemeral Burst-Buffer File System for Scientific Applications
|
conference | November 2016 |
Similar Records
A Software Defined Storage Approach to Exascale Storage Services (Final Technical Report)
Enabling NVM for Data-Intensive Scientific Services