Title: Mochi: Composing Data Services for High-Performance Computing Environments

Journal Article · Journal of Computer Science and Technology
  1. Argonne National Lab. (ANL), Lemont, IL (United States)
  2. Carnegie Mellon Univ., Pittsburgh, PA (United States)
  3. Vector Inst. for Artificial Intelligence, Toronto (Canada)
  4. Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
  5. HDF Group, Champaign, IL (United States)

Technology enhancements and the growing breadth of application workflows running on high-performance computing (HPC) platforms drive the development of new data services that provide high performance on these new platforms, provide capable and productive interfaces and abstractions for a variety of applications, and are readily adapted when new technologies are deployed. The Mochi framework enables composition of specialized distributed data services from a collection of connectable modules and subservices. Rather than forcing all applications to use a one-size-fits-all data staging and I/O software configuration, Mochi allows each application to use a data service specialized to its needs and access patterns. This paper introduces the Mochi framework and methodology. The Mochi core components and microservices are described. Examples of the application of the Mochi methodology to the development of four specialized services are detailed. Finally, a performance evaluation of a Mochi core component, a Mochi microservice, and a composed service providing an object model is performed. The paper concludes by positioning Mochi relative to related work in the HPC space and indicating directions for future work.

Research Organization:
Argonne National Lab. (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
AC02-06CH11357; AC02-05CH11231
OSTI ID:
1596688
Journal Information:
Journal of Computer Science and Technology, Vol. 35, Issue 1; ISSN 1000-9000
Publisher:
Springer Nature
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 17 works (citation information provided by Web of Science)
