skip to main content
10.1145/3581784.3613216acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections

I/O in WRF: A Case Study in Modern Parallel I/O Techniques

Published:11 November 2023Publication History

ABSTRACT

Large-scale parallel applications can face significant I/O performance bottlenecks, making efficient I/O crucial. This work presents a comparative study of several parallel I/O implementations in the Weather Research and Forecasting model, including PnetCDF blocking and non-blocking I/O options, netCDF4, HDF5 Log VOL, and ADIOS. For I/O methods creating files in a canonical data layout, PnetCDF's non-blocking option offers up to 2x improvement over its blocking option and up to 4.5x over HDF5 via netCDF4, demonstrating the effectiveness of the write request aggregation technique. The HDF5 Log VOL outperforms ADIOS with a 4x improvement in write performance when creating files in the log layout, although both require non-negligible time to convert the file back to canonical order for post-run analysis. From these results we extract some observations that can guide I/O strategies for modern parallel codes.

References

  1. [n. d.]. Community Multiscale Air Quality Modeling System (CMAQ). Zenodo. Software. Google ScholarGoogle ScholarCross RefCross Ref
  2. Ehtesham Ahmed, Naeem Saddique, Firas Al Janabi, Klemens Barfus, Malik Rizwan Asghar, Abid Sarwar, and Peter Krebs. 2023. Flood Predictability of One-Way and Two-Way WRF Nesting Coupled Hydrometeorological Flow Simulations in a Transboundary Chenab River Basin, Pakistan. Remote Sens. 15, 2 (2023), 457. Google ScholarGoogle ScholarCross RefCross Ref
  3. Katie Antypas, Nicholas Wright, Nicholas P Cardo, Allison Andrews, and Matthew Cordery. 2014. Cori: A Cray XC pre-exascale system for NERSC. Cray User Group Proceedings. Cray (2014).Google ScholarGoogle Scholar
  4. Tricia Balle and Pete Johnsen. 2016. Improving I/O Performance of the Weather Research and Forecast (WRF) Model.Google ScholarGoogle Scholar
  5. Suren Byna, Mohamad Chaarawi, Quincey Koziol, John Mainzer, and Frank Willmore. 2017. Tuning HDF5 Subfiling Performance on Parallel File Systems. In the Cray User Group Meeting.Google ScholarGoogle Scholar
  6. Philip Carns, Kevin Harms, William Allcock, Charles Bacon, Samuel Lang, Robert Latham, and Robert Ross. 2011. Understanding and Improving Computational Science Storage Access through Continuous Characterization. ACM Trans. Storage 7, 3, Article 8 (oct 2011), 26 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Jacqueline Chen, Alok Choudhary, Bronis R. de Supinski, Matt DeVries, Evatt Hawkes, Scott Klasky, Wei-keng Liao, Kwan-Liu Ma, Jim Crummey, Norbert Podhorszki, Ramanan Sankaran, Sameer Shende, and Chialin Yoo. 2009. Terascale Direct Numerical Simulations of Turbulent Combustion Using S3D. Computational Science and Discovery 2 (January 2009).Google ScholarGoogle Scholar
  8. Jacob Finkenrath, Giannis Koutsou, Swen Metzger, Hendrik Elbern, and Jonas Berndt. 2019. Approaching exascale with the Weather Research and Forecasting Solar model. Google ScholarGoogle ScholarCross RefCross Ref
  9. Kui Gao, Chen Jin, Alok Choudhary, and Wei-Keng Liao. 2011. Supporting computational data model representation with high-performance I/O in parallel netCDF. In 2011 18th International Conference on High Performance Computing. 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Kui Gao, Wei-Keng Liao, Alok Choudhary, Robert Ross, and Robert Latham. 2009. Combining I/O operations for multiple array variables in parallel netCDF. In 2009 IEEE International Conference on Cluster Computing and Workshops. 1--10. Google ScholarGoogle ScholarCross RefCross Ref
  11. Kui Gao, Wei-Keng Liao, Arifa Nisar, Alok N. Choudhary, Robert B. Ross, and Robert Latham. 2009. Using Subfiling to Improve Programming Flexibility and Performance of Parallel Shared-file I/O. In the International Conference Parallel Processing.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. William F. Godoy, Norbert Podhorszki, Ruonan Wang, Chuck Atkins, Greg Eisenhauer, Junmin Gu, Philip Davis, Jong Choi, Kai Germaschewski, Kevin Huck, Axel Huebl, Mark Kim, James Kress, Tahsin Kurc, Qing Liu, Jeremy Logan, Kshitij Mehta, George Ostrouchov, Manish Parashar, Franz Poeschel, David Pugmire, Eric Suchyta, Keichi Takahashi, Nick Thompson, Seiji Tsutsumi, Lipeng Wan, Matthew Wolf, Kesheng Wu, and Scott Klasky. 2020. ADIOS 2: The Adaptable Input Output System. A framework for high-performance data management. SoftwareX 12 (2020), 100561. Google ScholarGoogle ScholarCross RefCross Ref
  13. The HDF Group. 1997--2023. Hierarchical Data Format, version 5. https://www.hdfgroup.org/HDF5.Google ScholarGoogle Scholar
  14. The HDF Group. 2022. HDF5 release 1.14.0. https://portal.hdfgroup.org/display/support/HDF5+1.14.0.Google ScholarGoogle Scholar
  15. Qiao Kang, Sunwoo Lee, Kaiyuan Hou, Robert Ross, Ankit Agrawal, Alok Choudhary, and Wei-Keng Liao. 2020. Improving MPI Collective I/O for High Volume Non-Contiguous Requests with Intra-Node Aggregation. IEEE Transactions on Parallel and Distributed Systems 31, 11 (2020), 2682--2695. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Akira Kyle. 2018. Weather Research and Forecast (WRF) Scaling, Performance Assessment and Optimization. https://akirakyle.github.io/WRF_benchmarks/. Accessed on June 2022.Google ScholarGoogle Scholar
  17. Rob Latham, Chris Daley, Wei-keng Liao, Kui Gao, Rob Ross, Anshu Dubey, and Alok Choudhary. 2012. A case study for scientific I/O: improving the FLASH astrophysics code. Computational Science & Discovery 5, 1 (2012), 015001.Google ScholarGoogle ScholarCross RefCross Ref
  18. Michael Laufer. 2022. WRF-ADIOS2-to-NetCDF4. https://github.com/MichaelLaufer/WRF-ADIOS2-to-NetCDF4. Accessed on Mar, 2023.Google ScholarGoogle Scholar
  19. Michael Laufer and Erick Fredj. 2022. High Performance Parallel I/O and In-Situ Analysis in the WRF Model with ADIOS2. CoRR abs/2201.08228 (2022). arXiv:2201.08228 https://arxiv.org/abs/2201.08228Google ScholarGoogle Scholar
  20. Jianwei Li, Wei-Keng Liao, Alok Choudhary, Robert Ross, Rajeev Thakur, William Gropp, Rob Latham, Andrew Siegel, Brad Gallagher, and Michael Zingale. 2003. Parallel netCDF: A High-Performance Scientific I/O Interface. In SC Conference. IEEE Computer Society, Los Alamitos, CA, USA, 39. Google ScholarGoogle ScholarCross RefCross Ref
  21. Wei-Keng Liao and Alok Choudhary. 2008. Dynamically Adapting File Domain Partitioning Methods for Collective I/O Based on Underlying Parallel File System Locking Protocols. In the ACM/IEEE Conference on Supercomputing.Google ScholarGoogle ScholarCross RefCross Ref
  22. Wei-Keng Liao, Kaiyuan Hou, and Zanhua Huang. 2019--2023. Log VOL. https://github.com/DataLib-ECP/vol-log-based.Google ScholarGoogle Scholar
  23. Barry H. Lynn, Seth Cohen, Leonard Druyan, Adam S. Phillips, Dennis Shea, Haim-Zvi Krugliak, and Alexander P. Khain. 2020. An Examination of the Impact of Grid Spacing on WRF Simulations of Wintertime Precipitation in the Mid-Atlantic United States. Weather and Forecasting 35, 6 (2020), 2317--2343. Google ScholarGoogle ScholarCross RefCross Ref
  24. Message Passing Interface Forum. 2021. MPI: A Message-Passing Interface Standard Version 4.0. https://www.mpi-forum.orgGoogle ScholarGoogle Scholar
  25. OpenFOAM Foundation. [n. d.]. OpenFOAM - The Open Source Computational Fluid Dynamics (CFD) Toolbox. https://www.openfoam.org.Google ScholarGoogle Scholar
  26. Juan Rosario, Rajesh Bordawekar, and Alok Choudhary. 1993. Improved parallel I/O via a two-phase run-time access strategy. ACM SIGARCH Computer Architecture News 21 (12 1993), 31--38. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Manu Shantharam, Mahidhar Tatineni, Dongju Choi, and Amitava Majumdar. 2018. Understanding I/O Bottlenecks and Tuning for High Performance I/O on Large HPC Systems: A Case Study. In Proceedings of the Practice and Experience on Advanced Research Computing (Pittsburgh, PA, USA) (PEARC '18). Association for Computing Machinery, New York, NY, USA, Article 54, 6 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. William C. Skamarock, Joseph B. Klemp, Jimy Dudhia, David O. Gill, Zhiquan Liu, Judith Berner, Wei Wang, Jordan G. Powers, Michael G. Duda, Dale M. Barker, and Xiang-yu Huang. 2019. A description of the advanced research WRF Version 4. techreport NCAR/TN-556+STR. NCAR Tech. 145 pages.Google ScholarGoogle Scholar
  29. R Thakur, E Lusk, and W Gropp. 1997. Users guide for ROMIO: A high-performance, portable MPI-IO implementation. (10 1997). Google ScholarGoogle ScholarCross RefCross Ref
  30. Unidata. 2022. NetCDF version 4.9.0 [software]. Google ScholarGoogle ScholarCross RefCross Ref
  31. Xiwen Wang, Weijia Wang, Yuan He, Shulei Zhang, Wei Huang, R. Iestyn Woolway, Kun Shi, and Xiaofan Yang. 2023. Numerical simulation of thermal stratification in Lake Qiandaohu using an improved WRF-Lake model. Journal of Hydrology 618 (2023), 129184. Google ScholarGoogle ScholarCross RefCross Ref
  32. D. C. Wong, C. E. Yang, J. S. Fu, K. Wong, and Y. Gao. 2015. An approach to enhance PnetCDF performance in environmental modeling applications. Geoscientific Model Development 8, 4 (2015), 1033--1046. Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. I/O in WRF: A Case Study in Modern Parallel I/O Techniques

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SC '23: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
      November 2023
      1428 pages
      ISBN:9798400701092
      DOI:10.1145/3581784

      Copyright © 2023 ACM

      Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 11 November 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,516of6,373submissions,24%
    • Article Metrics

      • Downloads (Last 12 months)312
      • Downloads (Last 6 weeks)44

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader