DOI: 10.1145/2792745.2792778
research-article

Advanced user environment design and implementation on integrated multi-architecture supercomputers

Published: 26 July 2015

Abstract

Scientists and engineers who use supercomputer clusters should be able to focus on their scientific and technical work rather than on managing their user environment. However, creating a convenient and effective user environment on modern supercomputers is increasingly challenging because of the complexity of these large-scale systems. In this report, we discuss the main design issues and goals for a user environment that must support multiple compiler suites, various applications, and diverse libraries on heterogeneous computing architectures. We present our implementation on Yellowstone, a high-performance computing system dedicated to earth system science and deployed by the National Center for Atmospheric Research. The newly designed user environment is built upon a hierarchical module structure, customized wrapper scripts, pre-defined system modules, the Lmod implementation of environment modules, and several additional tools. The resulting implementation provides streamlined control, versioning, user customization, and automated documentation, among other features, and accommodates both novice and experienced users. The design and implementation also minimize the effort that administrators and the support team spend managing users' environments. Smooth adoption and positive user feedback indicate that the design and implementation on Yellowstone have been well accepted and serve thousands of users all over the world.
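
As a rough illustration of what a hierarchical module structure means in practice, the Python sketch below models the general idea: a small set of core modules (compilers) is always visible, and loading a compiler exposes only the libraries built with that compiler. This is a toy model, not the Yellowstone implementation and not Lmod itself; the module names, versions, and the functions `available_modules` and `swap_compiler` are hypothetical and chosen only for illustration.

```python
# Toy model of a hierarchical module tree.
# All module names and versions below are hypothetical examples.
CORE = {"intel/15.0", "gnu/4.9"}

# Libraries that become visible only after a given compiler is loaded.
COMPILER_DEPENDENT = {
    "intel/15.0": {"netcdf/4.3", "hdf5/1.8"},
    "gnu/4.9": {"netcdf/4.3"},
}


def available_modules(loaded):
    """Return the set of modules a user can see, given the loaded modules."""
    visible = set(CORE)
    for name in loaded:
        visible |= COMPILER_DEPENDENT.get(name, set())
    return visible


def swap_compiler(loaded, old, new):
    """Replace one compiler with another and drop libraries the new one lacks."""
    remaining = [m for m in loaded if m != old]
    remaining.append(new)
    visible = available_modules([new])
    # Keep only the modules that still exist under the new compiler.
    return [m for m in remaining if m in visible]


if __name__ == "__main__":
    session = ["intel/15.0", "netcdf/4.3", "hdf5/1.8"]
    print(sorted(available_modules(session)))
    print(swap_compiler(session, "intel/15.0", "gnu/4.9"))
```

In this sketch, swapping the compiler silently drops any library that has no build for the new compiler. A real module system would more likely report such a module as inactive or reload a matching build, so the behavior here is a simplifying assumption.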

Cited By

  • (2016) Sanity tool. Proceedings of the Third International Workshop on HPC User Support Tools, pages 24-30. DOI: 10.5555/3018834.3018838. Online publication date: 13-Nov-2016.
  • (2016) Sanity Tool: Lightweight Diagnostics for Individual User Accounts on Supercomputer Systems. 2016 Third International Workshop on HPC User Support Tools (HUST), pages 24-30. DOI: 10.1109/HUST.2016.008. Online publication date: Nov-2016.

Published In

XSEDE '15: Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure
July 2015
296 pages
ISBN:9781450337205
DOI:10.1145/2792745
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

  • San Diego Super Computing Ctr
  • HPCWire
  • Omnibond: Omnibond Systems, LLC
  • SGI
  • Internet2
  • Indiana University
  • CASC: The Coalition for Academic Scientific Computation
  • NICS: National Institute for Computational Sciences
  • Intel
  • DDN: DataDirect Networks, Inc
  • DELL
  • CORSA: CORSA Technology
  • ALLINEA: Allinea Software
  • Cray
  • RENCI: Renaissance Computing Institute

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. installation management
  2. modules
  3. software environment
  4. system software
  5. user environment

Qualifiers

  • Research-article

Conference

XSEDE '15

Acceptance Rates

XSEDE '15 Paper Acceptance Rate 49 of 70 submissions, 70%;
Overall Acceptance Rate 129 of 190 submissions, 68%
