Light-Weight Parallel I/O Analysis at Scale

Wright, Steven A.; Hammond, Simon D.; Pennycook, Simon J.; Jarvis, Stephen A.

doi:10.1007/978-3-642-24749-1_18

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 6977)

Included in the following conference series:

European Performance Engineering Workshop

Abstract

Input/output (I/O) operations can represent a significant proportion of the run-time when large scientific applications are run in parallel. Although there have been advances in the form of file-format libraries, file system design and I/O hardware, a growing divergence exists between the performance of parallel file systems and compute processing rates.

In this paper we use RIOT, an input/output tracing toolkit being developed at the University of Warwick, to assess the performance of three industry-standard I/O benchmarks and mini-applications. We present a case study demonstrating the tracing and analysis capabilities of RIOT at scale, using MPI-IO, Parallel HDF-5 and MPI-IO augmented with the Parallel Log-structured File System (PLFS) middleware being developed by Los Alamos National Laboratory.

Author information

Authors and Affiliations

Performance Computing and Visualisation, Department of Computer Science, University of Warwick, UK
Steven A. Wright, Simon D. Hammond, Simon J. Pennycook & Stephen A. Jarvis

Editor information

Editors and Affiliations

School of Computing Science, Newcastle University, Newcastle upon Tyne, UK
Nigel Thomas

About this paper

Cite this paper

Wright, S.A., Hammond, S.D., Pennycook, S.J., Jarvis, S.A. (2011). Light-Weight Parallel I/O Analysis at Scale. In: Thomas, N. (eds) Computer Performance Engineering. EPEW 2011. Lecture Notes in Computer Science, vol 6977. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24749-1_18

DOI: https://doi.org/10.1007/978-3-642-24749-1_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24748-4
Online ISBN: 978-3-642-24749-1
eBook Packages: Computer Science, Computer Science (R0)
