Abstract
We explore several methods utilizing system-wide shared memory to improve the performance of MPI-IO, particularly for non-contiguous file access. We introduce an abstraction called the datatype iterator that permits efficient, dynamic generation of (offset, length) pairs for a given MPI derived datatype. Combining datatype iterators with overlapped I/O and computation, we demonstrate how a shared memory MPI implementation can utilize more than 90% of the available disk bandwidth (in some cases representing a 5× performance improvement over existing methods) even for extreme cases of non-contiguous datatypes. We generalize our results to suggest possible parallel I/O performance improvements on systems without global shared memory.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Gropp, W., Huss-Lederman, S., Lumsdaine, A., Lusk, E., Nitzberg, B., Saphir, W., Snir, M.: MPI – The Complete Reference, vol. 2. MIT Press, Cambridge (1998)
Thakur, R., Gropp, W., Lusk, E.: Data sieving and collective I/O in ROMIO. In: Proceedings of the Seventh Symposium on the Frontiers of Massively Parallel Computation, pp. 182–189. IEEE Computer Society Press, Los Alamitos (1999)
Thakur, R., Gropp, W., Lusk, E.: On implementing MPI-IO portably and with high performance. In: Proceedings of the Sixth Workshop on Input/Output in Parallel and Distributed Systems, May 1999, pp. 23–32 (1999)
Ching, A., Choudhary, A., Liao, W.-K., Ross, R., Gropp, W.: Efficient structured data access in parallel file systems. In: Proceedings of the IEEE International Conference on Cluster Computing, pp. 326–335. IEEE Computer Society Press, Los Alamitos (2003)
MPICH2 home page (August 2005), http://www.mcs.anl.gov/mpi/mpich2/
HPCS – High Productivity Computer Systems (April 2006), http://www.highproductivity.org/
Vildibill, M.: Sun’s Hero program: Changing the productivity game (April 2006), http://www.hpcwire.com/hpc/614805.html
Ross, R., Miller, N., Gropp, W.: Implementing fast and reusable datatype processing. In: Proceedings of the 10th European PVM/MPI Users Group Meeting, pp. 404–413. Springer, Heidelberg (2003)
Worringen, J., Träff, J.L., Ritzdorf, H.: Fast parallel non-contiguous file access. In: Proceedings of SC2003: High Performance Networking and Computing. IEEE Computer Society Press, Los Alamitos (2003)
Worringen, J., Gäer, A., Reker, F.: Exploiting transparent remote memory access for non-contiguous- and one-sided-communication. In: Proceedings of the International Parallel and Distributed Processing Symposium, pp. 163–172. IEEE Computer Society Press, Los Alamitos (2002)
Sun StorageTek QFS software (April 2006), http://www.sun.com/storagetek/management_software/data_management/qfs/
LAM/MPI parallel computing (April 2005), http://www.lam-mpi.org/
Ross, R.: Parallel I/O benchmarking consortium (August 2005), http://www.mcs.anl.gov/~rross/pio-benchmark/
ASC center for astrophysical thermonuclear flashes (April 2006), http://flash.uchicago.edu/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hastings, A.B., Choudhary, A. (2006). Exploiting Shared Memory to Improve Parallel I/O Performance. In: Mohr, B., Träff, J.L., Worringen, J., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2006. Lecture Notes in Computer Science, vol 4192. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846802_33
Download citation
DOI: https://doi.org/10.1007/11846802_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39110-4
Online ISBN: 978-3-540-39112-8
eBook Packages: Computer ScienceComputer Science (R0)