Abstract
The performance of scientific parallel programs with high file-I/O-activity running on top of cluster computers strongly depends on the qualitative and quantitative characteristics of the requested I/O-accesses. It also depends on the corresponding mechanisms and policies being used at the parallel file system level. This paper presents the motivation and design of a set of MPI-IO-hints. These hints are used to select the distribution function with which a parallel file system manipulates an opened file. The implementation of a new physical distribution function called varstrip_dist is also presented in this article. This function is proposed based upon spatial characteristics presented by I/O-access patterns observed at the application level.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Patterson, D.A., Chen, P.M.: Storage Performance - Metrics and Benchmarks (1998), http://citeseer.ist.psu.edu/91919.html
Patterson, D.A., Chen, P.M.: Maximizing Performance in a Striped Disk Array. In: Proc. 17th Annual Symposium on Computer Architecture (17th ISCA 1990), Computer Architecture News, pp. 322–331 (1990)
Hsu, W.W., Smith, A.J.: Characteristics of I/O traffic in personal computer and server workloads. IBM Syst. J. 42, 347–372 (2003)
Hsu, W.W., Smith, A.J.: The performance impact of I/O optimizations and disk improvements. IBM Journal of Research and Development 48, 255–289 (2004)
Sterling, T.: An Overview of Cluster Computing. Beowulf Cluster Computing with Linux., pp. 15–29 (2002)
PVFS2, http://www.pvfs.org/pvfs2/
Ligon, W.B., Ross, R.B.: Implementation and Performance of a Parallel File System for High Performance Distributed Applications. In: Proceedings of the Fifth IEEE International Symposium on High Performance Distributed Computing, pp. 471–480 (1996)
Ross, R.B., Carns, P.H., Ligon III, W.B., Latham, R.: Using the Parallel Virtual File System (2002), http://www.parl.clemson.edu/pvfs/user-guide.html
Madhyastha, T.M.: Automatic Classification of Input/Output Access Patterns. PhD Thesis (1997)
Madhyastha, T.M., Reed, D.A.: Exploiting Global Input/Output Access Pattern Classification. In: Proceedings of SC 1997: High Performance Networking and Computing (1997)
Thakur, R., Gropp, W., Lusk, E.: On implementing MPI-IO portably and with high performance. In: Proceedings of the 6th Workshop on I/O in Parallel and Distributed Systems (IOPADS 1999), pp. 23–32 (1999)
Thakur, R.S., Gropp, W., Lusk, E.: A Case for ung MPI’s derived datatypes to improve I/O Performance. In: Proceedings of Supercomputing 1998, CD-ROM (1998)
Rabenseifner, R., Koniges, A.E., Prost, J.-P., Hedges, R.: The Parallel Effective I/O Bandwidth Benchmark: b_eff_io. Parallel I/O for Cluster Computing, 107–132 (2004)
Miller, E.L., Katz, R.H.: Input/output behavior of supercomputing applications. SC, 567–576 (1991)
MPI-2, http://www.mpi-forum.org
Gropp, W., Lusk, E., Rajeev, T.: Using MPI-2: Advanced Features of the Message-Passing Interface, pp. 15–16 (1999)
Patterson, D., Gibson, G., Randy, K.: A case for redundant arrays of inexpensive disks (RAID). In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 109–116 (1988)
PVFS Development Team: PVFS 2 Concepts: the new guy’s guide to PVFS. PVFS 2 Documentation (2004)
PVFS Development Team: PVFS 2 Distribution Design Notes. PVFS 2 Documentation (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lucas, H.V., Ludwig, T. (2005). Hint Controlled Distribution with Parallel File Systems. In: Di Martino, B., Kranzlmüller, D., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2005. Lecture Notes in Computer Science, vol 3666. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11557265_18
Download citation
DOI: https://doi.org/10.1007/11557265_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29009-4
Online ISBN: 978-3-540-31943-6
eBook Packages: Computer ScienceComputer Science (R0)