Abstract
This paper evaluates the impact of the parallel I/O scheduling strategy on the performance of the file access in a parallel file system for clusters of commodity computers (Clusterfile). We argue that the parallel I/O scheduling strategy should be seen as a complement to other file access optimizations like striping over several I/O servers, non-contiguous I/O and collective I/O. Our study is based on three simple decentralized parallel I/O heuristics implemented inside Clusterfile. The measurements in a real environment show that the performance of parallel file access may vary with as much as 86% for writing and 804% for reading with the employed heuristic and with the schedule block granularity.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
DeBenedictis, E., Rosario, J.D.: nCUBE Parallel I/O Software. In: Proceedings of 11th International Phoenix Conference on Computers and Communication (1992)
LoVerso, S.J., et al.: sfs: A Parallel File System for the CM-5. In: Proceedings of the Summer 1993 USENIX Conference, pp. 291–305 (1993)
Huber, J.V., et al.: PPFS: A High Performance Portable File System. In: Proceedings of the 9th ACM International Conference on Supercomputing, ACM Press, New York (1995)
Corbett, P.F., Feitelson, D.G.: The Vesta Parallel File System. ACM Transactions on Computer Systems (1996)
Carretero, J., et al.: ParFiSys: A Parallel File System for MPP. ACM SIGOPS 30(2) (1996)
Freedman, C.S., Burger, J., DeWitt, D.J.: SPIFFI-A Scalable Parallel File System for the Intel Paragon. IEEE Transactions on Parallel and Distributed Systems (October 1996)
Nieuwejaar, N., Kotz, D.: The Galley Parallel File System. Parallel Computing (1997)
O’Keefe, M.T.: Shared file systems and fibre channel. In: Proceedings of the Sixth NASA Goddard Space Flight Center Conference on Mass Storage Systems and Technologies (1998)
Ligon, W.B., Ross, R.B.: An Overview of the Parallel Virtual File System. In: Proceedings of the Extreme Linux Workshop (June 1999)
Schmuck, F., Haskin, R.: GPFS: A Shared-Disk File System for Large Computing Clusters. In: Proceedings of FAST (2002)
Garcia-Carballeira, F., et al.: The Design of the Expand Parallel File System. The International Journal of High Performance Computing Applications 17(1), 21–38 (2003)
Winslett, M., et al.: The Panda library for parallel I/O of large multidimensional arrays. In: Proceedings of Scalable Parallel Libraries Conference III (October 1996)
Message Passing Interface Forum: MPI2: Extensions to the Message Passing Interface (1997)
Thakur, R., Gropp, W., Lusk, E.: Data Sieving and Collective I/O in ROMIO. In: Proc. of the 7th Symposium on the Frontiers of Massively Parallel Computation, February 1999, pp. 182–189 (1999)
Thakur, R., Gropp, W., Lusk, E.: On Implementing MPI-IO Portably and with High Performance. In: Proc. of the Sixth Workshop on I/O in Parallel and Distributed Systems, May 1999, pp. 23–32 (1999)
Isaila, F., Tichy, W.: View I/O:improving the performance of non-contiguous I/O. In: Third IEEE International Conference on Cluster Computing, Hong Kong, Dec. 2003, pp. 336–343. IEEE Computer Society Press, Los Alamitos (2003)
del Rosario, J., Bordawekar, R., Choudhary, A.: Improved parallel I/O via a two-phase run-time access strategy. In: Proc. of IPPS Workshop on Input/Output in Parallel Computer Systems (1993)
Kotz, D.: Disk-directed I/O for MIMD Multiprocessors. In: Proc. of the First USENIX Symp. on Operating Systems Design and Implementation (1994)
Isaila, F., et al.: Integrating Collective I/O and Cooperative Caching into the “Clusterfile” Parallel File System. In: Proceedings of ACM International Conference on Supercomputing (ICS), Saint Malo, France, pp. 315–324. ACM Press, New York (2004)
Isaila, F., Tichy, W.: Clusterfile: A flexible physical layout parallel file system. Concurrency and Computation: Practice and Experience 15(7–8), 653–679 (2003)
Jain, R., et al.: Heuristics for scheduling I/O operations. IEEE Transactions on Parallel and Distributed Systems 8(3), 310–320 (1997), http://www.computer.org/pubs/tpds/1997/310.pdf
Chen, F., Majumdar, S.: Performance of parallel I/O scheduling strategies on a network of workstations. In: Proceedings of ICPADS 2001, Fort Lauderdale, Florida, April 2001, pp. 157–164 (2001)
Abawajy, J.H.: Performance Analysis of Parallel I/O Scheduling Approaches on Cluster Computing Systems. In: CCGRID ’03: Proceedings of the 3st International Symposium on Cluster Computing and the Grid, Washington, DC, USA, p. 724. IEEE Computer Society Press, Los Alamitos (2003)
Durand, D., Jain, R., Tseytlin, D.: Parallel I/O scheduling using randomized, distributed edge coloring algorithms. J. Parallel Distrib. Comput. 63(6), 611–618 (2003)
Lumb, C.R., Golding, R.A., Ganger, G.R.: D-SPTF: decentralized request distribution in brick-based storage systems. In: ASPLOS, pp. 37–47 (2004)
Pai, V., et al.: Locality-Aware Request Distribution in Cluster-based Network Servers. In: Proceedings of the ACM Eighth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-VIII), October 1998, ACM Press, New York (1998)
Lebre, A., Denneulin, Y., Van, T.T.: Controlling and Scheduling Parallel I/O in Multi-application Environments. Technical report, INRIA (2005)
Dahlin, M., et al.: Cooperative Caching: Using Remote Client Memory to Improve File System Performance. In: The First Symp. on Operating Systems Design and Implementation (Nov. 1994)
Wang, R.Y., Anderson, T.E., Dahlin, M.D.: Experience with a distributed file system implementation with adaptive. Technical report (1998)
Myricom. GM: the low-level message-passing system for Myrinet networks (2000), http://www.myri.com/
Wong, P., der Wijngaart, R.: NAS Parallel Benchmarks I/O Version 2.4. Technical Report NAS-03-002, NASA Ames Research Center, Moffet Field, CA (2003)
Message Passing Interface Forum: MPI: A Message-Passing Interface Standard (1995)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Isailă, F., Singh, D., Carretero, J., Garcia, F. (2007). On Evaluating Decentralized Parallel I/O Scheduling Strategies for Parallel File Systems. In: Daydé, M., Palma, J.M.L.M., Coutinho, Á.L.G.A., Pacitti, E., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2006. VECPAR 2006. Lecture Notes in Computer Science, vol 4395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71351-7_10
Download citation
DOI: https://doi.org/10.1007/978-3-540-71351-7_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71350-0
Online ISBN: 978-3-540-71351-7
eBook Packages: Computer ScienceComputer Science (R0)