DOI: 10.1145/1088149.1088192
Article

High performance support of parallel virtual file system (PVFS2) over Quadrics

Published: 20 June 2005

ABSTRACT

Parallel I/O needs to keep pace with the demands of high performance computing applications on systems of ever-increasing speed. Exploiting high-end interconnect technologies to reduce network access cost and scale aggregate bandwidth is one way to increase the performance of storage systems. In this paper, we explore the challenges of supporting a parallel file system with modern features of Quadrics, including user-level communication and RDMA operations. We design and implement a Quadrics-capable version of a parallel file system (PVFS2). Our design overcomes the challenges that the static communication model of Quadrics imposes on dynamic client/server architectures. Quadrics QDMA and RDMA mechanisms are integrated and optimized for high performance data communication. Zero-copy PVFS2 list I/O is achieved with a Single Event Associated MUltiple RDMA (SEAMUR) mechanism. Experimental results indicate that the performance of PVFS2, with Quadrics user-level protocols and RDMA operations, is significantly improved in terms of both data transfer and management operations. With four I/O server nodes, our implementation improves PVFS2 aggregate read bandwidth by up to 140% compared to PVFS2 over TCP on top of the Quadrics IP implementation. Moreover, it delivers significant performance improvements for application benchmarks such as mpi-tile-io [24] and BTIO [26]. To the best of our knowledge, this is the first work in the literature to report the design of a high performance parallel file system over Quadrics user-level communication protocols.
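
The abstract names the SEAMUR mechanism but does not describe its internals. As a rough illustration of the idea the name suggests (several RDMA operations sharing one completion event), the Python sketch below models a counting event that fires once after all segments of a noncontiguous transfer have landed. Everything here is an assumption for illustration: `CountingEvent` and `list_io_write` are hypothetical names, plain memory copies stand in for RDMA writes, and nothing in it reflects the Quadrics API or the paper's actual implementation.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class CountingEvent:
    """Hypothetical completion event: fires once after `count` signals."""
    count: int
    on_fire: Callable[[], None]
    fired: bool = False

    def signal(self) -> None:
        # Each finished transfer decrements the counter; the last one fires.
        if self.fired:
            raise RuntimeError("event already fired")
        self.count -= 1
        if self.count == 0:
            self.fired = True
            self.on_fire()


def list_io_write(src: bytes,
                  segments: List[Tuple[int, int, int]],
                  dest: bytearray,
                  event: CountingEvent) -> None:
    """Move each (src_off, dst_off, length) segment directly into `dest`.

    The slice assignment is a stand-in for one RDMA write per segment;
    no intermediate packing buffer is used.
    """
    for src_off, dst_off, length in segments:
        dest[dst_off:dst_off + length] = src[src_off:src_off + length]
        event.signal()


# Usage: three noncontiguous segments, one completion notification.
notifications: List[str] = []
src = b"AAAABBBBCCCC"
dest = bytearray(b"." * 16)
segments = [(0, 0, 4), (4, 6, 4), (8, 12, 4)]
event = CountingEvent(count=len(segments),
                      on_fire=lambda: notifications.append("done"))
list_io_write(src, segments, dest, event)
```

In this toy model the initiator handles a single notification for the whole list I/O request instead of one per segment, which is the property the mechanism's name points to.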

References

  1. The Parallel Virtual File System, version 2. http://www.pvfs.org/pvfs2.
  2. The Public Netperf Homepage. http://www.netperf.org/netperf/NetperfPage.html.
  3. J. Beecroft, D. Addison, F. Petrini, and M. McLaren. QsNet-II: An Interconnect for Supercomputing Applications. In the Proceedings of Hot Chips '03, Stanford, CA, August 2003.
  4. N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L. Seitz, J. N. Seizovic, and W.-K. Su. Myrinet: A Gigabit-per-Second Local Area Network. IEEE Micro, 15(1):29--36, 1995.
  5. D. Bonachea, C. Bell, P. Hargrove, and M. Welcome. GASNet 2: An Alternative High-Performance Communication Interface, Nov. 2004.
  6. P. H. Carns, W. B. Ligon III, R. Ross, and P. Wyckoff. BMI: A Network Abstraction Layer for Parallel I/O, April 2005.
  7. A. Ching, A. Choudhary, W. Liao, R. Ross, and W. Gropp. Noncontiguous I/O through PVFS. In Proceedings of the IEEE International Conference on Cluster Computing, Chicago, IL, September 2002.
  8. Cluster File System, Inc. Lustre: A Scalable, High Performance File System. http://www.lustre.org/docs.html.
  9. D. Nagle, D. Serenyi, and A. Matthews. The Panasas ActiveScale Storage Cluster -- Delivering Scalable High Bandwidth Storage. In Proceedings of Supercomputing '04, November 2004.
  10. M. DeBergalis, P. Corbett, S. Kleiman, A. Lent, D. Noveck, T. Talpey, and M. Wittle. The Direct Access File System. In Proceedings of Second USENIX Conference on File and Storage Technologies (FAST '03), 2003.
  11. J. Duato, S. Yalamanchili, and L. Ni. Interconnection Networks: An Engineering Approach. The IEEE Computer Society Press, 1997.
  12. J. Huber, C. L. Elford, D. A. Reed, A. A. Chien, and D. S. Blumenthal. PPFS: A High Performance Portable Parallel File System. In Proceedings of the 9th ACM International Conference on Supercomputing, pages 385--394, Barcelona, Spain, July 1995. ACM Press.
  13. IBM Corp. IBM AIX Parallel I/O File System: Installation, Administration, and Use. Document Number SH34-6065-01, August 1995.
  14. Infiniband Trade Association. http://www.infinibandta.org.
  15. Intel Scalable Systems Division. Paragon System User's Guide, May 1995.
  16. R. Latham, R. Ross, and R. Thakur. The impact of file systems on mpi-io scalability. In Proceedings of the 11th European PVM/MPI Users' Group Meeting (Euro PVM/MPI 2004), pages 87--96, September 2004.
  17. J. Liu, M. Banikazemi, B. Abali, and D. K. Panda. A Portable Client/Server Communication Middleware over SANs: Design and Performance Evaluation with InfiniBand. In SAN-02 Workshop (in conjunction with HPCA), February 2003.
  18. Message Passing Interface Forum. MPI-2: Extensions to the Message-Passing Interface, Jul 1997.
  19. N. Nieuwejaar and D. Kotz. The Galley Parallel File System. Parallel Computing, (4):447--476, June 1997.
  20. P. H. Carns, W. B. Ligon III, R. B. Ross, and R. Thakur. PVFS: A Parallel File System For Linux Clusters. In Proceedings of the 4th Annual Linux Showcase and Conference, pages 317--327, Atlanta, GA, October 2000.
  21. D. A. Patterson, G. Gibson, and R. H. Katz. A Case for Redundant Arrays of Inexpensive Disks. In Proceedings of the 1988 ACM SIGMOD International Conference on Management of Data, Chicago, IL, 1988.
  22. F. Petrini, W.-C. Feng, A. Hoisie, S. Coll, and E. Frachtenberg. The Quadrics Network: High Performance Clustering Technology. IEEE Micro, 22(1):46--57, January-February 2002.
  23. Quadrics, Inc. Quadrics Linux Cluster Documentation.
  24. R. B. Ross. Parallel i/o benchmarking consortium. http://www-unix.mcs.anl.gov/rross/pio-benchmark/html/.
  25. R. Thakur, W. Gropp, and E. Lusk. On Implementing MPI-IO Portably and with High Performance. In Proceedings of the 6th Workshop on I/O in Parallel and Distributed Systems, pages 23--32. ACM Press, May 1999.
  26. P. Wong and R. F. Van der Wijngaart. NAS Parallel Benchmarks I/O Version 2.4. Technical Report NAS-03-002, Computer Sciences Corporation, NASA Advanced Supercomputing (NAS) Division.
  27. J. Wu, P. Wyckoff, and D. K. Panda. PVFS over InfiniBand: Design and Performance Evaluation. In Proceedings of the International Conference on Parallel Processing '03, Kaohsiung, Taiwan, October 2003.
  28. J. Wu, P. Wyckoff, and D. K. Panda. Supporting Efficient Noncontiguous Access in PVFS over InfiniBand. In Proceedings of Cluster Computing '03, Hong Kong, December 2003.
  29. W. Yu, T. S. Woodall, R. L. Graham, and D. K. Panda. Design and Implementation of Open MPI over Quadrics/Elan4. In Proceedings of the International Conference on Parallel and Distributed Processing Symposium '05, Denver, Colorado, April 2005.
  30. R. Zahir. Lustre Storage Networking Transport Layer. http://www.lustre.org/docs.html.
  31. Y. Zhou, A. Bilas, S. Jagannathan, C. Dubnicki, J. F. Philbin, and K. Li. Experiences with VI Communication for Database Storage. In Proceedings of the 29th Annual International Symposium on Computer Architecture, pages 257--268. IEEE Computer Society, 2002.

  • Published in

    ICS '05: Proceedings of the 19th annual international conference on Supercomputing
    June 2005
    414 pages
    ISBN:1595931678
    DOI:10.1145/1088149

    Copyright © 2005 ACM


    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Acceptance Rates

    Overall Acceptance Rate: 584 of 2,055 submissions, 28%
