skip to main content
10.1145/2020373.2020384acmotherconferencesArticle/Chapter ViewAbstractPublication PagespgasConference Proceedingsconference-collections
research-article

XcalableMP implementation and performance of NAS Parallel Benchmarks

Published:12 October 2010Publication History

ABSTRACT

XcalableMP is a parallel extension of existing languages, such as C and Fortran, that was proposed as a new programming model to facilitate program parallel applications for distributed memory systems. In order to investigate the performance of parallel programs written in XcalableMP, we have implemented NAS Parallel Benchmarks, specifically, the Embarrassingly Parallel (EP) benchmark, the Integer Sort (IS) benchmark, and the Conjugate Gradient (CG) benchmark, using XcalableMP. The results show that the performance of XcalableMP is comparable to that of MPI. In particular, the performances of IS with a histogram and CG with two-dimensional parallelization achieve almost the same performance. The results also demonstrate that XcalableMP allows a programmer to write efficient parallel applications at a lower programming cost.

References

  1. Message Passing Interface Forum. http://www.mpi-forum.org/Google ScholarGoogle Scholar
  2. Directive-based language eXtension for Scalable and performance-aware Parallel Programming. http://www.xcalablemp.org/Google ScholarGoogle Scholar
  3. Jinpil Lee and Mitsuhisa Sato, Implementation and Performance Evaluation of XcalableMP: A Parallel Programming Language for Distributed Memory Systems, 39th Annual International Conference on Parallel Processing (2010) Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. http://www.xcalablemp.org/HPCC09_XMP.pdfGoogle ScholarGoogle Scholar
  5. OpenMP http://openmp.org/wp/Google ScholarGoogle Scholar
  6. Robert W. Numrich and John Reid. Co-array Fortran for parallel programming, ACM SIGPLAN Fortran Forum Volume 17 Issue 2 (1998) Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. JAHPF (Japan Association of High Performance Fortran). http://www.hpfpc.org/jahpf/Google ScholarGoogle Scholar
  8. Yuanyuan Zhang, Hidetoshi Iwashita, Kuninori Ishii, Masanori Kaneko, Tomotake Nakamura and Kohichiro Hotta Hybrid Parallel Programming on SMP Clusters Using XPFortran and OpenMP, 6th International Workshop on OpenMP (IWOMP 2010), LNCS 6132, pp. 133--148, Springer, Heidelberg (2010) Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Iwashita, H., Sueyasu, N., Kamiya, S., van Waveren, M. VPP Fortran and the Design of HPF/JA Extensions. In: Concurrency and Computation: Practice and Experience, vol. 14(8-9), pp. 575--588. John Wiley & Sons Ltd., Chichester (2002)Google ScholarGoogle Scholar
  10. Jinpil Lee, Mitsuhisa Sato and Taisuke Boku. OpenMPD: A Directive Based Data Parallel Language Extensions for Distributed Memory Systems, Proceedings of the 37th International Conference on Parallel Processing, pp. 121--128, 2008 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. http://www.hpcchallenge.org/Google ScholarGoogle Scholar
  12. Bailey, D. H. and et al. THE NAS PARALLEL BENCHMARKS, Technical Report NAS-94-007, Nasa Ames Research Center (1994)Google ScholarGoogle Scholar
  13. Frumkin, M., Jin, H. and Yan, J.: Implementation of NAS parallel benchmarks in high peroformance fortran, Technical Report NAS-98-009, Nasa Ames Research Center (1998)Google ScholarGoogle Scholar
  14. Nishitani, Y. et al.: Techiniques for compiling and implementing all NAS parallel benchmarks in HPF, Concurrency and Computation - Practice & Experience, Vol. 14, No. 8-9, Wiley, pp. 769--787 (2002)Google ScholarGoogle Scholar
  15. T. El-Ghazawi and F. Cantonnet. UPC performance and potential: A NPB experimental study. In Supercomputing 2002 (SC2002), November 2002 Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Titanium Project Home Page. http://titanium.cs.berkeley.edu/Google ScholarGoogle Scholar
  17. Kaushik Datta et al. Titanium Performance and Potential: An NPB Experimental Study, 18th International Workshop, LCPC 2005, pp. 200--214, Springer, Heidelberg (2006) Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Omni Compiler Project. NPB2.3-omni-C, http://www.hpcs.cs.tsukuba.ac.jp/omni-openmp/Google ScholarGoogle Scholar
  19. T2K Open Supercomputer. http://www.open-supercomputer.org/Google ScholarGoogle Scholar

Index Terms

  1. XcalableMP implementation and performance of NAS Parallel Benchmarks

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          PGAS '10: Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model
          October 2010
          134 pages
          ISBN:9781450304610
          DOI:10.1145/2020373

          Copyright © 2010 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 12 October 2010

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader