skip to main content
10.1145/2322156.2322159acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
research-article

On the scalability of the clusters-booster concept: a critical assessment of the DEEP architecture

Published:25 June 2012Publication History

ABSTRACT

Cluster computers are dominating high performance computing (HPC) today. The success of this architecture is based on the fact that it proffits from the improvements provided by mainstream computing well known under the label of Moore's law. But trying to get to Exascale within this decade might require additional endeavors beyond surfing this technology wave. In order to find possible directions for the future we review Amdahl's and Gustafson's thoughts on scalability. Based on this analysis we propose an advance architecture combining a Cluster with a so called Booster element comprising of accelerators interconnected by a high performance fabric. We argue that this architecture provides significant advantages compared to today's accelerated clusters and might pave the way for clusters into the era of Exascale computing. The DEEP project has been presented aiming for an implementation of this concept. Six applications from fields having the potential to exploit Exascale systems will be ported to DEEP.We analyze one application in detail and explore the consequences of the constraints of the DEEP systems on its scalability.

References

  1. http://www.top500.orgGoogle ScholarGoogle Scholar
  2. http://www.deep-project.euGoogle ScholarGoogle Scholar
  3. http://http://www.mpi-forum.orgGoogle ScholarGoogle Scholar
  4. Gordon E. Moore, "Cramming more components onto integrated circuits.", Electronics. 19, Nr. 3, 1965, pp. 114-117.Google ScholarGoogle Scholar
  5. www.cse.nd.edu/Reports/2008/TR-2008-13.pdfGoogle ScholarGoogle Scholar
  6. http://www.theregister.co.uk/2010/11/22/ibm_blue_gene_q_superGoogle ScholarGoogle Scholar
  7. http://developer.nvidia.com/gpudirectGoogle ScholarGoogle Scholar
  8. http://www.green500.orgGoogle ScholarGoogle Scholar
  9. H. Baier et al., "QPACE: power-efficient parallel architecture based on IBM PowerXCell 8i", Computer Science - R&D 25 (2010), pp. 149-154. doi:10.1007/s00450-010-0122-4.Google ScholarGoogle Scholar
  10. Gene Amdahl (1967), "Validity of the Single Processor Approach to Achieving Large-Scale Computing Capabilities", (PDF), AFIPS Conference Proceedings (30), pp. 483-485. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. John L. Gustafson, "Re-evaluating Amdahl's Law", Communications of the ACM 31(5), 1988, pp. 532-533. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Charles Clos, "A Study of Non-blocking Switching Networks", The Bell System Technical Journal, 1953, vol. 32, no. 2, pp. 406-424Google ScholarGoogle ScholarCross RefCross Ref
  13. http://www.intel.com/pressroom/archive/releases/2010/20100531comp.htmGoogle ScholarGoogle Scholar
  14. http://newsroom.intel.com/servlet/JiveServlet/download/38-6968/Intel_SC11_presentation.pdfGoogle ScholarGoogle Scholar
  15. Mondrian Nüssle et al., "A resource optimized remote-memory-access architecture for low-latency communication", The 38th International Conferenceon Parallel Processing (ICPP-2009), September 22-25, Vienna, Austria. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. H. Fröning und H. Litz, Effcient Hardware Support for the Partitioned Global Address Space, 10th Workshop on Communication Architecture for Clusters (CAC2010), co-located with 24th International Parallel and Distributed Processing Symposium (IPDPS 2010), Atlanta, Georgia, 2012.Google ScholarGoogle Scholar
  17. S. Markidis, G. Lapenta and Rizwan-Uddin, "Multi-scale simulations of plasma with iPIC3D", Mathematics and Computers in Simulation, pp. 1509-1519, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. U. Brackbill and D. W. Forslund, "Simulation of low frequency, electromagnetic phenomena in plasmas", Journal of Computational Physics, 1982, p. 271.Google ScholarGoogle ScholarCross RefCross Ref
  19. P. Ricci, G. Lapenta and J. U. Brackbill, "A simplified implicit Maxwell solver", Journal of Computational Physics (2002), p. 117. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Marder, "A method for incorporating Gauss' law into electromagnetic PIC codes", J. Comput. Phys., vol. 68 (1987), p. 48 Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A. Bruce Langdon, "On enforcing Gauss' law in electromagnetic particle-in-cell codes", Computer Physics Communications, vol. 70, Issue 3 (1992).Google ScholarGoogle Scholar
  22. A. Duran, E. Ayguaée, R. M. Badia, J. Labarta, L. Martinell, X. Martorell and J. Planas, "OmpSs: A Proposal for Programming Heterogeneous Multi-Core Architectures", in Parallel Processing Letters, vol. 21, Issue 2 (2011) pp. 173-193.Google ScholarGoogle ScholarCross RefCross Ref
  23. G. R. Gao, T. L. Sterling, R. Stevens, M. Hereld and W. Zhu, "ParalleX: A Study of A New Parallel Computation Model", in Proc. of 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, California, USAGoogle ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. On the scalability of the clusters-booster concept: a critical assessment of the DEEP architecture

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          FutureHPC '12: Proceedings of the Future HPC Systems: the Challenges of Power-Constrained Performance
          June 2012
          31 pages
          ISBN:9781450314534
          DOI:10.1145/2322156

          Copyright © 2012 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 25 June 2012

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader