ABSTRACT
This paper discusses the capabilities of the newest processors and GPUs to run a mixture of the most common chemistry applications. The baseline system for these comparisons is the 32-core Intel Broadwell processor which has been around for two years. Comparisons are made to the newer Intel Skylake and the AMD EPYC processors. The EPYC architecture has typically twice as many cores so one point of comparison is whether each code can effectively make use of the higher core count. These codes can be accelerated using GPUs with some taking advantage of 32-bit acceleration while others need good 64-bit performance. The consumer grade NVIDIA GeForce GTX 1080Ti cards are used as the baseline for the GPU comparisons. Higher level NVIDIA Quadro GP100 and Titan V cards are evaluated using each code. All applications use CUDA to enable GPU acceleration. AMD provides tools in its HIP package that allow translation of C and C++ CUDA code into source code that can be compiled with either NVIDIA's NVCC or AMD's HCC compilers. This project also involves investigating the performance and ease of converting CUDA code to run on the AMD Radeon Vega Frontier Edition GPU card.
- Kansas State University Beocat cluster website, http://beocat.ksu.edu/.Google Scholar
- H.J.C Berendsend, D. van der Spoel, R. van Drunen. 1995. GROMACS: A message-passing parallel molecular dynamics implementation. Computer Physics Communications 91, 1--3 (Sept. 1995), 43--56.Google Scholar
- Mark James Abraham, Teemu Murtola, Roland Schutz, Szilard Pall, Jeremy C. Smith, Berk Hess, Erik Linkahl. 2015. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1--2, (Sept. 2015), 19--25.Google Scholar
- James C. Phillips, Rosemary Braun, Wei Wang, James Gumbart, Emad Tajkhorshid, Elizabeth Villa, Christophe Hipot, Robert D. Skeel, Laxmikant Kale, and Klaus Schulten. 2005. Scalable molecular dynamics with NAMD. Journal of Computational Chemistry. 26 (2005) 1781--1802.Google ScholarCross Ref
- G. Kresse and J. Hafner. 1993. Ab-initio molecular dynamics for liquid metals. Physical Review B, 47 (1993) 558.Google ScholarCross Ref
- G. Kresse and J. Hafner. 1994. Ab-initio molecular-dynamics simulation of the liquid-metal-amorphous-semiconductor transition in germanium. Physical Review B, 49 (1994) 14251.Google ScholarCross Ref
- G. Kresse and J. Furthmuller. 1996. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Computational Material Science, 6 (1996) 15.Google ScholarCross Ref
- G. Kresse and J. Furthmuller. 1996. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Physical Review B, 54 (1996) 11169.Google ScholarCross Ref
- C. Fonseca Guerra, J.G. Snijders, G. the Velde and E.J. Baerends. 1998. Towards an order-N DFT method. Theoretical Chemistry Accounts, 99 (1998) 391.Google Scholar
- C. Fonseca Guerra, J.G. Snijders, G. the Velde and E.J. Baerends. 1998. Towards an order-N DFT method. Theoretical Chemistry Accounts, 99 (1998) 391.Google Scholar
- ADF2017, SCM, Theoretical Chemistry, Vrije Universiteit, Amsterdam, The Netherlands, http://www.scm.com.Google Scholar
Index Terms
- Application Performance on the Newest Processors and GPUs
Recommendations
Accelerating the LOBPCG method on GPUs using a blocked sparse matrix vector product
HPC '15: Proceedings of the Symposium on High Performance ComputingThis paper presents a heterogeneous CPU-GPU implementation for a sparse iterative eigensolver -- the Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG). For the key routine generating the Krylov search spaces via the product of a sparse ...
Comparing the Performance of SYCL Runtimes for Molecular Dynamics Applications
IWOCL '23: Proceedings of the 2023 International Workshop on OpenCLSYCL is a cross-platform, royalty-free standard for programming a wide range of hardware accelerators. It is a powerful and convenient way to write standard C++ 17 code that can take full advantage of available devices. There are already multiple SYCL ...
Advances in the OpenCL offload support in GROMACS
IWOCL '19: Proceedings of the International Workshop on OpenCLGROMACS is a molecular dynamics (MD) simulation package widely used in research and education on machines ranging from laptops to workstation to the largest supercomputers. Built on a highly portable free and open source codebase GROMACS is known to have ...
Comments