skip to main content
10.1145/2792745.2792773acmotherconferencesArticle/Chapter ViewAbstractPublication PagesxsedeConference Proceedingsconference-collections
short-paper

Optimizing codes on the Xeon Phi: a case-study with LAMMPS

Published: 26 July 2015 Publication History

Abstract

Intel's Xeon Phi co-processor has the potential to provide an impressive 4 GFlops/Watt while promising users that they need only to recompile their code to get it to run on the accelerator. This paper reports our experience on running LAMMPS, a widely-used molecular dynamics code, on the Xeon Phi and the steps we took to optimize its performance on the device. Using performance analysis tools to pinpoint bottlenecks in the code, we were able to achieve a speedup of 2.8x from running the original code on the host processors vs. the optimized code on the Xeon Phi. These optimizations also resulted in an improved LAMMPS' performance on the host -- speeding up the execution by 7x.

References

[1]
Intel(R) Xeon Phi(TM) Coprocessor 7120P. http://tinyurl.com/pjyyya3.
[2]
LAMMPS* for Intel(R) Xeon Ph(TM) Coprocessor. http://tinyurl.com/kknb59u.
[3]
LAMMPS Molecular Dynamics Simulator. http://lammps.sandia.gov/.
[4]
M. A. Laurenzano, M. M. Tikir, L. Carrington, and A. Snavely. Pebil: Efficient static binary instrumentation for linux. In Performance Analysis of Systems & Software (ISPASS), 2010 IEEE International Symposium on, pages 175--183. IEEE, 2010.
[5]
B. Li, H.-C. Chang, S. Song, C.-Y. Su, T. Meyer, J. Mooring, and K. Cameron. The power-performance tradeoffs of the intel xeon phi on hpc applications. In Parallel Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International, May 2014.
[6]
S. Pennycook, C. Hughes, M. Smelyanskiy, and S. Jarvis. Exploring simd for molecular dynamics, using intel xeon processors and intel xeon phi coprocessors. In Parallel Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on, pages 1085--1097, May 2013.
[7]
J. Peraza, A. Tiwari, M. Laurenzano, L. Carrington, W. Ward, and R. Campbell. Understanding the performance of stencil computations on intel's xeon phi. In Cluster Computing (CLUSTER), 2013 IEEE International Conference on, pages 1--5, Sept 2013.
[8]
TACC Stampede User Guide. https://portal.xsede.org/tacc-stampede.
[9]
Top500 Supercomputer Sites. http://www.top500.org/.

Cited By

View all
  • (2018)Task-Based Programming on Emerging Parallel Architectures for Finite-Differences Seismic Numerical KernelEuro-Par 2018: Parallel Processing10.1007/978-3-319-96983-1_54(764-777)Online publication date: 1-Aug-2018
  • (2017)Applying EMD/HHT analysis to power traces of applications executed on systems with Intel Xeon PhiThe International Journal of High Performance Computing Applications10.1177/1094342017731612(109434201773161)Online publication date: 31-Oct-2017
  • (2017)Running large‐scale CFD applications on Intel‐KNL–based clustersInternational Journal for Numerical Methods in Fluids10.1002/fld.447486:11(699-716)Online publication date: 14-Nov-2017

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
XSEDE '15: Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure
July 2015
296 pages
ISBN:9781450337205
DOI:10.1145/2792745
© 2015 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the United States Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Sponsors

  • San Diego Super Computing Ctr: San Diego Super Computing Ctr
  • HPCWire: HPCWire
  • Omnibond: Omnibond Systems, LLC
  • SGI
  • Internet2
  • Indiana University: Indiana University
  • CASC: The Coalition for Academic Scientific Computation
  • NICS: National Institute for Computational Sciences
  • Intel: Intel
  • DDN: DataDirect Networks, Inc
  • DELL
  • CORSA: CORSA Technology
  • ALLINEA: Allinea Software
  • Cray
  • RENCI: Renaissance Computing Institute

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 July 2015

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Short-paper

Funding Sources

Conference

XSEDE '15
Sponsor:
  • San Diego Super Computing Ctr
  • HPCWire
  • Omnibond
  • Indiana University
  • CASC
  • NICS
  • Intel
  • DDN
  • CORSA
  • ALLINEA
  • RENCI

Acceptance Rates

XSEDE '15 Paper Acceptance Rate 49 of 70 submissions, 70%;
Overall Acceptance Rate 129 of 190 submissions, 68%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2018)Task-Based Programming on Emerging Parallel Architectures for Finite-Differences Seismic Numerical KernelEuro-Par 2018: Parallel Processing10.1007/978-3-319-96983-1_54(764-777)Online publication date: 1-Aug-2018
  • (2017)Applying EMD/HHT analysis to power traces of applications executed on systems with Intel Xeon PhiThe International Journal of High Performance Computing Applications10.1177/1094342017731612(109434201773161)Online publication date: 31-Oct-2017
  • (2017)Running large‐scale CFD applications on Intel‐KNL–based clustersInternational Journal for Numerical Methods in Fluids10.1002/fld.447486:11(699-716)Online publication date: 14-Nov-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media