Skip to main content

Petaflop Seismic Simulations in the Public Cloud

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11501))

Abstract

During the last decade cloud services and infrastructure as a service became a popular solution for diverse applications. Additionally, hardware support for virtualization closed performance gaps, compared to on-premises, bare-metal systems. This development is driven by offloaded hypervisors and full CPU virtualization. Today’s cloud service providers, such as Amazon or Google, offer the ability to assemble application-tailored clusters to maximize performance. However, from an interconnect point of view, one has to tackle a 4–5\(\times \) slow-down in terms of bandwidth and 25\(\times \) in terms of latency, compared to latest high-speed and low-latency interconnects. Taking into account the high per-node and accelerator-driven performance of latest supercomputers, we observe that the network-bandwidth performance of recent cloud offerings is within 2\(\times \) of large supercomputers. In order to address these challenges, we present a comprehensive application-centric approach for high-order seismic simulations utilizing the ADER discontinuous Galerkin finite element method, which exhibits excellent communication characteristics. This covers the tuning of the operating system, normally not possible on supercomputers, micro-benchmarking, and finally, the efficient execution of our solver in the public cloud. Due to this performance-oriented end-to-end workflow, we were able to achieve 1.09 PFLOPS on 768 AWS c5.18xlarge instances, offering 27,648 cores with 5 PFLOPS of theoretical computational power. This correlates to an achieved peak efficiency of over 20% and a close-to 90% parallel efficiency in a weak scaling setup. In terms of strong scalability, we were able to strong-scale a science scenario from 2 to 64 instances with 60% parallel efficiency. This work is, to the best of our knowledge, the first of its kind at such a large scale.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   74.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Source: https://www.gartner.com/en/newsroom/press-releases/2018-09-12-gartner-forecasts-worldwide-public-cloud-revenue-to-grow-17-percent-in-2019.

  2. 2.

    LIBXSMM is available from: https://github.com/hfp/libxsmm.

  3. 3.

    http://equake-rc.info/static/publish/paper/SRF-Description-Graves_2.0.pdf.

  4. 4.

    https://catalog.data.gov/dataset/national-elevation-dataset-ned-1-3-arc-second-downloadable-data-collection-national-geospatial.

  5. 5.

    AWS ParallelCluster is available from: https://aws-parallelcluster.readthedocs.io.

  6. 6.

    Slurm GCP is available from: https://github.com/SchedMD/slurm-gcp.

  7. 7.

    AWS ParallelCluster supports further submission systems, e.g., AWS Batch or SGE.

  8. 8.

    https://ark.intel.com/products/120496/Intel-Xeon-Platinum-8180-Processor-38-5M-Cache-2-50-GHz.

  9. 9.

    AWS News Blog post: https://aws.amazon.com/blogs/aws/amazon-ec2-update-additional-instance-types-nitro-system-and-cpu-options/.

  10. 10.

    https://aws.amazon.com/ec2/instance-types/.

  11. 11.

    https://cloud.google.com/compute/docs/cpu-platforms.

  12. 12.

    https://aws.amazon.com/blogs/aws/new-c5n-instances-with-100-gbps-networking/.

  13. 13.

    http://mvapich.cse.ohio-state.edu/download/mvapich/osu-micro-benchmarks-5.5.tar.gz.

  14. 14.

    https://www.olcf.ornl.gov/olcf-resources/compute-systems/summit/.

  15. 15.

    Verification of FP32 for ADER-DG seismic wave propagation is recent work (see http://doi.org/10.17605/OSF.IO/H9G5N and http://opt.dial3343.org).

References

  1. Alliez, P., et al.: 3D mesh generation. In: CGAL User and Reference Manual (2018)

    Google Scholar 

  2. Breuer, A., et al.: Petascale local time stepping for the ADER-DG finite element method. In: IPDPS 2016 (2016)

    Google Scholar 

  3. Breuer, A., Heinecke, A., Rettenberger, S., Bader, M., Gabriel, A.-A., Pelties, C.: Sustained petascale performance of seismic simulations with SeisSol on SuperMUC. In: Kunkel, J.M., Ludwig, T., Meuer, H.W. (eds.) ISC 2014. LNCS, vol. 8488, pp. 1–18. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07518-1_1

    Chapter  Google Scholar 

  4. Breuer, A., Heinecke, A., Cui, Y.: EDGE: extreme scale fused seismic simulations with the discontinuous Galerkin method. In: Kunkel, J.M., Yokota, R., Balaji, P., Keyes, D. (eds.) ISC 2017. LNCS, vol. 10266, pp. 41–60. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58667-0_3

    Chapter  Google Scholar 

  5. Chen, P., Lee, E.-J.: Full-3D Seismic Waveform Inversion: Theory, Software and Practice. SG. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16604-9

    Book  MATH  Google Scholar 

  6. Custódio, S., et al.: The 2004 mw6.0 Parkfield, California, earthquake: inversion of near-source ground motion using multiple data sets. Geophys. Res. Lett. 32(23) (2005)

    Google Scholar 

  7. Deelman, E., et al.: The cost of doing science on the cloud: the montage example. In: SC 2008 (2008)

    Google Scholar 

  8. Evangelinos, C., et al.: Cloud computing for parallel scientific HPC applications: feasibility of running coupled atmosphere-applications (2008)

    Google Scholar 

  9. Geuzaine, C., et al.: Gmsh: a 3-d finite element mesh generator with built-in pre- and post-processing facilities. Numer. Methods Eng. 79(11), 1309 (2009)

    Article  MathSciNet  Google Scholar 

  10. Goto, K., et al.: Anatomy of high-performance matrix multiplication. ACM Trans. Math. Softw. 34, 12 (2008)

    Article  MathSciNet  Google Scholar 

  11. Graves, R., et al.: Cybershake: a physics-based seismic hazard model for Southern California. Pure Appl. Geophys. 168(3), 367–381 (2011)

    Article  Google Scholar 

  12. Heinecke, A., et al.: Petascale high order dynamic rupture earthquake simulations on heterogeneous supercomputers. In: SC 2014 (2014)

    Google Scholar 

  13. Intel: Intel Xeon Processor Scalable Family Specification Update (2018)

    Google Scholar 

  14. Jackson, K.R., et al.: Performance analysis of high performance computing applications on the Amazon web services cloud. In: CCCTS 2010 (2010)

    Google Scholar 

  15. Mauch, V., et al.: High performance cloud computing. Future Gener. Comput. Syst. 29, 1408 (2013)

    Article  Google Scholar 

  16. McCalpin, J.D.: HPL and DGEMM performance variability on the Xeon Platinum 8160 processor. In: SC 2018, pp. 18:1–18:13. IEEE Press, Piscataway (2018)

    Google Scholar 

  17. Mohammadi, M., et al.: Comparative benchmarking of cloud computing vendors with high performance linpack. In: HPCCC 2018 (2018)

    Google Scholar 

  18. Napper, J., et al.: Can cloud computing reach the top500? In: UCHPC-MAW 2009 (2009)

    Google Scholar 

  19. Schoeder, S., et al.: Efficient explicit time stepping of high order discontinuous Galerkin schemes for waves. arXiv e-prints arXiv:1805.03981, May 2018

  20. Small, P., et al.: The SCEC unified community velocity model software framework. Seismol. Res. Lett. 88(6), 1539 (2017)

    Article  Google Scholar 

  21. Top500 Authors: Top500 List, November 2013

    Google Scholar 

  22. Uphoff, C., et al.: Extreme scale multi-physics simulations of the tsunamigenic 2004 sumatra megathrust earthquake. In: SC 2017 (2017)

    Google Scholar 

  23. Yvinec, M.: 2D triangulation. In: CGAL User and Reference Manual (2018)

    Google Scholar 

  24. Zhao, L., et al.: Strain green’s tensors, reciprocity, and their applications to seismic source and structure studies. Bull. Seismol. Soc. Am. 96(5), 1753 (2006)

    Article  Google Scholar 

Download references

Acknowledgements

EDGE, EDGEcut and the discussed cloud-related scripts are available under BSD-3 from the linked resources at: http://dial3343.org. We thank David Lenz for his contributions to EDGEcut. We thank the AWS Cloud Credits for Research and Academic Google Cloud program. At AWS we thank Walker Stemple, Linda Hedges, Aaron Bucher, Heather Matson, Randy Ridgley and Pierre-Yves Aquilanti for their patient and very helpful support. This work was supported by the Southern California Earthquake Center through award #18211.

Optimization Notice. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance. Intel, Xeon, and Intel Xeon Phi are trademarks of Intel Corporation in the U.S. and/or other countries.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alexander Heinecke .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Breuer, A., Cui, Y., Heinecke, A. (2019). Petaflop Seismic Simulations in the Public Cloud. In: Weiland, M., Juckeland, G., Trinitis, C., Sadayappan, P. (eds) High Performance Computing. ISC High Performance 2019. Lecture Notes in Computer Science(), vol 11501. Springer, Cham. https://doi.org/10.1007/978-3-030-20656-7_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-20656-7_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-20655-0

  • Online ISBN: 978-3-030-20656-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics