Skip to main content

SW-TRRM: Parallel Optimization Research of the Random Ray Method Based on Sunway Bluelight II Supercomputer

  • Conference paper
  • First Online:
Algorithms and Architectures for Parallel Processing (ICA3PP 2023)

Abstract

The Random Ray Method (TRRM) is a new approach to solving partial differential equations (PDEs) based on the method of characteristics (MOC). It employs stochastic rather than deterministic discretization of characteristic tracks and can be used for the numerical simulation of nuclear reactors. In this paper, we propose SW-TRRM, a parallel optimization program for TRRM based on the Sunway Bluelight II Supercomputer for the first time. We present a two-level parallelization scheme that consists of thread-level and process-level optimization. At the thread-level, we introduce three schemes for speeding up within a single core group, including direct parallelization, parallelization by energy groups, and loop structure optimization. At the process-level, we implement task parallelization among multiple processes using domain replication. Moreover, we devise an algorithm to optimize the MPI collective communication across super-nodes. Experimental results show that SW-TRRM achieves a 17.40\(\times \) speedup within a single core group compared to the original TRRM program. When scaled up to 2,048 processes and 133,120 cores, SW-TRRM maintains good strong and weak scalability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Tramm, J.R., Smith, K.S., Forget, B., Siegel, A.R.: The Random Ray Method for neutral particle transport. J. Comput. Phys. 342, 229–252 (2017)

    Article  MathSciNet  Google Scholar 

  2. Zhu, Q., Luo, H., Yang, C., Ding, M., Yin, W., Yuan, X.: Enabling and scaling the HPCG benchmark on the newest generation Sunway supercomputer with 42 million heterogeneous cores. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–13 (2021)

    Google Scholar 

  3. Lewis, E., Smith, M., Tsoulfanidis, N., Palmiotti, G., Taiwo, T., Blomquist, R.: Benchmark specification for Deterministic 2-D/3-D MOX fuel assembly transport calculations without spatial homogenization (C5G7 MOX). NEA/NSC 280, 2001 (2001)

    Google Scholar 

  4. Askew, J.: A characteristics formulation of the neutron transport equation in complicated geometries. Technical report, United Kingdom Atomic Energy Authority (1972)

    Google Scholar 

  5. Romano, P.K., Forget, B.: The OpenMC monte carlo particle transport code. Ann. Nucl. Energy 51, 274–281 (2013)

    Article  Google Scholar 

  6. Hu, W., et al.: 2.5 million-atom ab initio electronic-structure simulation of complex metallic heterostructures with DGDFT. In: 2022 SC22: International Conference for High Performance Computing, Networking, Storage and Analysis (SC), pp. 48–60. IEEE Computer Society (2022)

    Google Scholar 

  7. Boyd, W., Shaner, S., Li, L., Forget, B., Smith, K.: The OpenMOC method of characteristics neutral particle transport code. Ann. Nucl. Energy 68, 43–52 (2014)

    Article  Google Scholar 

  8. Zhu, K., Kong, B., Hou, J., Zhang, H., Guo, J., Li, F.: ARCHER-a new Three-Dimensional method of characteristics neutron transport code for Pebble-bed HTR with coarse mesh finite difference acceleration. Ann. Nucl. Energy 177, 109303 (2022)

    Article  Google Scholar 

  9. Chao, N., Yang, H., Liu, Y.K., Xia, H., Ayodeji, A.: A local method of characteristics for dose assessment. Radiat. Phys. Chem. 173, 108869 (2020)

    Article  Google Scholar 

  10. Tramm, J.R., Smith, K.S., Forget, B., Siegel, A.R.: ARRC: a random ray neutron transport code for nuclear reactor simulation. Ann. Nucl. Energy 112, 693–714 (2018)

    Article  Google Scholar 

  11. Tramm, J.R., Siegel, A.R.: Immortal rays: rethinking random ray neutron transport on GPU architectures. Parallel Comput. 108, 102832 (2021)

    Article  MathSciNet  Google Scholar 

  12. Yang, C., et al.: 10M-core scalable fully-implicit solver for nonhydrostatic atmospheric dynamics. In: SC 2016: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 57–68. IEEE (2016)

    Google Scholar 

  13. Fu, H., et al.: 18.9-Pflops nonlinear earthquake simulation on sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenarios. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–12 (2017)

    Google Scholar 

  14. Liu, Y., et al.: Closing the “Quantum Supremacy” gap: achieving real-time simulation of a random quantum circuit using a new sunway supercomputer. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–12 (2021)

    Google Scholar 

  15. Shang, H., et al.: TensorKMC: kinetic monte carlo simulation of 50 trillion atoms driven by deep learning on a new generation of sunway supercomputer. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–14 (2021)

    Google Scholar 

  16. Xiao, J., et al.: Symplectic structure-preserving particle-in-cell whole-volume simulation of tokamak plasmas to 111.3 trillion particles and 25.7 billion grids. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–13 (2021)

    Google Scholar 

  17. Shang, H., et al.: Extreme-scale ab initio quantum Raman spectra simulations on the leadership HPC system in China. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–13 (2021)

    Google Scholar 

  18. Ma, Z., et al.: BaGuaLu: targeting brain scale pretrained models with over 37 million cores. In: Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp. 192–204 (2022)

    Google Scholar 

  19. Li, M., et al.: Bridging the gap between deep learning and frustrated quantum spin system for extreme-scale simulations on new generation of sunway supercomputer. IEEE Trans. Parallel Distrib. Syst. 33(11), 2846–2859 (2022)

    Article  Google Scholar 

  20. Li, F., et al.: SW_Qsim: a minimize-memory quantum simulator with high-performance on a new sunway supercomputer. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–13 (2021)

    Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China under Grant 62002186, 12105150, and 2021 Shandong Youth Innovation Talent Introduction and Education Plan (Parallel Computing Industrial Software Innovation Team Based on Chinese supercomputer), and TaiShan Scholars Program NO. tsqnz20221148, and the unveiling project of Qilu University of Technology (Shandong Academy of Sciences) under Grant 2022JBZ01-01.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tao Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ren, Z. et al. (2024). SW-TRRM: Parallel Optimization Research of the Random Ray Method Based on Sunway Bluelight II Supercomputer. In: Tari, Z., Li, K., Wu, H. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2023. Lecture Notes in Computer Science, vol 14491. Springer, Singapore. https://doi.org/10.1007/978-981-97-0808-6_22

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-0808-6_22

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0807-9

  • Online ISBN: 978-981-97-0808-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics