Abstract
We evaluate the performance of a parallel 3D finite-difference method (FDM) simulation of seismic wave propagation on the Intel Xeon Phi coprocessor. Because the byte/flop ratio of future machines is forecast to keep decreasing, we optimized the program to require a lower byte/flop ratio by fusing the original major kernels and omitting the storing and loading of intermediate variables. We confirm that 1) MPI/OpenMP hybrid parallel computing with hyper-threading is more efficient than pure MPI parallel computing and 2) the FDM simulation with the triple DO loops split runs 1.3 times faster than the modified code with unsplit triple DO loops, whereas a fused double DO-loop calculation yields no speedup. We consider that loop distribution is effective for prefetching and for the thread parallelization of each loop because it promotes the use and reuse of cached data.
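The two loop transformations named in the abstract can be illustrated with a minimal sketch. This is not the authors' code and not the ppOpen-APPL/FDM API: it assumes a 1D staggered-grid velocity update in place of the 3D kernel, and every function and variable name below is hypothetical. It shows only the structure of the transformations (fusion removes an intermediate array, distribution splits one loop body into separate loops), not their performance on any machine.

```python
def update_two_pass(v, s, dt, dx, rho):
    """Unfused form: the derivative is stored in an intermediate array."""
    n = len(v)
    ds = [0.0] * n                       # intermediate variable (extra store and load)
    for i in range(n - 1):
        ds[i] = (s[i + 1] - s[i]) / dx   # kernel 1: spatial derivative of stress
    for i in range(n - 1):
        v[i] += dt / rho * ds[i]         # kernel 2: velocity update
    return v


def update_fused(v, s, dt, dx, rho):
    """Fused form: the derivative is consumed immediately, no intermediate array."""
    n = len(v)
    for i in range(n - 1):
        v[i] += dt / rho * ((s[i + 1] - s[i]) / dx)
    return v


def update_two_fields_single_loop(vx, vy, sxx, sxy, dt, dx, rho):
    """One loop body updates both velocity components."""
    n = len(vx)
    for i in range(n - 1):
        vx[i] += dt / rho * ((sxx[i + 1] - sxx[i]) / dx)
        vy[i] += dt / rho * ((sxy[i + 1] - sxy[i]) / dx)
    return vx, vy


def update_two_fields_distributed(vx, vy, sxx, sxy, dt, dx, rho):
    """Loop distribution: each component gets its own loop over the same range."""
    n = len(vx)
    for i in range(n - 1):
        vx[i] += dt / rho * ((sxx[i + 1] - sxx[i]) / dx)
    for i in range(n - 1):
        vy[i] += dt / rho * ((sxy[i + 1] - sxy[i]) / dx)
    return vx, vy
```

Each pair of functions computes the same values; only the memory traffic and the number of arrays touched per loop differ. In a compiled, threaded implementation, each distributed loop could be parallelized and prefetched separately, which is the effect the abstract attributes to loop distribution.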
Acknowledgments
This work is supported by Core Research for Evolutional Science and Technology (CREST), the Japan Science and Technology Agency (JST), Japan. We are grateful to Professor Tsuruoka at the Earthquake Research Institute, The University of Tokyo, for providing the Intel Xeon Phi coprocessor computing environment. We also thank our anonymous reviewers for their constructive comments, which helped us revise the manuscript.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Mori, F., Matsumoto, M., Furumura, T. (2015). Performance Optimization of the 3D FDM Simulation of Seismic Wave Propagation on the Intel Xeon Phi Coprocessor Using the ppOpen-APPL/FDM Library. In: Daydé, M., Marques, O., Nakajima, K. (eds) High Performance Computing for Computational Science -- VECPAR 2014. VECPAR 2014. Lecture Notes in Computer Science, vol 8969. Springer, Cham. https://doi.org/10.1007/978-3-319-17353-5_6
DOI: https://doi.org/10.1007/978-3-319-17353-5_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-17352-8
Online ISBN: 978-3-319-17353-5
eBook Packages: Computer Science, Computer Science (R0)