Abstract
This chapter explains hybrid parallel execution techniques for scaling to 10,000 or more parallel executions and for reducing communication time. Several terms are defined in the course of the discussion, and actual examples of hardware and programming for parallel execution are shown. Finally, an experimental methodology for developing hybrid MPI execution is presented.
Notes
1. Performance drops sharply if the programmer does not carefully optimize memory access, because accesses to far (remote) memory occur frequently on a ccNUMA organization. Hence, even on a machine with a large amount of memory, users may not obtain the desired parallel performance.
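One common way to reduce far memory accesses on ccNUMA machines is to initialize data in parallel with the same loop schedule that later computes on it. The following is a minimal sketch of that idea, not code from the chapter. It assumes OpenMP and an operating system that places each memory page on the node of the thread that first writes to it (a "first-touch" policy, which is typical but not guaranteed). The function names `alloc_first_touch` and `sum_local` are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Allocate n doubles and initialize them in parallel.
 * Assumption: under a first-touch policy, each page ends up on the
 * NUMA node of the thread that writes to it first. */
double *alloc_first_touch(size_t n)
{
    assert(n > 0);
    double *a = malloc(n * sizeof *a);
    if (a == NULL) {
        return NULL;
    }
    /* Static schedule: thread t always gets the same contiguous chunk. */
    #pragma omp parallel for schedule(static)
    for (long i = 0; i < (long)n; i++) {
        a[i] = 1.0;
    }
    return a;
}

/* Sum the array with the same static schedule, so each thread reads
 * mostly the pages it touched first (near memory, not far memory). */
double sum_local(const double *a, size_t n)
{
    double s = 0.0;
    #pragma omp parallel for schedule(static) reduction(+:s)
    for (long i = 0; i < (long)n; i++) {
        s += a[i];
    }
    return s;
}
```

Compiled without OpenMP support, the pragmas are ignored and the code runs serially with the same result. Whether the parallel version actually avoids far accesses also depends on thread pinning and the system's page placement settings, which this sketch does not control.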
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Katagiri, T. (2019). Hybrid Parallelization Techniques. In: Geshi, M. (eds) The Art of High Performance Computing for Computational Science, Vol. 1. Springer, Singapore. https://doi.org/10.1007/978-981-13-6194-4_4
DOI: https://doi.org/10.1007/978-981-13-6194-4_4
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6193-7
Online ISBN: 978-981-13-6194-4
eBook Packages: Computer Science, Computer Science (R0)