Abstract
SMP Clusters with fat nodes offer an interesting capability for large applications that employ a hybrid parallelization model: to improve load balance, the number of threads can be increased in order to speed-up busy MPI processes or decreased to slow down idle MPI processes, provided these processes reside on the same SMP node. We developed a library which performs this thread adjustment automatically during program execution. Experimental results demonstrate remarkable speed-ups with minimal programming effort.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Jost, G., Jin, H., an Mey, D., Hattay, F.: Comparing the OpenMP, MPI and Hybrid Programming Paradigms on an SMP Cluster, NAS Technical Report NAS-03-019, NASAAmes Research Center, Moffet Field, CA (November 2003)
Bova, S.W., Breshears, C.P., Gabb, H., Kuhn, B., Magro, B., Eigenmann, R., Gaertner, G., Salvini, S., Scott, H.: Parallel Programming with Message Passing and Directives. In: Computing in Science and Engineering, September 2001, pp. 22–37 (2001)
Rabenseifner, R.: Hybrid Parallel Programming on Parallel Platforms. In: EWOMP 2003 - Fifth European Workshop on OpenMP (2003), http://www.rz.rwth-aachen.de/ewomp03/omptalks/Tuesday/Session7/T01p.pdf
Hesse, M.K., Reinartz, B., Ballmann, J.: Inviscid Flow Computation for the Shuttle-Like Configuration PHOENIX. In: Breitsamter, C., Laschka, B., Heinemann, H.-J., Hilbig, R. (eds.) Notes on Numerical Fluid Mechanics, vol. 87, pp. 172–179. Springer, Heidelberg (2003)
Hesse, M.K., Reinartz, B., Ballmann, J.: Numerical Investigation of the Shuttle-Like Configuration PHOENIX. In: Krause, E., Jäger, W. (eds.) High Performance Computing in Science and Engineering 2002, pp. 379–390. Springer, Heidelberg (2002) ISBN 3-540-43860-2
Hesse, M.K., Reinartz, B., Ballmann, J.: Numerical Investigation of a Reusable Space Transportation System. In: Proceedings of the 3rd International Symposium on Atmospheric Reentry Vehicles and Systems, Arcachon/France, March 24-27 (2003)
Bailey, D., Barszcz, E., Barton, J., Browning, D., Carter, R., Dagum, L., Fatoohi, R., Fineberg, S., Frederickson, P., Lasinski, T., Schreiber, R., Simon, H., Venkatakrishnan, V., Weeratunga, S.: The NAS Parallel Benchmarks, NAS Technical Report RNR-94-007, NASA Ames Research Center, Moffet Field, CA (March 1994)
Bailey, D., Harris, T., Saphir, W., van der Wijngaart, R., Woo, A., Yarrow, M.: The NAS Parallel Benchmarks 2.0. NAS Technical Report NAS-95-020, NASA Ames Research Center, Moffet Field, CA (December 1995)
Van der Wijngaart, R.F., Jin, H.: NAS Parallel Benchmarks, Multi-Zone Versions. NAS Technical Report NAS-03-010, NASA Ames Research Center, Moffet Field, CA (July 2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Spiegel, A., an Mey, D., Bischof, C. (2006). Hybrid Parallelization of CFD Applications with Dynamic Thread Balancing. In: Dongarra, J., Madsen, K., Waśniewski, J. (eds) Applied Parallel Computing. State of the Art in Scientific Computing. PARA 2004. Lecture Notes in Computer Science, vol 3732. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11558958_51
Download citation
DOI: https://doi.org/10.1007/11558958_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29067-4
Online ISBN: 978-3-540-33498-9
eBook Packages: Computer ScienceComputer Science (R0)