Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures

Mallón, Damián A.; Taboada, Guillermo L.; Teijeiro, Carlos; Touriño, Juan; Fraguela, Basilio B.; Gómez, Andrés; Doallo, Ramón; Mouriño, J. Carlos

doi:10.1007/978-3-642-03770-2_24

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 5759)

Included in the following conference series:

European Parallel Virtual Machine / Message Passing Interface Users’ Group Meeting

Abstract

The current trend toward multicore architectures underscores the need for parallelism. While new languages and alternatives are being proposed to support these systems more efficiently, MPI faces this new challenge. Up-to-date performance evaluations of the current options for programming multicore systems are therefore needed. This paper evaluates MPI performance against Unified Parallel C (UPC) and OpenMP on multicore architectures. The analysis of the results shows that MPI is generally the best choice on multicore systems with both shared and hybrid shared/distributed memory, as it takes the greatest advantage of data locality, the key factor for performance in these systems. UPC, although it efficiently exploits the data layout in memory, suffers from remote shared memory accesses, whereas OpenMP usually lacks efficient data locality support and is restricted to shared memory systems, which limits its scalability.

Author information

Authors and Affiliations

Galicia Supercomputing Center (CESGA), Santiago de Compostela, Spain
Damián A. Mallón, Andrés Gómez & J. Carlos Mouriño
Computer Architecture Group, University of A Coruña, A Coruña, Spain
Guillermo L. Taboada, Carlos Teijeiro, Juan Touriño, Basilio B. Fraguela & Ramón Doallo

Editor information

Editors and Affiliations

Department of Information Technology, Åbo Akademi, 20500, Turku, Finland
Matti Ropo & Jan Westerholm
Department of Electrical Engineering and Computer Science, University of Tennessee, 37996-3450, Knoxville, TN, USA
Jack Dongarra

About this paper

Cite this paper

Mallón, D.A. et al. (2009). Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures. In: Ropo, M., Westerholm, J., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2009. Lecture Notes in Computer Science, vol 5759. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03770-2_24

DOI: https://doi.org/10.1007/978-3-642-03770-2_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03769-6
Online ISBN: 978-3-642-03770-2
eBook Packages: Computer Science, Computer Science (R0)
