Abstract
The current trend to multicore architectures underscores the need of parallelism. While new languages and alternatives for supporting more efficiently these systems are proposed, MPI faces this new challenge. Therefore, up-to-date performance evaluations of current options for programming multicore systems are needed. This paper evaluates MPI performance against Unified Parallel C (UPC) and OpenMP on multicore architectures. From the analysis of the results, it can be concluded that MPI is generally the best choice on multicore systems with both shared and hybrid shared/distributed memory, as it takes the highest advantage of data locality, the key factor for performance in these systems. Regarding UPC, although it exploits efficiently the data layout in memory, it suffers from remote shared memory accesses, whereas OpenMP usually lacks efficient data locality support and is restricted to shared memory systems, which limits its scalability.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
MPI Forum, http://www.mpi-forum.org (last visited: June 2009)
OpenMP, http://openmp.org (last visited: June 2009)
Unified Parallel, C., http://upc.gwu.edu (last visited: June 2009)
NAS Parallel Benchmarks, http://www.nas.nasa.gov/Resources/Software/npb.html (last visited: June 2009)
Rabenseifner, R., Hager, G., Jost, G.: Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes. In: Proc. of the 17th Euromicro Intl. Conf. on Parallel, Distributed, and Network-Based Processing (PDP 2009), Weimar (Germany), pp. 427–436 (2009)
Rabenseifner, R., Hager, G., Jost, G., Keller, R.: Hybrid MPI and OpenMP Parallel Programming. In: Mohr, B., Träff, J.L., Worringen, J., Dongarra, J. (eds.) PVM/MPI 2006. LNCS, vol. 4192, p. 11. Springer, Heidelberg (2006)
El-Ghazawi, T.A., Cantonnet, F., Yao, Y., Annareddy, S., Mohamed, A.S.: Productivity Analysis of the UPC Language. In: Proc. 3rd Workshop on Performance Modeling, Evaluation and Optimization of Parallel and Distributed Systems (PMEO 2004), Santa Fe (NM), pp. 1–7 (2004)
El-Ghazawi, T.A., Sébastien, C.: UPC Benchmarking Issues. In: Proc. 30th IEEE Intl. Conf. on Parallel Processing (ICPP 2001), Valencia (Spain), pp. 365–372 (2001)
El-Ghazawi, T.A., Cantonnet, F.: UPC Performance and Potential: a NPB Experimental Study. In: Proc. of the 15th ACM/IEEE Conf. on Supercomputing (SC 2002), Baltimore (MD), pp. 1–26 (2002)
Cantonnet, F., Yao, Y., Annareddy, S., Mohamed, A., El-Ghazawi, T.A.: Performance Monitoring and Evaluation of a UPC Implementation on a NUMA Architecture. In: Proc. of the 2nd Workshop on Performance Modeling, Evaluation and Optimization of Parallel and Distributed Systems (PMEO 2003), Nice (France), 274 (8 Pages) (2003)
Berkeley UPC, http://upc.lbl.gov/ (last visited: June 2009)
Mallón, D.A., Taboada, G.L., Touriño, J., Doallo, R.: NPB-MPJ: NAS Parallel Benchmarks Implementation for Message Passing in Java. In: Proc. of the 17th Euromicro Intl. Conf. on Parallel, Distributed, and Network-Based Processing (PDP 2009), Weimar (Germany), pp. 181–190 (2009)
El-Ghazawi, T.A., Cantonnet, F., Yao, Y., Vetter, J.: Evaluation of UPC on the Cray X1. In: Proc. of the 47th Cray User Group meeting (CUG 2005), Albuquerque (NM), 10 Pages (2005)
Kayi, A., Yao, Y., El-Ghazawi, T.A., Newby, G.: Experimental Evaluation of Emerging Multi-core Architectures. In: Proc. of the 6th Workshop on Performance Modeling, Evaluation and Optimization of Parallel and Distributed Systems (PMEO 2007), Long Beach (CA), pp. 1–6 (2007)
Curtis-Maury, M., Ding, X., Antonopoulos, C.D., Nikolopoulos, D.S.: An Evaluation of OpenMP on Current and Emerging Multithreaded/Multicore Processors. In: Mueller, M.S., Chapman, B.M., de Supinski, B.R., Malony, A.D., Voss, M. (eds.) IWOMP 2005 and IWOMP 2006. LNCS, vol. 4315, pp. 133–144. Springer, Heidelberg (2008)
Finis Terrae Supercomputer, http://www.top500.org/system/details/9500 (last visited: June 2009)
Taboada, G.L., Teijeiro, C., Touriño, J., Fraguela, B.B., Doallo, R., Mouriño, J.C., Mallón, D.A., Gómez, A.: Performance Evaluation of Unified Parallel C Collective Communications. In: Proc. of the 11th IEEE Intl. Conf. on High Performance Computing and Communications (HPCC 2009), Seoul (Korea), 10 Pages (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mallón, D.A. et al. (2009). Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures. In: Ropo, M., Westerholm, J., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2009. Lecture Notes in Computer Science, vol 5759. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03770-2_24
Download citation
DOI: https://doi.org/10.1007/978-3-642-03770-2_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03769-6
Online ISBN: 978-3-642-03770-2
eBook Packages: Computer ScienceComputer Science (R0)