Abstract
PGAS models like OpenSHMEM provide interfaces to explicitly initiate one-sided remote memory accesses among processes. In addition, the model also provides synchronizing barriers to ensure a consistent view of the distributed memory at different phases of an application. The incorrect use of such interfaces affects the scalability achievable while using a parallel programming model. This study aims at understanding the effects of these constructs on the energy and power consumption behavior of OpenSHMEM applications. Our experiments show that cost incurred in terms of the total energy and power consumed depends on multiple factors across the software and hardware stack. We conclude that there is a significant impact on the power consumed by the CPU and DRAM due to multiple factors including the design of the data transfer patterns within an application, the design of the communication protocols within a middleware, the architectural constraints laid by the interconnect solutions, and also the levels of memory hierarchy within a compute node. This work motivates treating energy and power consumption as important factors while designing compute solutions for current and future distributed systems.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Intel 64 and ia-32 architectures software developers manual volume 3b: System programming guide, part 2
Thrifty: An exascale architecture for energy-proportional computing, http://science.energy.gov/~/media/ascr/pdf/research/cs/aa/A_oph_uiuc_thrifty_110215.pdf
Linux tuning guide, amd opteron 6200 series processors (April 2012)
Nvml api reference manual, ver.5.319.43 (August 2013)
Aboughazaleh, N., Childers, B., Melhem, R., Craven, M.: Collaborative compiler-os power management for time-sensitive applications. Tech. rep., Department of Computer Science, University of Pittsburgh (2002)
Benedict, S.: Review: Energy-aware performance analysis methodologies for hpc architectures-an exploratory study. J. Netw. Comput. Appl. 35(6), 1709–1719 (2012), http://dx.doi.org/10.1016/j.jnca.2012.08.003
Choi, J.W., Bedard, D., Fowler, R., Vuduc, R.: A theoretical framework for algorithm-architecture co-design. In: Proc. IEEE Int’l. Parallel and Distributed Processing Symp. (IPDPS), Boston, MA, USA (May 2013)
David, H., Gorbatov, E., Hanebutte, U.R., Khanna, R., Le, C.: Rapl: Memory power estimation and capping. In: 2010 ACM/IEEE International Symposium on Low-Power Electronics and Design (ISLPED), pp. 189–194 (2010)
Dongarra, J., Ltaief, H., Luszczek, P., Weaver, V.M.: Energy Footprint of Advanced Dense Numerical Linear Algebra Using Tile Algorithms on Multicore Architectures. In: 2012 Second International Conference on Cloud and Green Computing, pp. 274–281 (2012), http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6382829
High Performance Computing Tools group and at UH, Extreme Scale Systems Center at ORNL: Openshmem application programming interface, version 1.0. Tech. rep., University of Houston (UH), Oak Ridge National Laboratory, ORNL (2012), http://www.openshmem.org
Hackenberg, D., Ilsche, T., Schone, R., Molka, D., Schmidt, M., Nagel, W.: Power measurement techniques on standard compute nodes: A quantitative comparison. In: 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 194–204 (2013)
Hoefler, T.: Software and hardware techniques for power-efficient hpc networking. Computing in Science Engineering 12(6), 30–37 (2010)
Kerrisk, M.: Linux programmer’s manual (2012), http://man7.org/linux/man-pages/man7/cpuset.7.html
Knüpfer, A., Brunst, H., Doleschal, J., Jurenz, M., Lieber, M., Mickler, H., Müller, M., Nagel, W.: The vampir performance analysis tool-set. In: Resch, M., Keller, R., Himmler, V., Krammer, B., Schulz, A. (eds.) Tools for High Performance Computing, pp. 139–155. Springer, Heidelberg (2008), http://dx.doi.org/10.1007/978-3-540-68564-7_9
Korthikanti, V.A., Agha, G.: Towards optimizing energy costs of algorithms for shared memory architectures. In: Proceedings of the 22nd ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2010, p. 157 (2010), http://portal.acm.org/citation.cfm?doid=1810479.1810510
Li, D., de Supinski, B.R., Schulz, M., Cameron, K., Nikolopoulos, D.S.: Hybrid MPI/OpenMP power-aware computing. In: 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp. 1–12 (2010), http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5470463
Markatos, E., Crovella, M., Das, P., Dubnicki, C., LeBlanc, T.: The effects of multiprogramming on barrier synchronization. In: Proceedings of the Third IEEE Symposium on Parallel and Distributed Processing, pp. 662–669 (1991)
Mucci, P.J., Browne, S., Deane, C., Ho, G.: Papi: A portable interface to hardware performance counters. In: Proceedings of the Department of Defense HPCMP Users Group Conference, pp. 7–10 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Jana, S., Hernandez, O., Poole, S., Hsu, CH., Chapman, B.M. (2014). Analyzing the Energy and Power Consumption of Remote Memory Accesses in the OpenSHMEM Model. In: Poole, S., Hernandez, O., Shamis, P. (eds) OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools. OpenSHMEM 2014. Lecture Notes in Computer Science, vol 8356. Springer, Cham. https://doi.org/10.1007/978-3-319-05215-1_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-05215-1_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05214-4
Online ISBN: 978-3-319-05215-1
eBook Packages: Computer ScienceComputer Science (R0)