An approach to optimise the energy efficiency of iterative computation on integrated GPU–CPU systems

Garzón, E. M.; Moreno, J. J.; Martínez, J. A.

doi:10.1007/s11227-016-1643-9

Published: 10 February 2016

Volume 73, pages 114–125, (2017)

The Journal of Supercomputing

Abstract

The energy efficiency of computational systems is now of paramount importance. This work proposes an approach for improving the energy efficiency of iterative computation on integrated GPU–CPU systems. The proposal, referred to as E-ADITHE, combines iterative procedures with: (1) a heuristic scheme that selects processing units according to an estimate of their energy efficiency and (2) load balancing across heterogeneous processors. A wide variety of iterative algorithms in science and engineering can take advantage of E-ADITHE. The Beltrami filter has been selected as a representative example of such procedures, and its OpenCL version has been used to validate E-ADITHE. The analysis of the results shows that E-ADITHE automatically improves the energy efficiency of parallel iterative algorithms on modern heterogeneous processors.
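The two ingredients named in the abstract, selecting processing units by estimated energy efficiency and balancing the load among them, can be illustrated with a minimal Python sketch. This is not the E-ADITHE implementation, which is not described on this page. The `Sample` record, the function names, and the cost model (throughput and power simply add up across units, and idle power is ignored) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Sample:
    """Hypothetical measurement of one processing unit from a short sampling phase."""
    name: str           # label such as "cpu" or "gpu"
    items_per_s: float  # measured throughput
    watts: float        # measured average power


def efficiency(units):
    """Estimated work per joule for a set of units running together.

    Assumption: throughput and power are additive across units.
    """
    return sum(u.items_per_s for u in units) / sum(u.watts for u in units)


def select_units(samples):
    """Try every non-empty subset of units and keep the most efficient one."""
    candidates = [subset
                  for size in range(1, len(samples) + 1)
                  for subset in combinations(samples, size)]
    return max(candidates, key=efficiency)


def balance(units, total_items):
    """Split the workload in proportion to each unit's measured speed."""
    total_speed = sum(u.items_per_s for u in units)
    shares = {u.name: int(total_items * u.items_per_s / total_speed)
              for u in units}
    # Hand any rounding remainder to the fastest unit.
    fastest = max(units, key=lambda u: u.items_per_s)
    shares[fastest.name] += total_items - sum(shares.values())
    return shares
```

In an iterative algorithm, a sketch like this would presumably be re-run every few iterations with fresh measurements, so that both the selected units and the work split can follow changes in the system.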

Author information

Authors and Affiliations

Department of Informatics, University of Almería ceiA3, 04120, Almería, Spain
E. M. Garzón, J. J. Moreno & J. A. Martínez

Corresponding author

Correspondence to E. M. Garzón.

Additional information

This work has been funded by grants from the Spanish Ministry of Science and Innovation (TIN2012-37483-C03-03, CAPAP-H5 network TIN2014-53522) and Junta de Andalucía (P12-TIC-301), in part financed by the European Regional Development Fund (ERDF).

About this article

Cite this article

Garzón, E.M., Moreno, J.J. & Martínez, J.A. An approach to optimise the energy efficiency of iterative computation on integrated GPU–CPU systems. J Supercomput 73, 114–125 (2017). https://doi.org/10.1007/s11227-016-1643-9

Issue Date: January 2017