skip to main content
10.1145/3705956.3705962acmotherconferencesArticle/Chapter ViewAbstractPublication PageshpcctConference Proceedingsconference-collections
research-article

Resource Aware power modelling for compute-bound tasks on GPUs

Published: 28 December 2024 Publication History

Abstract

Scientific computing workloads require significant amount of energy when executed on Graphical Processing Units (GPUs). To make the execution of such workloads energy-optimal, power models that can accurately estimate power, hence total energy, are required. In this paper, we propose a GPU power model as a linear function of core frequency, number of active GPU lanes, and number of Thread Blocks (TBs) for compute-bound kernels; these are kernels that require minimal memory access and CPU-GPU transfers. We implement and validate our model using kernels from different domains. Since we use common architectural components such as core frequency, lanes, and a general programming paradigm such as TBs, we build a model on a Pascal GPU which can be easily extended to future GPU generations. We find that the constant power term in our model is the primary component. We achieve a Mean Absolute Percentage Error (MAPE) of 5% and 8% on two different kernels from the CUDA SDK samples and Rodinia benchmark, during validation on a Pascal GPU. We also obtain a MAPE of 14.8% from our evaluation of the GICOV kernel from the Rodinia benchmark on an Ampere GPU; we find these initial results encouraging, and look forward to extend our power model further over a wider range of kernels and architectures.

References

[1]
Peng B., Huang X., Wang S., and Jiang J. [n. d.]. A Real-Time medical ultrasound simulator based on a Generative Adversarial Network Model. In Proceedings of the International Conference on Image Processing, 2019. 4629–4633.
[2]
Robert A. Bridges, Neena Imam, and Tiffany M. Mintz. 2016. Understanding GPU Power: A Survey of Profiling, Modeling, and Simulation Methods. ACM Computing surveys 49 (2016), 1–27.
[3]
Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, Sang-Ha Lee, and Kevin Skadron. 2009. Rodinia: A Benchmark Suite for Heterogeneous Computing. In Proceedings of the IEEE International Symposium on Workload Characterization, October 2009. 44–54.
[4]
Lee D., Dinov I., Dong B., Gutman B., Yanovsky I., and A. W. Toga. [n. d.]. CUDA optimization strategies for compute- and memory-bound neuroimaging algorithms. In Computer methods and programs in biomedicine, 2019, Vol. 106. 175–187. Issue 3.
[5]
Stone John E., Phillips James C., Freddolino Peter L., Hardy David J., Trabuco Leonardo G., and Schulten Klaus. 2007. Accelerating molecular modeling applications with graphics processors. Journal of Computational Chemistry 28, 16 (2007), 2618–2640.
[6]
Joseph L. Greathouse and Gabriel H. Loh. 2018. Machine Learning for Performance and Power Modeling of Heterogeneous Systems. In Proceedings of the IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 2018. ACM, New York, NY, USA, 1–6.
[7]
Sunpyo Hong and Hyesoon Kim. 2010. An integrated GPU power and performance model. In Proceedings of the International Symposium on Computer Architecture, June 2010. 280–289.
[8]
Canturk Isci and Margaret Martonosi. 2003. Runtime power monitoring in high-end processors: Methodology and empirical data. In Proceedings of 36th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2003. 93–104.
[9]
R Jacques, R Taylor, J Wong, and T. McNutt. 2010. Towards real-time radiation therapy: Gpu accelerated superposition/convolution. In Computer Methods and Programs in BioMedicine, Vol. 98. 285–292. Issue 3.
[10]
Nuno Roma João Guerreiro, Aleksandar Ilic and Pedro Tomás. 2018. GPGPU Power Modeling for Multi-Domain Voltage-Frequency Scaling. In Proceedings of the IEEE International Symposium on High Performance Computer Architecture,IEEE, 2018. 789–800.
[11]
Sungbo Jung. 2009. Parallelized pairwise sequence alignment using CUDA on multiple GPUs. In Proceedings of the UT-ORNL-KBRIN Bioinformatics Summit, 2009, Vol. 10.
[12]
Vijay Kandiah, Scott Peverelle, Mahmoud Khairy, Junrui Pan, Amogh Manjunath, Timothy G. Rogers, Tor M. Aamodt, and Nikos Hardavellas. 2021. AccelWattch: A Power Modeling Framework for Modern GPUs. In Proceedings of the 54th Annual IEEE/ACM International Symposium on Microarchitecture, (MICRO ’21), October 18–22, 2021, Virtual Event, Greece. ACM, New York, NY, USA, 738–753.
[13]
Vijay Kandiah, Scott Peverelle, Mahmoud Khairy, Junrui Pan, Amogh Manjunath, Timothy G. Rogers, Tor M. Aamodt, and Nikos Hardavellas. 2021. AccelWattch MICRO’21 Artifact Appendix Manual. https://github.com/accel-sim/accel-sim-framework/blob/release-accelwattch/AccelWattch.md
[14]
Vijay Kandiah, Scott Peverelle, Mahmoud Khairy, Junrui Pan, Amogh Manjunath, Timothy G. Rogers, Tor M. Aamodt, and Nikos Hardavellas. 2021. Accelwattch microbenchmarks. https://github.com/accel-sim/gpu-app-collection/tree/dev/src/cuda/accelwattch-ubench
[15]
Jan Lucas and Ben Juurlink. 2016. ALUPower: Data Dependent Power Consumption in GPUs. In Proceedings of the International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems,IEEE, 2016. 95–104.
[16]
J. Lucas, S. Lal, M. Andersch, M. Alvarez-Mesa, and B. Juurlink. 2013. GPU Power Modeling and Architectural Enhancements for GPU Energy Efficiency. In Proceedings of the International Symposium on Performance Analysis of Systems and Software (ISPASS), IEEE, 2013. 97–106.
[17]
Xiaohan Ma, Mian Dong, Lin Zhong, and Zhigang Deng. 2009. Statistical Power Consumption Analysis and Modeling for GPU-based Computing. In Proceedings of Workshop on Power Aware Computing Systems (HotPower), 2009
[18]
Hitoshi Nagasaka, Naoya Maruyama, Akira Nukada, and Toshio Endo. 2010. Statistical power modeling of GPU kernels using performance counters. In In Proceedings of the International Green Computing Conference, 2010. 115–122.
[20]
[21]
NVidia. 2017. An Even Easier Introduction to CUDA. https://developer.nvidia.com/blog/even-easier-introduction-cuda/.
[22]
NVidia. 2022. NVidia Management Library Application Programming Interface. https://docs.nvidia.com/deploy/archive/R515/nvml-api/index.html.
[26]
Guvench O. Vanommeslaeghe K. and MacKerell A. D. Jr. 2014. Molecular mechanics. Current pharmaceutical design. Journal of Computational Chemistry 20, 20 (2014), 3281–3292.
[27]
Balaji Subramaniam Vignesh Adhinarayanan and Wu-Chun Feng. 2016. Online Power Estimation of Graphics Processing Units. In Proceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2016. 245–254.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
HPCCT '24: Proceedings of the 2024 8th High Performance Computing and Cluster Technologies Conference
July 2024
55 pages
ISBN:9798400716881
DOI:10.1145/3705956
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 December 2024

Check for updates

Author Tags

  1. Core Frequency
  2. GPU
  3. Power Modelling
  4. Processing Blocks
  5. Thread Blocks
  6. gpusets

Qualifiers

  • Research-article

Conference

HPCCT 2024

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 25
    Total Downloads
  • Downloads (Last 12 months)25
  • Downloads (Last 6 weeks)8
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media