DOI: 10.1145/2463209.2488754
research-article

Aging-aware compiler-directed VLIW assignment for GPGPU architectures

Published: 29 May 2013

Abstract

Negative bias temperature instability (NBTI) adversely affects the reliability of a processor by introducing new delay-induced faults. However, the effect of these delay variations is not spread uniformly across functional units and instructions: some are affected more, and are hence less reliable, than others. This paper proposes an NBTI-aware, compiler-directed very long instruction word (VLIW) assignment scheme that distributes the stress of instructions uniformly, with the aim of minimizing aging of the GPGPU architecture without any performance penalty. The proposed solution is an entirely software technique based on static workload characterization and online execution with NBTI monitoring. It equalizes the expected lifetime of each processing element by regenerating aging-aware healthy kernels that respond to the specific health state of the GPGPU. We demonstrate our approach on the AMD Evergreen architecture, where iso-throughput executions of the healthy kernels reduce the NBTI-induced threshold voltage shift by up to 49% (11%) compared to naïve kernel executions, with (without) architectural support for power-gating. The kernel adaptation flow takes an average of 13 milliseconds on a typical host machine, making it suitable for practical implementation.
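
The abstract does not spell out the assignment algorithm, so the following is only a minimal sketch of the general idea of stress balancing across VLIW slots, not the authors' method. It assumes a hypothetical per-instruction stress score (which static workload characterization might supply) and a hypothetical per-slot wear estimate (which NBTI monitoring might supply). Within each bundle, a greedy rule places the most stressful instruction on the least worn slot. It also assumes any instruction can run on any slot, which real hardware may not allow. Because instructions are only permuted inside a bundle, the bundle count, and hence throughput in this toy model, is unchanged. All names (`assign_bundle`, `run_kernel`, `slot_wear`) are illustrative.

```python
from typing import List, Tuple


def assign_bundle(stress: List[float], slot_wear: List[float]) -> List[int]:
    """Return, for each instruction in one bundle, the slot index it is mapped to.

    Greedy rule (an assumption of this sketch): the most stressful
    instruction goes to the least worn slot, the next to the next, and so on.
    """
    if len(stress) > len(slot_wear):
        raise ValueError("bundle has more instructions than available slots")
    # Instruction indices, most stressful first.
    instrs = sorted(range(len(stress)), key=lambda i: stress[i], reverse=True)
    # Slot indices, least worn (healthiest) first.
    slots = sorted(range(len(slot_wear)), key=lambda s: slot_wear[s])
    assignment = [-1] * len(stress)
    for i, s in zip(instrs, slots):
        assignment[i] = s
    return assignment


def run_kernel(
    bundles: List[List[float]], slot_wear: List[float]
) -> Tuple[List[List[int]], List[float]]:
    """Assign every bundle in order, accumulating hypothetical wear per slot."""
    wear = list(slot_wear)  # copy so the caller's list is not modified
    schedule = []
    for bundle in bundles:
        assignment = assign_bundle(bundle, wear)
        for instr, slot in enumerate(assignment):
            wear[slot] += bundle[instr]
        schedule.append(assignment)
    return schedule, wear


if __name__ == "__main__":
    # Two identical bundles on four initially healthy slots.
    schedule, wear = run_kernel([[2.0, 1.0], [2.0, 1.0]], [0.0, 0.0, 0.0, 0.0])
    print(schedule)  # slot chosen for each instruction, per bundle
    print(wear)      # accumulated wear per slot
```

In this toy run the second bundle is steered to the slots the first bundle left idle, so wear spreads across all four slots instead of piling up on the first two.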

Published In

DAC '13: Proceedings of the 50th Annual Design Automation Conference
May 2013
1285 pages
ISBN:9781450320719
DOI:10.1145/2463209

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. GPGPU
  2. NBTI
  3. VLIW
  4. adaptive kernel
  5. aging-aware compilation
  6. dynamic binary optimizer

Qualifiers

  • Research-article

Conference

DAC '13

Acceptance Rates

Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

Cited By

  • (2021) Aging-Aware Parallel Execution. IEEE Embedded Systems Letters, 13:3 (122-125). DOI: 10.1109/LES.2020.3021854. Online publication date: Sep-2021
  • (2021) Mitigating the processor aging through dynamic concurrency throttling. Journal of Parallel and Distributed Computing, 156 (86-100). DOI: 10.1016/j.jpdc.2021.05.006. Online publication date: Oct-2021
  • (2020) Improving Reliability of Soft Real-Time Embedded Systems on Integrated CPU and GPU Platforms. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 39:10 (2218-2229). DOI: 10.1109/TCAD.2019.2940681. Online publication date: Oct-2020
  • (2020) Hardware/Software Codesign for Energy Efficiency and Robustness: From Error-Tolerant Computing to Approximate Computing. Dependable Embedded Systems (527-543). DOI: 10.1007/978-3-030-52017-5_22. Online publication date: 10-Dec-2020
  • (2019) Transparent Aging-Aware Thread Throttling. 2019 31st International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) (1-8). DOI: 10.1109/SBAC-PAD.2019.00014. Online publication date: Oct-2019
  • (2018) Aging-Aware Workload Management on Embedded GPU Under Process Variation. IEEE Transactions on Computers, 67:7 (920-933). DOI: 10.1109/TC.2018.2789904. Online publication date: 1-Jul-2018
  • (2017) Low-overhead Aging-aware Resource Management on Embedded GPUs. Proceedings of the 54th Annual Design Automation Conference 2017 (1-6). DOI: 10.1145/3061639.3062277. Online publication date: 18-Jun-2017
  • (2016) Invited - Optimizing device reliability effects at the intersection of physics, circuits, and architecture. Proceedings of the 53rd Annual Design Automation Conference (1-6). DOI: 10.1145/2897937.2905016. Online publication date: 5-Jun-2016
  • (2016) A Novel Peak Power Supply Noise Measurement and Adaptation System for Integrated Circuits. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 24:5 (1715-1727). DOI: 10.1109/TVLSI.2015.2492000. Online publication date: May-2016
  • (2016) Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software. Proceedings of the IEEE, 104:7 (1410-1448). DOI: 10.1109/JPROC.2016.2518864. Online publication date: Jul-2016
