ABSTRACT
Although approximate computing is widely used, it demands substantial programming effort to find appropriate approximation patterns among multiple pre-defined patterns in order to achieve high performance. We therefore propose GATE, an automatic approximation framework that uncovers hidden approximation opportunities in any data-parallel program, regardless of the code pattern or application characteristics, using two compiler techniques: subgraph-level approximation (SGLA) and approximate thread merge (ATM). GATE also features conservative/aggressive tuning and dynamic calibration to maximize performance while maintaining the target output quality (TOQ) at runtime. Our framework achieves an average performance gain of 2.54x over the baseline with minimal accuracy loss.
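To make the approximate thread merge idea concrete, here is a minimal Python sketch of the general pattern: adjacent work-items are merged so that one computed value is reused by its neighbors, trading accuracy for roughly a `merge`-fold reduction in compute. The function names and the elementwise kernel are illustrative assumptions, not taken from the paper.

```python
def exact_kernel(xs):
    # Exact version: every element gets its own (notionally expensive) computation.
    return [x * x for x in xs]

def atm_kernel(xs, merge=2):
    # Approximate version in the spirit of thread merge: compute one
    # representative per group of `merge` elements and broadcast that
    # result to the rest of the group, halving the work for merge=2.
    out = []
    for i in range(0, len(xs), merge):
        rep = xs[i] * xs[i]  # single computation per merged group
        out.extend([rep] * min(merge, len(xs) - i))
    return out

print(exact_kernel([1, 2, 3, 4]))  # [1, 4, 9, 16]
print(atm_kernel([1, 2, 3, 4]))    # [1, 1, 9, 9]
```

On a GPU the same transformation would merge neighboring threads in a kernel rather than loop iterations, but the accuracy/performance trade-off is the same: smoother data tolerates larger merge factors.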
GATE: A Generalized Dataflow-level Approximation Tuning Engine For Data Parallel Architectures