poster

GOPipe: a granularity-oblivious programming framework for pipelined stencil executions on GPU

Authors:
Chanyoung Oh

University of Seoul

University of Seoul
View Profile

,
Zhen Zheng

Tsinghua University

Tsinghua University
View Profile

,
Xipeng Shen

North Carolina State University

North Carolina State University
View Profile

,
Jidong Zhai

Tsinghua University

Tsinghua University
View Profile

,
Youngmin Yi

University of Seoul

University of Seoul
View Profile

PPoPP '19: Proceedings of the 24th Symposium on Principles and Practice of Parallel ProgrammingFebruary 2019Pages 431–432https://doi.org/10.1145/3293883.3301494

Published:16 February 2019Publication History

PPoPP '19: Proceedings of the 24th Symposium on Principles and Practice of Parallel Programming

Pages 431–432

ABSTRACT

Recent studies have shown promising performance benefits of pipelined stencil applications. An important factor for the computing efficiency of such pipelines is the granularity of a task. We presents GOPipe, the first granularity-oblivious programming framework for efficient pipelined stencil executions. With GOPipe, programmers no longer need to specify the appropriate task granularity. GOPipe automatically finds it, and schedules tasks of that granularity while observing all inter-task and inter-stage data dependencies. In our experiments on four real-life applications, GOPipe outperforms the state-of-the-art by up to 4.57× with a much better programming productivity.

References

Markus Steinberger, Michael Kenzel, Pedro Boechat, Bernhard Kerbl, Mark Dokter, and Dieter Schmalstieg. 2014. Whippletree: Task-based Scheduling of Dynamic Workloads on the GPU. ACM Transactions on Graphics 33, 6 (2014), 1--11. Google ScholarDigital Library
Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, and Wenguang Chen. 2017. VersaPipe: A Versatile Programming Framework for Pipelined Computing on GPU. In Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-50). Cambridge, MA, USA, 587--599. Google ScholarDigital Library

Index Terms

GOPipe: a granularity-oblivious programming framework for pipelined stencil executions on GPU
1. Computing methodologies
  1. Parallel computing methodologies
2. General and reference
  1. Cross-computing tools and techniques
    1. Performance

Recommendations

GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU
PACT '20: Proceedings of the ACM International Conference on Parallel Architectures and Compilation Techniques

Recent studies have shown promising performance benefits when multiple stages of a pipelined stencil application are mapped to different parts of a GPU to run concurrently. An important factor for the computing efficiency of such pipelines is the ...
Read More
Juggler: a dependence-aware task-based execution framework for GPUs
PPoPP '18: Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming

Scientific applications with single instruction, multiple data (SIMD) computations show considerable performance improvements when run on today's graphics processing units (GPUs). However, the existence of data dependences across thread blocks may ...
Read More
Juggler: a dependence-aware task-based execution framework for GPUs
PPoPP '18

Scientific applications with single instruction, multiple data (SIMD) computations show considerable performance improvements when run on today's graphics processing units (GPUs). However, the existence of data dependences across thread blocks may ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
PPoPP '19: Proceedings of the 24th Symposium on Principles and Practice of Parallel Programming
February 2019
472 pages
ISBN:9781450362252
DOI:10.1145/3293883
General Chair:
Jeff Hollingsworth
University of Maryland
,
Program Chair:
Idit Keidar
Technion, Israel
Copyright © 2019 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 16 February 2019
Check for updates
Author Tags
GPU
data dependence
pipelined execution
Qualifiers
- poster
Conference

Acceptance Rates
PPoPP '19 Paper Acceptance Rate29of152submissions,19%Overall Acceptance Rate230of1,014submissions,23%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 184
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.