Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units


GPGPU-2: Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units


March 2009

2009 Proceeding

Conference Chairs:
David Kaeli, Northeastern University
Miriam Leeser, Northeastern University

Publisher:

Association for Computing Machinery, New York, NY, United States

Conference:

GPGPU '09: Second Workshop on General-Purpose Computation on Graphics Processing Units, Washington, D.C., USA, 8 March 2009

ISBN:

978-1-60558-517-8

Published:

08 March 2009


Bibliometrics (reflects downloads up to 07 Mar 2025)

Citation Count

839

Downloads (6 weeks)

Downloads (12 months)

234

Downloads (cumulative)

13,621


Abstract

Graphics cards have long been used to accelerate gaming and 3D graphics applications. More recently, they have begun to be used to accelerate more general-purpose and high-performance applications. GPUs are beginning to be used to accelerate a wide range of remote sensing, environmental monitoring, business forecasting and medical imaging applications, though these efforts have relied on programming interfaces that utilized graphics primitives and libraries. Only recently have general-purpose programming environments become available that allow these platforms to be used to accelerate a wider class of applications.

We are pleased to present these 12 high-quality papers that were selected for the final program of GPGPU-2. The goal of this workshop is to provide a forum to discuss these general-purpose programming environments and platforms, as well as describe successful applications that have leveraged this new approach to acceleration. This year's workshop focuses on a range of applications, but also presents new work in GPU languages and optimization techniques, as well as GPU reliability.


research-article

Accelerating cosmological data analysis with graphics processors

Dylan W. Roeh, Volodymyr V. Kindratenko, Robert J. Brunner

Pages 1–8, https://doi.org/10.1145/1513895.1513896

In this paper we describe a successful effort to accelerate the two-point angular correlation function---a basic statistics tool used in the field of cosmology to characterize the distribution of the matter and energy in the Universe---by using an ...

Metrics: Total Citations 11, Total Downloads 551, Last 12 Months 1, Last 6 Weeks 0

research-article

High performance computation and interactive display of molecular orbitals on GPUs and multi-core CPUs

John E. Stone, Jan Saam, David J. Hardy, Kirby L. Vandivort, Wen-mei W. Hwu, Klaus Schulten

Pages 9–18, https://doi.org/10.1145/1513895.1513897

The visualization of molecular orbitals (MOs) is important for analyzing the results of quantum chemistry simulations. The functions describing the MOs are computed on a three-dimensional lattice, and the resulting data can then be used for plotting ...

Metrics: Total Citations 41, Total Downloads 757, Last 12 Months 22, Last 6 Weeks 3

research-article

GPU acceleration of a production molecular docking code

Bharat Sukhwani, Martin C. Herbordt

Pages 19–27, https://doi.org/10.1145/1513895.1513898

Modeling the interactions of biological molecules, or docking, is critical to both understanding basic life processes and to designing new drugs. Here we describe the GPU-based acceleration of a recently developed, complex, production docking code. We ...

Metrics: Total Citations 32, Total Downloads 584, Last 12 Months 5, Last 6 Weeks 0

research-article

Accelerating phase unwrapping and affine transformations for optical quadrature microscopy using CUDA

Perhaad Mistry, Sherman Braganza, David Kaeli, Miriam Leeser

Pages 28–37, https://doi.org/10.1145/1513895.1513899

Optical Quadrature Microscopy (OQM) is a process which uses phase data to capture information about the sample being studied. OQM is part of an imaging framework developed by the Optical Science Laboratory at Northeastern University. In one particular ...

Metrics: Total Citations 6, Total Downloads 593, Last 12 Months 6, Last 6 Weeks 0

research-article

Performance analysis of accelerated image registration using GPGPU

Peter Bui, Jay Brockman

Pages 38–45, https://doi.org/10.1145/1513895.1513900

This paper presents a performance analysis of an accelerated 2-D rigid image registration implementation that employs the Compute Unified Device Architecture (CUDA) programming environment to take advantage of the parallel processing capabilities of ...

Metrics: Total Citations 12, Total Downloads 1,125, Last 12 Months 7, Last 6 Weeks 0

research-article

Accelerating linpack with CUDA on heterogenous clusters

Massimiliano Fatica

Pages 46–51, https://doi.org/10.1145/1513895.1513901

This paper describes the use of CUDA to accelerate the Linpack benchmark on heterogenous clusters, where both CPUs and GPUs are used in synergy with minor or no modifications to the original source code. A host library intercepts the calls to DGEMM and ...

Metrics: Total Citations 109, Total Downloads 2,321, Last 12 Months 12, Last 6 Weeks 1

research-article

hiCUDA: a high-level directive-based language for GPU programming

Tianyi David Han, Tarek S. Abdelrahman

Pages 52–61, https://doi.org/10.1145/1513895.1513902

The Compute Unified Device Architecture (CUDA) has become a de facto standard for programming NVIDIA GPUs. However, CUDA places on the programmer the burden of packaging GPU code in separate functions, of explicitly managing data transfer between the ...

Metrics: Total Citations 101, Total Downloads 1,730, Last 12 Months 23, Last 6 Weeks 0

research-article

Architecture-aware optimization targeting multithreaded stream computing

Byunghyun Jang, Synho Do, Homer Pien, David Kaeli

Pages 62–70, https://doi.org/10.1145/1513895.1513903

Optimizing program execution targeted for Graphics Processing Units (GPUs) can be very challenging. Our ability to efficiently map serial code to a GPU or stream processing platform is a time consuming task and is greatly hampered by a lack of detail ...

Metrics: Total Citations 25, Total Downloads 643, Last 12 Months 6, Last 6 Weeks 0

research-article

QR decomposition on GPUs

Andrew Kerr, Dan Campbell, Mark Richards

Pages 71–78, https://doi.org/10.1145/1513895.1513904

QR decomposition is a computationally intensive linear algebra operation that factors a matrix A into the product of a unitary matrix Q and upper triangular matrix R. Adaptive systems commonly employ QR decomposition to solve overdetermined least ...

Metrics: Total Citations 36, Total Downloads 1,011, Last 12 Months 35, Last 6 Weeks 7

research-article

3D finite difference computation on GPUs using CUDA

Paulius Micikevicius

Pages 79–84, https://doi.org/10.1145/1513895.1513905

In this paper we describe a GPU parallelization of the 3D finite difference computation using CUDA. Data access redundancy is used as the metric to determine the optimal implementation for both the stencil-only computation, as well as the discretization ...

Metrics: Total Citations 353, Total Downloads 2,853, Last 12 Months 72, Last 6 Weeks 12

research-article

Optimization of tele-immersion codes

Albert Sidelnik, I-Jui Sung, Wanmin Wu, María Jesús Garzarán, Wen-mei Hwu, Klara Nahrstedt, David Padua, Sanjay J. Patel

Pages 85–93, https://doi.org/10.1145/1513895.1513906

As computational power increases, tele-immersive applications are an emerging trend. These applications make extensive demands on computational resources through their heavy use of real-time 3D reconstruction algorithms. Since computer vision developers ...

Metrics: Total Citations 1, Total Downloads 209, Last 12 Months 2, Last 6 Weeks 0

research-article

Understanding software approaches for GPGPU reliability

Martin Dimitrov, Mike Mantor, Huiyang Zhou

Pages 94–104, https://doi.org/10.1145/1513895.1513907

Even though graphics processors (GPUs) are becoming increasingly popular for general purpose computing, current (and likely near future) generations of GPUs do not provide hardware support for detecting soft/hard errors in computation logic or memory ...

Metrics: Total Citations 112, Total Downloads 1,212, Last 12 Months 43, Last 6 Weeks 2


Contributors

David R. Kaeli
Northeastern University
- Publication years: 1991 - 2024
- Publication count: 193
- Citation count: 2,246
- Available for download: 106
- Downloads (cumulative): 61,162
- Downloads (12 months): 11,382
- Downloads (6 weeks): 1,346
- Average downloads per article: 577
- Average citations per article: 12

Miriam E Leeser
Northeastern University
- Publication years: 1986 - 2025
- Publication count: 95
- Citation count: 1,010
- Available for download: 35
- Downloads (cumulative): 33,470
- Downloads (12 months): 5,050
- Downloads (6 weeks): 598
- Average downloads per article: 956
- Average citations per article: 11


Acceptance Rates

Overall Acceptance Rate 57 of 129 submissions, 44%

Year	Submitted	Accepted	Rate
GPGPU '20	12	7	58%
GPGPU '19	15	6	40%
GPGPU-10	15	8	53%
GPGPU '16	23	9	39%
GPGPU-7	27	12	44%
GPGPU-6	37	15	41%
Overall	129	57	44%
