skip to main content
10.1145/2716282acmotherconferencesBook PagePublication PagesgpgpuConference Proceedingsconference-collections
GPGPU-8: Proceedings of the 8th Workshop on General Purpose Processing using GPUs
ACM2015 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
GPGPU-8: General-purpose Processing with Graphics Processing Units 8 San Francisco CA USA 7 February 2015
ISBN:
978-1-4503-3407-5
Published:
07 February 2015
In-Cooperation:

Bibliometrics
Abstract

No abstract available.

Skip Table Of Content Section
SESSION: HPC
research-article
A comparative investigation of device-specific mechanisms for exploiting HPC accelerators

A variety of computational accelerators have been greatly improved in recent years. Intel's MIC (Many Integrated Core) and both GPU architectures, NVIDIA's Kepler and AMD's Graphics Core Next, all represent real innovations in the field of HPC. Based ...

SESSION: Cache and Shared Memory
research-article
GPU-SM: shared memory multi-GPU programming

Discrete GPUs in modern multi-GPU systems can transparently access each other's memories through the PCIe interconnect. Future systems will improve this capability by including better GPU interconnects such as NVLink. However, remote memory access ...

research-article
Adaptive GPU cache bypassing

Modern graphics processing units (GPUs) include hardware- controlled caches to reduce bandwidth requirements and energy consumption. However, current GPU cache hierarchies are inefficient for general purpose GPU (GPGPU) comput- ing. GPGPU workloads ...

research-article
Efficient utilization of GPGPU cache hierarchy

Recent GPUs are equipped with general-purpose L1 and L2 caches in an attempt to reduce memory bandwidth demand and improve the performance of some irregular GPGPU applications. However, due to the massive multithreading, GPGPU caches suffer from severe ...

SESSION: Optimization
research-article
Effects of source-code optimizations on GPU performance and energy consumption

This paper studies the effects of source-code optimizations on the performance, power draw, and energy consumption of a modern compute GPU. We evaluate 128 versions of two n-body codes: a compute-bound regular implementation and a memory-bound ...

research-article
Public Access
Optimization for performance and energy for batched matrix computations on GPUs

As modern hardware keeps evolving, an increasingly effective approach to develop energy efficient and high-performance solvers is to design them to work on many small size independent problems. Many applications already need this functionality, ...

research-article
Helium: a transparent inter-kernel optimizer for OpenCL

State of the art automatic optimization of OpenCL applications focuses on improving the performance of individual compute kernels. Programmers address opportunities for inter-kernel optimization in specific applications by ad-hoc hand tuning: manually ...

SESSION: Applications
research-article
Stochastic gradient descent on GPUs

Irregular algorithms such as Stochastic Gradient Descent (SGD) can benefit from the massive parallelism available on GPUs. However, unlike in data-parallel algorithms, synchronization patterns in SGD are quite complex. Furthermore, scheduling for scale-...

research-article
High performance computing of fiber scattering simulation

Cellulose is one of the most promising energy resources that is waiting to be tapped. Harvesting energy from cellulose requires decoding its atomic structure. Some structural information can be exposed by modeling data produced by X-ray scattering. ...

research-article
Rethinking the parallelization of random-restart hill climbing: a case study in optimizing a 2-opt TSP solver for GPU execution

Random-restart hill climbing is a common approach to combinatorial optimization problems such as the traveling salesman problem (TSP). We present and evaluate an implementation of random-restart hill climbing with 2-opt local search applied to TSP. Our ...

research-article
Forma: a DSL for image processing applications to target GPUs and multi-core CPUs

As architectures evolve, optimization techniques to obtain good performance evolve as well. Using low-level programming languages like C/C++ typically results in architecture-specific optimization techniques getting entangled with the application ...

Contributors
  • Northeastern University
  • University of Delaware

Index Terms

  1. Proceedings of the 8th Workshop on General Purpose Processing using GPUs

    Recommendations

    Acceptance Rates

    Overall Acceptance Rate57of129submissions,44%
    YearSubmittedAcceptedRate
    GPGPU '2012758%
    GPGPU '1915640%
    GPGPU-1015853%
    GPGPU '1623939%
    GPGPU-7271244%
    GPGPU-6371541%
    Overall1295744%