ABSTRACT
For modern SoCs used in mobile devices, it is vital to focus on the processing efficiency through leveraging a heterogeneous potential of the architecture. In this tutorial, we offer a hands-on experience with existing APIs for accelerating compute-intensive portions of a mobile application.
Specifically, as a first step we introduce the essentials of the most popular general Compute APIs available in the mobile domain, including RenderScript*, OpenCL*, GLES pixel and recently compute shaders (plus quick comparison to more vendor-specific APIs like CUDA* and Metal*). We continue with medium-complexity topics like example API-specific performance tricks in action.
Finally, we touch advanced aspects like tools-assisted performance analysis. General focus is on the changes required for the typical user code to leverage good GPU acceleration.
Index Terms
- Compute for mobile devices: performance-focused hands-on tutorial
Recommendations
High-performance code optimizations for mobile devices
Mobile devices have seen their performance increased in latest years due to improvements on System on Chip technologies. These shared memory systems now integrate multicore CPUs and accelerators, and obtaining the optimal performance from such ...
A Comparison of Graphics Processor Architectures for RFID Simulation
NBIS '14: Proceedings of the 2014 17th International Conference on Network-Based Information SystemsGraphics Processing Units (GPUs) have a huge number of cores to speed up graphical computations and they are being used in a wide area of general-purpose applications that require high performances. In this paper, GPU computing is exploited to model the ...
Achieving a single compute device image in OpenCL for multiple GPUs
PPoPP '11In this paper, we propose an OpenCL framework that combines multiple GPUs and treats them as a single compute device. Providing a single virtual compute device image to the user makes an OpenCL application written for a single GPU portable to the ...
Comments