Toward OpenCL Automatic Multi-Device Support

Henry, Sylvain; Denis, Alexandre; Barthou, Denis; Counilh, Marie-Christine; Namyst, Raymond

doi:10.1007/978-3-319-09873-9_65

Sylvain Henry¹⁶,
Alexandre Denis¹⁷,
Denis Barthou¹⁸,
Marie-Christine Counilh¹⁸ &
…
Raymond Namyst¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8632))

Included in the following conference series:

European Conference on Parallel Processing

2984 Accesses
9 Citations

Abstract

To fully tap into the potential of today heterogeneous machines, offloading parts of an application on accelerators is no longer sufficient. The real challenge is to build systems where the application would permanently spread across the entire machine, that is, where parallel tasks would be dynamically scheduled over the full set of available processing units. In this paper we present SOCL, an OpenCL implementation that improves and simplifies the programming experience on heterogeneous architectures. SOCL enables applications to dynamically dispatch computation kernels over processing devices so as to maximize their utilization. OpenCL applications can incrementally make use of light extensions to automatically schedule kernels in a controlled manner on multi-device architectures. We demonstrate the relevance of our approach by experimenting with several OpenCL applications on a range of heterogeneous architectures. We show that performance portability is enhanced by using SOCL extensions.

Download to read the full chapter text

Chapter PDF

Automatic OpenCL Task Adaptation for Heterogeneous Architectures

Exploiting Heterogeneous Mobile Architectures Through a Unified Runtime Framework

Multi-device Controllers: A Library to Simplify Parallel Heterogeneous Programming

Article 09 December 2017

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

HSA Foundation: Heterogeneous System Architecture (2012), http://hsafoundation.com
Khronos OpenCL Working Group: The OpenCL Specification, Version 1.2 (2011)
Google Scholar
Topcuoglu, H., Hariri, S., Wu, M.Y.: Performance-effective and low-complexity task scheduling for heterogeneous computing. IEEE Transactions on Parallel and Distributed Systems 13(3), 260–274 (2002)
Article Google Scholar
Augonnet, C., Thibault, S., Namyst, R., Wacrenier, P.-A.: StarPU: a unified platform for task scheduling on heterogeneous multicore architectures. In: Sips, H., Epema, D., Lin, H.-X. (eds.) Euro-Par 2009. LNCS, vol. 5704, pp. 863–874. Springer, Heidelberg (2009)
Chapter Google Scholar
LuxRender: GPL physically based renderer (2013), http://www.luxrender.net
Intel: Hybrid HDR tone mapping for post processing multi-device version (2013), http://software.intel.com/en-us/vcsource/samples/hdr-tone-mapping-multi-device
IBM: OpenCL Common Runtime for Linux on x86 Architecture (version 0.1) (2011)
Google Scholar
Multicoreware, Inc.: GMAC: Global Memory for Accelerator, TM: Task Manager (2011), http://www.multicorewareinc.com
Kim, J., Kim, H., Lee, J.H., Lee, J.: Achieving a single compute device image in opencl for multiple gpus. In: Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, PPoPP 2011, pp. 277–288. ACM, New York (2011)
Google Scholar
de La Lama, C., Toharia, P., Bosque, J., Robles, O.: Static multi-device load balancing for opencl. In: 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications (ISPA), pp. 675–682 (2012)
Google Scholar
Spafford, K., Meredith, J., Vetter, J.: Maestro: data orchestration and tuning for OpenCL devices. In: D’Ambra, P., Guarracino, M., Talia, D. (eds.) Euro-Par 2010, Part II. LNCS, vol. 6272, pp. 275–286. Springer, Heidelberg (2010)
Chapter Google Scholar
Kim, J., Seo, S., Lee, J., Nah, J., Jo, G., Lee, J.: SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters. In: Proceedings of the 26th ACM International Conference on Supercomputing, ICS 2012, pp. 341–352. ACM, New York (2012)
Google Scholar
Grewe, D., O’Boyle, M.F.P.: A Static Task Partitioning Approach for Heterogeneous Systems Using OpenCL. In: Knoop, J. (ed.) CC 2011. LNCS, vol. 6601, pp. 286–305. Springer, Heidelberg (2011)
Chapter Google Scholar
Dolbeau, R., Bihan, S., Bodin, F.: HMPP: A hybrid Multi-core Parallel Programming Environment (2007)
Google Scholar
Wolfe, M.: Implementing the PGI accelerator model. In: GPGPU (2010)
Google Scholar
Grewe, D., Wang, Z., O’Boyle, M.F.: Portable mapping of data parallel programs to opencl for heterogeneous systems. In: ACM/IEEE International Symposium on Code Generation and Optimization, Shenzen, China (February 2013)
Google Scholar
Luk, C.K., Hong, S., Kim, H.: Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping. In: Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 42, pp. 45–55. ACM, New York (2009)
Chapter Google Scholar
Ayguadé, E., Badia, R.M., Igual, F.D., Labarta, J., Mayo, R., Quintana-Ortí, E.S.: An Extension of the StarSs Programming Model for Platforms with Multiple GPUs. In: Sips, H., Epema, D., Lin, H.-X. (eds.) Euro-Par 2009. LNCS, vol. 5704, pp. 851–862. Springer, Heidelberg (2009)
Chapter Google Scholar
Gautier, T., Besseron, X., Pigeon, L.: KAAPI: A thread scheduling runtime system for data flow computations on cluster of multi-processors. In: Proceedings of the 2007 International Workshop on Parallel Symbolic Computation, PASCO 2007, pp. 15–23. ACM, New York (2007)
Chapter Google Scholar
Boyer, M., Skadron, K., Che, S., Jayasena, N.: Load balancing in a changing world: Dealing with heterogeneity and performance variability. In: IEEE Computing Frontiers Conference (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Exascale Computing Research Laboratory, France
Sylvain Henry
Inria Bordeaux – Sud-Ouest, France
Alexandre Denis
Univ. of Bordeaux, France
Denis Barthou, Marie-Christine Counilh & Raymond Namyst

Authors

Sylvain Henry
View author publications
You can also search for this author in PubMed Google Scholar
Alexandre Denis
View author publications
You can also search for this author in PubMed Google Scholar
Denis Barthou
View author publications
You can also search for this author in PubMed Google Scholar
Marie-Christine Counilh
View author publications
You can also search for this author in PubMed Google Scholar
Raymond Namyst
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CRACS/INESC-TEC and FCUP, Universidade do Porto, Rua do Campo Alegre, 1021, 4169-007, Porto, Portugal
Fernando Silva , Inês Dutra & Vítor Santos Costa , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Henry, S., Denis, A., Barthou, D., Counilh, MC., Namyst, R. (2014). Toward OpenCL Automatic Multi-Device Support. In: Silva, F., Dutra, I., Santos Costa, V. (eds) Euro-Par 2014 Parallel Processing. Euro-Par 2014. Lecture Notes in Computer Science, vol 8632. Springer, Cham. https://doi.org/10.1007/978-3-319-09873-9_65

Download citation

DOI: https://doi.org/10.1007/978-3-319-09873-9_65
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09872-2
Online ISBN: 978-3-319-09873-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Toward OpenCL Automatic Multi-Device Support

Abstract

Chapter PDF

Similar content being viewed by others

Automatic OpenCL Task Adaptation for Heterogeneous Architectures

Exploiting Heterogeneous Mobile Architectures Through a Unified Runtime Framework

Multi-device Controllers: A Library to Simplify Parallel Heterogeneous Programming

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Toward OpenCL Automatic Multi-Device Support

Abstract

Chapter PDF

Similar content being viewed by others

Automatic OpenCL Task Adaptation for Heterogeneous Architectures

Exploiting Heterogeneous Mobile Architectures Through a Unified Runtime Framework

Multi-device Controllers: A Library to Simplify Parallel Heterogeneous Programming

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation