accULL : An OpenACC Implementation with CUDA and OpenCL Support

Reyes, Ruymán; López-Rodríguez, Iván; Fumero, Juan J.; de Sande, Francisco

doi:10.1007/978-3-642-32820-6_86

accULL: An OpenACC Implementation with CUDA and OpenCL Support

Ruymán Reyes¹⁹,
Iván López-Rodríguez¹⁹,
Juan J. Fumero¹⁹ &
…
Francisco de Sande¹⁹

Conference paper

3428 Accesses
40 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7484))

Abstract

The irruption in the HPC scene of hardware accelerators, like GPUs, has made available unprecedented performance to developers. However, even expert developers may not be ready to exploit the new complex processor hierarchies. We need to find a way to leverage the programming effort in these devices at programming language level, otherwise, developers will spend most of their time focusing on device-specific code instead of implementing algorithmic enhancements. The recent advent of the OpenACC standard for heterogeneous computing represents an effort in this direction. This initiative, combined with future releases of the OpenMP standard, will converge into a fully heterogeneous framework that will cope the programming requirements of future computer architectures. In this work we present accULL, a novel implementation of the OpenACC standard, based on the combination of a source to source compiler and a runtime library. To our knowledge, our approach is the first providing support for both OpenCL and CUDA platforms under this new standard.

This work has been partially supported by the EU (FEDER), the Spanish MEC (Plan Nacional de I+D+I, contracts TIN2008-06570-C04-03 and TIN2011-24598), HPC-EUROPA2 (project number 228398) and the Canary Islands Government, ACIISI (contract PI2008/285).

Download to read the full chapter text

Chapter PDF

References

Bihan, F.B.S.: Heterogeneous multicore parallel programming for graphics processing units. Sci. Program. 17, 325–336 (2009)
Google Scholar
Che, S., Sheaffer, J.W., Boyer, M., Szafaryn, L.G., Wang, L., Skadron, K.: A characterization of the rodinia benchmark suite with comparison to contemporary cmp workloads. In: Proceedings of the IEEE International Symposium on Workload Characterization, IISWC 2010, pp. 1–11. IEEE Computer Society, Washington, DC (2010)
Google Scholar
Giménez, J., Labarta, J., Pegenaute, F.X., Wen, H.-F., Klepacki, D., Chung, I.-H., Cong, G., Voigtländer, F., Mohr, B.: Guided Performance Analysis Combining Profile and Trace Tools. In: Guarracino, M.R., Vivien, F., Träff, J.L., Cannatoro, M., Danelutto, M., Hast, A., Perla, F., Knüpfer, A., Di Martino, B., Alexander, M. (eds.) Euro-Par-Workshop 2010. LNCS, vol. 6586, pp. 513–521. Springer, Heidelberg (2011)
Chapter Google Scholar
OpenACC directives for accelerators (2011), http://www.openacc-standard.org
Reyes, R., de Sande, F.: Optimization strategies in different CUDA architectures using. Microprocessors and Microsystems - Embedded Hardware Design 36(2), 78–87 (2012)
Article Google Scholar
Wolfe, M.: Implementing the PGI accelerator model. In: Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, GPGPU 2010, pp. 43–50. ACM, New York (2010)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Dept. de E.I.O. y Computación, Universidad de La Laguna, 38271, La Laguna, Spain
Ruymán Reyes, Iván López-Rodríguez, Juan J. Fumero & Francisco de Sande

Authors

Ruymán Reyes
View author publications
You can also search for this author in PubMed Google Scholar
Iván López-Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Juan J. Fumero
View author publications
You can also search for this author in PubMed Google Scholar
Francisco de Sande
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Patras, Computer Technology Institute and Press “Diophantus”,, N. Kazantzaki, 26504, Rio, Greece
Christos Kaklamanis
University of Patras, University Building B, 26504, Rio, Greece
Theodore Papatheodorou
Computer Technology Institute and Press “Diophantus”, University of Patras, N. Kazantzaki, 26504, Rio, Greece
Paul G. Spirakis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Reyes, R., López-Rodríguez, I., Fumero, J.J., de Sande, F. (2012). accULL: An OpenACC Implementation with CUDA and OpenCL Support. In: Kaklamanis, C., Papatheodorou, T., Spirakis, P.G. (eds) Euro-Par 2012 Parallel Processing. Euro-Par 2012. Lecture Notes in Computer Science, vol 7484. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32820-6_86

Download citation

DOI: https://doi.org/10.1007/978-3-642-32820-6_86
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32819-0
Online ISBN: 978-3-642-32820-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics