JCUDA: A Programmer-Friendly Interface for Accelerating Java Programs with CUDA

Yan, Yonghong; Grossman, Max; Sarkar, Vivek

doi:10.1007/978-3-642-03869-3_82

Yonghong Yan¹⁷,
Max Grossman¹⁷ &
Vivek Sarkar¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5704))

Included in the following conference series:

European Conference on Parallel Processing

1974 Accesses

Abstract

A recent trend in mainstream desktop systems is the use of general-purpose graphics processor units (GPGPUs) to obtain order-of-magnitude performance improvements. CUDA has emerged as a popular programming model for GPGPUs for use by C/C++ programmers. Given the widespread use of modern object-oriented languages with managed runtimes like Java and C#, it is natural to explore how CUDA-like capabilities can be made accessible to those programmers as well. In this paper, we present a programming interface called JCUDA that can be used by Java programmers to invoke CUDA kernels. Using this interface, programmers can write Java codes that directly call CUDA kernels, and delegate the responsibility of generating the Java-CUDA bridge codes and host-device data transfer calls to the compiler. Our preliminary performance results show that this interface can deliver significant performance improvements to Java programmers. For future work, we plan to use the JCUDA interface as a target language for supporting higher level parallel programming languages like X10 and Habanero-Java.

Download to read the full chapter text

Chapter PDF

Boosting Java Performance Using GPGPUs

Portable C++ Code that can Look and Feel Like Fortran Code with Yet Another Kernel Launcher (YAKL)

Article Open access 08 December 2022

A Case Study for Performance Portability Using OpenMP 4.5

References

Nickolls, J., Buck, I., Garland, M., Nvidia, Skadron, K.: Scalable Parallel Programming with CUDA. ACM Queue 6(2), 40–53 (2008)
Article Google Scholar
Java Grande Forum Panel, Java Grande Forum Report: Making Java Work for High-End Computing, Java Grande Forum, SC 1998, Tech. Rep. (November 1998)
Google Scholar
Bull, J.M., Smith, L.A., Pottage, L., Freeman, R.: Benchmarking Java Against C and Fortran for Scientific Applications. In: Proceedings of the 2001 joint ACM-ISCOPE Conference on Java Grande, pp. 97–105. ACM Press, New York (2001)
Chapter Google Scholar
Smith, L.A., Bull, J.M., Obdrzálek, J.: A Parallel Java Grande Benchmark Suite. In: Proceedings of the 2001 ACM/IEEE conference on Supercomputing, p. 8. ACM Press, New York (2001)
Chapter Google Scholar
SciMark Java Benchmark for Scientific and Numerical Computing, http://math.nist.gov/scimark2/
Liang, S.: Java Native Interface: Programmer’s Guide and Specification. Sun Microsystems (1999)
Google Scholar
Carpenter, B., Getov, V., Judd, G., Skjellum, A., Fox, G.: MPJ: MPI-Like Message Passing for Java. Concurrency - Practice and Experience 12(11), 1019–1038 (2000)
Article MATH Google Scholar
AMD, ATI Stream Computing - Technical Overview. AMD, Tech. Rep. (2008)
Google Scholar
Khronos OpenCL Working Group, The OpenCL Specification - Version 1.0. The Khronos Group, Tech. Rep. (2009)
Google Scholar
Nystrom, N., Clarkson, M.R., Myers, A.C.: Polyglot: An Extensible Compiler Framework for Java. In: Kahng, H.-K. (ed.) ICOIN 2003. LNCS, vol. 2662, pp. 138–152. Springer, Heidelberg (2003)
Google Scholar
Moreira, J.E., Midkiff, S.P., Gupta, M., Artigas, P.V., Snir, M., Lawrence, R.D.: Java Programming for High-Performance Numerical Computing. IBM Systems Journal 39(1), 21–56 (2000)
Article Google Scholar
Charles, P., Grothoff, C., Saraswat, V., Donawa, C., Kielstra, A., Ebcioglu, K., von Praun, C., Sarkar, V.: X10: an Object-Oriented Approach to Non-Uniform Cluster Computing. In: OOPSLA 2005: Proceedings of the 20th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, pp. 519–538. ACM, New York (2005)
Chapter Google Scholar
NVIDIA, NVIDIA CUDA Programming Guide 2.2.plus 0.5em minus 0.4emNVIDIA (2009), http://www.nvidia.com/cuda
Java Binding for NVIDIA CUDA BLAS and FFT Implementation, http://www.jcuda.de
PyCuda, http://documen.tician.de/pycuda/
Matthew Monteyne, RapidMind Multi-Core Development Platform. RapidMind Inc., Tech. Rep (2008)
Google Scholar
Lee, S., Min, S.-J., Eigenmann, R.: Openmp to gpgpu: a compiler framework for automatic translation and optimization. In: PPoPP 2009: Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, pp. 101–110. ACM, New York (2009)
Google Scholar
Leung, A.C.-W.: Thesis: Automatic Parallelization for Graphics Processing Units in JikesRVM. University of Waterloo, Tech. Rep. (2008)
Google Scholar
Alpern, B., Augart, S., Blackburn, S.M., Butrico, M., Cocchi, A., Cheng, P., Dolby, J., Fink, S., Grove, D., Hind, M., McKinley, K.S., Mergen, M., Moss, J.E.B., Ngo, T., Sarkar, V.: The Jikes Research Virtual Machine Project: Building an Open-Source Research Community. IBM Systems Journal 44(2), 399–417 (2005)
Article Google Scholar
Habanero Multicore Software Project, http://habanero.rice.edu
Shirako, J., Kasahara, H., Sarkar, V.: Language Extensions in Support of Compiler Parallelization. In: Adve, V., Garzarán, M.J., Petersen, P. (eds.) LCPC 2007. LNCS, vol. 5234, pp. 78–94. Springer, Heidelberg (2008)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Rice University, USA
Yonghong Yan, Max Grossman & Vivek Sarkar

Authors

Yonghong Yan
View author publications
You can also search for this author in PubMed Google Scholar
Max Grossman
View author publications
You can also search for this author in PubMed Google Scholar
Vivek Sarkar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Software Technology, Delft University of Technology, Mekelweg 4, 2628, Delft, CD, The Netherlands
Henk Sips , Dick Epema & Hai-Xiang Lin , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yan, Y., Grossman, M., Sarkar, V. (2009). JCUDA: A Programmer-Friendly Interface for Accelerating Java Programs with CUDA. In: Sips, H., Epema, D., Lin, HX. (eds) Euro-Par 2009 Parallel Processing. Euro-Par 2009. Lecture Notes in Computer Science, vol 5704. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03869-3_82

Download citation

DOI: https://doi.org/10.1007/978-3-642-03869-3_82
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03868-6
Online ISBN: 978-3-642-03869-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics