SeloGPU: A Selective Off-Loading Framework for High Performance GPGPU Execution

Park, Sejin; Ma, Jeonghyeon; Park, Chanik

doi:10.1007/978-3-642-39958-9_22

Sejin Park¹⁷,
Jeonghyeon Ma¹⁷ &
Chanik Park¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7979))

Included in the following conference series:

International Conference on Parallel Computing Technologies

1310 Accesses

Abstract

In general, GPU accelerated GPGPU application results in much higher performance than CPU application. However, to be accelerated by GPU, users should have GPGPU-enabled computation resources like recent GPU or CPU in their local machine. In this paper, we proposed selective GPGPU off-loading framework named SeloGPU. SeloGPU not only supports remote off-loading for GPGPU application but also supports target node selection among multiple GPGPU-enabled computation resources. We also proposed four optimization techniques to reduce additional overhead owing to remote execution. We implemented SeloGPU using OpenCL which is open standard heterogeneous language. The experimental result shows SeloGPU can choose best target node based on the history of execution information. The four optimization techniques reduce ~87% of network transmission overhead.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Gao, W., Huyen, N.T.T., Loi, H.S., Kemao, Q.: Real-time 2D parallel windowed Fourier transform for fringe pattern analysis using Graphics Processing Unit. Opt. Express 17(25), 23147–23152 (2009)
Article Google Scholar
Lu, P.J., Oki, H., Frey, C.A., Chamitoff, G.E., et al.: Orders-of-magnitude performance increases in GPU-accelerated correlation of images from the International Space Station. Journal of Real-Time Image Processing 5(3), 179–193 (2010)
Article Google Scholar
OpenCL, http://www.khronos.org/opencl/
Duato, J., Igual, F.D., Mayo, R., Peña, A.J., Quintana-Ortí, E.S., Silla, F.: An efficient implementation of GPU virtualization in high performance clusters. In: Lin, H.-X., Alexander, M., Forsell, M., Knüpfer, A., Prodan, R., Sousa, L., Streit, A. (eds.) Euro-Par 2009. LNCS, vol. 6043, pp. 385–394. Springer, Heidelberg (2010)
Chapter Google Scholar
Shi, L., Chen, H., Sun, J., Li, K.: vCUDA: GPU-accelerated high-performance computing in virtual machines. In: IEEE International Symposium on Parallel & Distributed Processing (2009)
Google Scholar
Simon, L., Joe, J., Edd, D.: Programming Web Services with XML-RPC, 1st edn. O’Reilly (2001)
Google Scholar
Kegel, P., Michel, S., Sergei, G.: dOpenCL: Towards a uniform programming approach for distributed heterogeneous multi-/many-core systems. In: 21th International Heterogeneity in Computing Workshop (HCW 2012) (2012)
Google Scholar
Alves, A., Rufino, J., Pina, A., Santos, L.P.: clOpenCL - Supporting Distributed Heterogeneous Computing in HPC Clusters. In: Caragiannis, I., et al. (eds.) Euro-Par Workshops 2012. LNCS, vol. 7640, pp. 112–122. Springer, Heidelberg (2013)
Chapter Google Scholar
Goglin, B.: High-Performance Message Passing over generic Ethernet Hardware with Open-MX. Elsevier Journal of Parallel Comp 37(2), 85–100 (2011)
Article Google Scholar
Seymour, K., Nakada, H., Matsuoka, S., Dongarra, J., Lee, C., Casanova, H.: Overview of GridRPC: A remote procedure call API for grid computing. In: Parashar, M. (ed.) GRID 2002. LNCS, vol. 2536, pp. 274–278. Springer, Heidelberg (2002)
Chapter Google Scholar
Cuervo, E., Balasubramanian, A., Cho, D.K., Wolman, A., Saroiu, S., Chandra, R., Bahl, P.: MAUI: making smartphones last longer with code offload. In: Proceedings of the 8th International Conference on Mobile Systems, Applications, and Services, pp. 49–62. ACM (2010)
Google Scholar
Rodinia 2.2, http://www.cs.virginia.edu/~skadron/wiki/rodinia/

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, POSTECH, Pohang, South Korea
Sejin Park, Jeonghyeon Ma & Chanik Park

Authors

Sejin Park
View author publications
You can also search for this author in PubMed Google Scholar
Jeonghyeon Ma
View author publications
You can also search for this author in PubMed Google Scholar
Chanik Park
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computational Mathematics and Mathematical Geophysics, Supercomputer Department, Russian Academy of Science, 630090, Novosibirsk, Russia
Victor Malyshkin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Park, S., Ma, J., Park, C. (2013). SeloGPU: A Selective Off-Loading Framework for High Performance GPGPU Execution. In: Malyshkin, V. (eds) Parallel Computing Technologies. PaCT 2013. Lecture Notes in Computer Science, vol 7979. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39958-9_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-39958-9_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39957-2
Online ISBN: 978-3-642-39958-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics