Skip to main content

SeloGPU: A Selective Off-Loading Framework for High Performance GPGPU Execution

  • Conference paper
Parallel Computing Technologies (PaCT 2013)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7979))

Included in the following conference series:

  • 1310 Accesses

Abstract

In general, GPU accelerated GPGPU application results in much higher performance than CPU application. However, to be accelerated by GPU, users should have GPGPU-enabled computation resources like recent GPU or CPU in their local machine. In this paper, we proposed selective GPGPU off-loading framework named SeloGPU. SeloGPU not only supports remote off-loading for GPGPU application but also supports target node selection among multiple GPGPU-enabled computation resources. We also proposed four optimization techniques to reduce additional overhead owing to remote execution. We implemented SeloGPU using OpenCL which is open standard heterogeneous language. The experimental result shows SeloGPU can choose best target node based on the history of execution information. The four optimization techniques reduce ~87% of network transmission overhead.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gao, W., Huyen, N.T.T., Loi, H.S., Kemao, Q.: Real-time 2D parallel windowed Fourier transform for fringe pattern analysis using Graphics Processing Unit. Opt. Express 17(25), 23147–23152 (2009)

    Article  Google Scholar 

  2. Lu, P.J., Oki, H., Frey, C.A., Chamitoff, G.E., et al.: Orders-of-magnitude performance increases in GPU-accelerated correlation of images from the International Space Station. Journal of Real-Time Image Processing 5(3), 179–193 (2010)

    Article  Google Scholar 

  3. OpenCL, http://www.khronos.org/opencl/

  4. Duato, J., Igual, F.D., Mayo, R., Peña, A.J., Quintana-Ortí, E.S., Silla, F.: An efficient implementation of GPU virtualization in high performance clusters. In: Lin, H.-X., Alexander, M., Forsell, M., Knüpfer, A., Prodan, R., Sousa, L., Streit, A. (eds.) Euro-Par 2009. LNCS, vol. 6043, pp. 385–394. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  5. Shi, L., Chen, H., Sun, J., Li, K.: vCUDA: GPU-accelerated high-performance computing in virtual machines. In: IEEE International Symposium on Parallel & Distributed Processing (2009)

    Google Scholar 

  6. Simon, L., Joe, J., Edd, D.: Programming Web Services with XML-RPC, 1st edn. O’Reilly (2001)

    Google Scholar 

  7. Kegel, P., Michel, S., Sergei, G.: dOpenCL: Towards a uniform programming approach for distributed heterogeneous multi-/many-core systems. In: 21th International Heterogeneity in Computing Workshop (HCW 2012) (2012)

    Google Scholar 

  8. Alves, A., Rufino, J., Pina, A., Santos, L.P.: clOpenCL - Supporting Distributed Heterogeneous Computing in HPC Clusters. In: Caragiannis, I., et al. (eds.) Euro-Par Workshops 2012. LNCS, vol. 7640, pp. 112–122. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  9. Goglin, B.: High-Performance Message Passing over generic Ethernet Hardware with Open-MX. Elsevier Journal of Parallel Comp 37(2), 85–100 (2011)

    Article  Google Scholar 

  10. Seymour, K., Nakada, H., Matsuoka, S., Dongarra, J., Lee, C., Casanova, H.: Overview of GridRPC: A remote procedure call API for grid computing. In: Parashar, M. (ed.) GRID 2002. LNCS, vol. 2536, pp. 274–278. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  11. Cuervo, E., Balasubramanian, A., Cho, D.K., Wolman, A., Saroiu, S., Chandra, R., Bahl, P.: MAUI: making smartphones last longer with code offload. In: Proceedings of the 8th International Conference on Mobile Systems, Applications, and Services, pp. 49–62. ACM (2010)

    Google Scholar 

  12. Rodinia 2.2, http://www.cs.virginia.edu/~skadron/wiki/rodinia/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Park, S., Ma, J., Park, C. (2013). SeloGPU: A Selective Off-Loading Framework for High Performance GPGPU Execution. In: Malyshkin, V. (eds) Parallel Computing Technologies. PaCT 2013. Lecture Notes in Computer Science, vol 7979. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39958-9_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-39958-9_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-39957-2

  • Online ISBN: 978-3-642-39958-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics