short-paper

Data Distribution Method for Fast Giga-scale Hologram Generation on a Multi-GPU Cluster

Authors:
Takanobu Baba

Utsunomiya University, Utsunomiya, Japan

Utsunomiya University, Utsunomiya, Japan
View Profile

,
Shinpei Watanabe

Utsunomiya University, Utsunomiya, Japan

Utsunomiya University, Utsunomiya, Japan
View Profile

,
Boaz Jessie Jackin

NICT, Koganei, Japan

NICT, Koganei, Japan
View Profile

,
Kanemitsu Ootsu

Utsunomiya University, Utsunomiya, Japan

Utsunomiya University, Utsunomiya, Japan
View Profile

,
Takeshi Ohkawa

Utsunomiya Universit, Utsunomiya, Japan

Utsunomiya Universit, Utsunomiya, Japan
View Profile

,
Takashi Yokota

Utsunomiya University, Utsunomiya, Japan

Utsunomiya University, Utsunomiya, Japan
View Profile

,
Yoshio Hayasaki

Utsunomiya University, Utsunomiya, Japan

Utsunomiya University, Utsunomiya, Japan
View Profile

,
Toyohiko Yatagai

Utsunomiya University, Utsunomiya, Japan

Utsunomiya University, Utsunomiya, Japan
View Profile

ApPLIED '18: Proceedings of the 2018 Workshop on Advanced Tools, Programming Languages, and PLatforms for Implementing and Evaluating Algorithms for Distributed systemsJuly 2018Pages 37–40https://doi.org/10.1145/3231104.3231105

Published:23 July 2018Publication History

ApPLIED '18: Proceedings of the 2018 Workshop on Advanced Tools, Programming Languages, and PLatforms for Implementing and Evaluating Algorithms for Distributed systems

Pages 37–40

ABSTRACT

The 3D holographic display has long been expected as a future human interface as it does not require users to wear special devices. However, in addition to the delay of display device technology, its heavy computation requirement prevents the realization of such displays. A recent study says that objects and holograms with several giga-pixels should be processed in real time for the realization of high resolution and wide view angle. To this problem, first, we have proposed a new data distribution method that utilizes a basic FFT-based O(N log N) computation but does not need any inter-node communications during the computation on a multi-GPU cluster. Then, we have implemented the method on a multi-GPU cluster, applying several single-node and multi-node optimization and parallelization techniques. The experimental results show that the intra-node optimizations attain 11.52 times speed-up from the original single node code. Further, multi-node optimizations using 8 nodes, 2 GPUs per node, attain the execution time of 4.28 sec. for generating 1.6 giga-pixel hologram from 3.2 giga-pixel object. It means 237.92 times speed-up of the sequential processing by CPU using a conventional FFT-based algorithm.

References

T. Baba, S. Watanabe, B.J. Jackin, T. Ohkawa, K. Ootsu, T. Yokota, Y. Hayasaki, and T. Yatagai. 2018. Overcoming the difficulty of large-scale CGH generation on multi-GPU cluster,. In Proc. the 11th Workshop on General Purpose GPUs. 13--21. Vienna, Austria. Google ScholarDigital Library
D.G. Curry, G. Martinse, and D.G. Hopper. 2003. Capability of the human visual system. In Proc. SPIE, Vol. 5080.Google Scholar
B.J. Jackin, H. Miyata, T. Ohkawa, K. Ootsu, T. Yokota, Y. Hayasakiand T. Yatagai, and T. Baba. 2014. Distributed caluculation method for large-pixel-number holograms by decomposition of object and hologram planes. In Optics Letters, Vol. 39. 6867--6870.Google ScholarCross Ref
B.J. Jackin, S. Watanabe, K. Ootsu, T. Ohkawa, T. Yokota, Y. Hayasaki, T. Yatagai, and T. Baba. 2018. Decomposition method for fast computation of gigapixel-sized Fresnel holograms on a graphics processing unit cluster. In Applied Optics, Vol. 57. 3134--3145.Google ScholarCross Ref
H. Niwase, M. Fujiwara, H. Araki, Y. Maeda, H. Nakayama, T. Kakue, T. Shimobaba, T. Ito, and N. Takada. 2015. Fast computation of computer-generated hologram using multi-GPU cluster system for a single spatial light modulator. In Forum on Information Technology, Vol. 14. 41--44.Google Scholar
NVIDIA. 2016. CUDA C PROGRAMMING GUIDE NVIDIA.Google Scholar
L. Onural, F. Yaras., and H. Kang. 2011. Digiral Holographic Three-Dimensional Video Displays. In Proc. IEEE 99. 576--589.Google Scholar
Open MPI 2017. Open Source High Performance Computing. https://www. open-mpi.org/Google Scholar
R.B.A. Tanjung, X. Xu, X. Liang, S. Solanki, F. Farbiz Y. Pan, B. Xu, and T-C. Chong. 2010. Digital holographic three-dimensional display of 50-Mpixel holograms using a two-axis scanning mirror device. In Optical Engineering, Vol. 49(2).Google Scholar
S. Watanabe, B.J. Jackin, T. Ohkawa, K. Ootsu, T. Yokota, Y. Hayasaki, T. Yatagai, and T. Baba. 2017. Acceleration of large-scale CGH generation using multi-GPU cluster,. In Proc. Workshop on Advances in Networking and Computing. 589--593.Google Scholar
Y. Zhang, J. Liu, X. Li, and Y. Wang. 2016. Fast processing method to generate gigabyte computer generated holography for three-dimensional dynamic holographic display. In Chinese Optics Letters. 030901--1--030901--5.Google Scholar
Y. Zhao, L. Cao, H. Zhang, D. Kong, and G. Jin. 2015. Accurate calculation of Computer-generated holograms using angular-spectrum layer-oriented method. In Optics Express, Vol. 23.Google Scholar

Recommendations

Overcoming the difficulty of large-scale CGH generation on multi-GPU cluster
GPGPU-11: Proceedings of the 11th Workshop on General Purpose GPUs

The 3D holographic display has long been expected as a future human interface as it does not require users to wear special devices. However, its heavy computation requirement prevents the realization of such displays. A recent study says that objects ...
Read More
A Jacobi_PCG solver for sparse linear systems on multi-GPU cluster

The General Purpose Graphics Processing Unit (GPGPU or GPU) has powerful float-point computation ability and is suitable for intensive computing, such as solving large linear systems. The Jacobi Preconditioned Conjugate Gradient method (Jacobi_PCG or ...
Read More
Multi-GPU performance of incompressible flow computation by lattice Boltzmann method on GPU cluster

GPGPU has drawn much attention on accelerating non-graphic applications. The simulation by D3Q19 model of the lattice Boltzmann method was executed successfully on multi-node GPU cluster by using CUDA programming and MPI library. The GPU code runs on ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ApPLIED '18: Proceedings of the 2018 Workshop on Advanced Tools, Programming Languages, and PLatforms for Implementing and Evaluating Algorithms for Distributed systems
July 2018
54 pages
ISBN:9781450357753
DOI:10.1145/3231104
General Chairs:
Chryssis Georgiou
University of Cyprus, Cyprus
,
Elad M. Schiller
Chalmers University of Technology, Sweden
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 July 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
data distribution method
large-scale hologram
multi-gpu cluster
Qualifiers
- short-paper
Conference

Acceptance Rates
ApPLIED '18 Paper Acceptance Rate3of4submissions,75%Overall Acceptance Rate3of4submissions,75%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 81
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Data Distribution Method for Fast Giga-scale Hologram Generation on a Multi-GPU Cluster

ApPLIED '18: Proceedings of the 2018 Workshop on Advanced Tools, Programming Languages, and PLatforms for Implementing and Evaluating Algorithms for Distributed systems

ABSTRACT

References

Cited By

Recommendations

Overcoming the difficulty of large-scale CGH generation on multi-GPU cluster

A Jacobi_PCG solver for sparse linear systems on multi-GPU cluster

Multi-GPU performance of incompressible flow computation by lattice Boltzmann method on GPU cluster