skip to main content
column

FPGA-based Custom Computing Architecture for Large-Scale Fluid Simulation with Building Cube Method

Published:03 December 2014Publication History
Skip Abstract Section

Abstract

We are designing a custom computing machine for large-scale flui simulation with the building-cube method (BCM). In BCM, parallel computation is performed with cubes, each of which is an orthogonal grid with a f xed resolution of cells. Although BCM is advantageous in balancing loads with cubes, it also has a problem of efficien y and scalability for comptuting with general-purpose supercomputers due to insufficien memory bandwidth and communication overhead of an interconnection network. In this paper, we present a custom computing architecture for FPGA-based scalable BCM computation with a dedicated network, called an accelerator domain network (ADN). We design a cube engine which allows bandwidth-efficien computation of cubes based on streamed stencil computation of the fractional-step method. Through prototype implementation, we evaluate the potential performance of the architecture. For ALTERA Stratix V 28nm FPGA, we estimate that a single FPGA has the peak performance of 107 GFlop/s in a single precision.

References

  1. K. Nakahashi. Building-cube method for flow problems with broadband characteristic length. Computational Fluid Dynamics, pages 77--81, 2002.Google ScholarGoogle Scholar
  2. S. Takahashi, T. Ishida, K. Nakahashi, H. Kobayashi, K. Okabe, Y. Shimomura, T. Soga, and A. Musa. Large scaled computation of incompressible flows on cartesian mesh using a vector-parallel supercomputer. Parallel Computational Fluid Dynamics, 74:331--338, 2008.Google ScholarGoogle Scholar
  3. H. Onda D. Sasaki, A. Deguchi and K. Nakahashi. Landing gear aerodynamic noise prediction using building-cube method. Modelling and Simulation in Engineering, 2012(632387):1--16, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Michael deLorimier and André DeHon. Floating-point sparse matrix-vector multiply for FPGAs. Proceedings of the International Symposium on Field-Programmable Gate Arrays, pages 75--85, February 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Ling Zhuo and Viktor K. Prasanna. Sparse matrix-vector multiplication on FPGAs. Proceedings of the International Symposium on Field-Programmable Gate Arrays, pages 63--74, February 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Yong Dou, S. Vassiliadis, G. K. Kuzmanov, and G. N. Gaydadjiev. 64-bit floating-poin FPGA matrix multiplication. Proceedings of the International Symposium on Field-Programmable Gate Arrays, pages 86--95, February 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Kentaro Sano, Takanori Iizuka, and Satoru Yamamoto. Systolic architecture for computational flui dynamics on FPGAs. Proceedings of the IEEE Symposium on Field-Programmable Custom Computing Machines, pages 107--116, April 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Murtaza, A.G. Hoekstra, and P.M. Sloot. Compute bound and I/O bound cellular automata simulations on FPGA logic. ACM Transactions on Reconfiurable Technology and Systems, 1(4), January 2009. Article 23. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Kentaro Sano, WANG Luzhou, Yoshiaki Hatsuda, Takanori Iizuka, and Satoru Yamamoto. FPGA-array with bandwidth-reduction mechanism for scalable and power-efficient numerical simulations based on finit difference methods. ACM Transactions on Reconfigurable Technology and Systems, 3(4), November 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Kentaro Sano, Yoshiaki Hatsuda, and Satoru Yamamoto. Multi-FPGA accelerator for scalable stencil computation with constant memory-bandwidth. IEEE Transaction on Parallel and Distributed Systems, 25(3):695--705, March 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kentaro Sano, Yoshiaki Kono, Hayato Suzuki, Ryotaro Chiba, Ryo Ito, Kyo Koizumi, and Satoru Yamamoto. Efficient custom computing of fully-streamed lattice boltzmann method on tightly-coupled FPGA cluster. ACM SIGARCH Computer Architecture News, 2013. To appear. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Kim and P. Moin. Application of a fractional-step method to incompressible navier-stokes. Journal of Computational Physics, 59:308--323, June 1985.Google ScholarGoogle ScholarCross RefCross Ref
  13. John C. Strikwerda and Young S. Lee. The accuracy of the fractional step method. SIAM Journal on Numerical Analysis, 37(1):37--47, November 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Louis A. Hageman and David M. Young. Applied Iterative Methods. Academic Press, 1981.Google ScholarGoogle Scholar
  15. Terasic Technologies. http://www.terasic.com, 2014.Google ScholarGoogle Scholar
  16. Altera Corporation. http://www.altera.com/literature/, 2014.Google ScholarGoogle Scholar
  17. FloPoCo project. http://flopoco.gfo ge.inria.fr, 2014.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader