FPGA-based Custom Computing Architecture for Large-Scale Fluid Simulation with Building Cube Method

Authors:
Kentaro Sano, Ryotaro Chiba, Tomoya Ueno, Hayato Suzuki, Ryo Ito, Satoru Yamamoto

Tohoku University, Sendai, Japan (all authors)

ACM SIGARCH Computer Architecture News, Volume 42, Issue 4, September 2014, pp. 45–50. https://doi.org/10.1145/2693714.2693723

Published: 3 December 2014

Abstract

We are designing a custom computing machine for large-scale fluid simulation with the building-cube method (BCM). In BCM, parallel computation is performed on cubes, each of which is an orthogonal grid with a fixed resolution of cells. Although BCM makes it easy to balance loads across cubes, it suffers from poor efficiency and scalability on general-purpose supercomputers because of insufficient memory bandwidth and the communication overhead of the interconnection network. In this paper, we present a custom computing architecture for FPGA-based scalable BCM computation with a dedicated network, called an accelerator domain network (ADN). We design a cube engine that allows bandwidth-efficient computation of cubes based on streamed stencil computation of the fractional-step method. Through a prototype implementation, we evaluate the potential performance of the architecture. For an Altera Stratix V 28 nm FPGA, we estimate that a single FPGA has a peak performance of 107 GFlop/s in single precision.
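To make the idea of streamed stencil computation concrete, the following is a minimal sketch and not the authors' cube engine. It assumes a simplified 2D 5-point averaging stencil on an n × n grid (the paper targets the fractional-step method, whose actual stencils are not given in the abstract), and the function names `stencil_direct` and `stencil_streamed` are hypothetical. The point it illustrates is that when cells arrive in raster order, one cell per step, a buffer of 2n + 1 values is enough to compute each interior cell, so every input value is read from memory only once.

```python
from collections import deque


def stencil_direct(grid):
    """Reference version: 5-point average on interior cells, random access to the full grid."""
    n = len(grid)
    out = [row[:] for row in grid]
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            out[i][j] = 0.25 * (grid[i - 1][j] + grid[i + 1][j]
                                + grid[i][j - 1] + grid[i][j + 1])
    return out


def stencil_streamed(stream, n):
    """Streamed version: consumes cells in raster order and keeps only 2*n + 1 values.

    Boundary cells are passed through unchanged (an assumption of this sketch).
    Returns the results as a flat list in raster order.
    """
    buf = deque(maxlen=2 * n + 1)  # sliding window over the input stream
    out = []
    for k, value in enumerate(stream):
        buf.append(value)
        c = k - n  # cell whose south neighbour has just arrived
        if c < 0:
            continue
        i, j = divmod(c, n)
        if 0 < i < n - 1 and 0 < j < n - 1:
            # The window is full here: it holds cells c-n .. c+n.
            north, south = buf[0], buf[2 * n]
            west, east = buf[n - 1], buf[n + 1]
            out.append(0.25 * (north + south + west + east))
        else:
            # Centre sits at index c until the window fills, then at index n.
            out.append(buf[min(c, n)])
    # The last row never sees a south neighbour; flush it unchanged.
    out.extend(list(buf)[-n:])
    return out


if __name__ == "__main__":
    n = 5
    grid = [[float(i * i + 3 * j) for j in range(n)] for i in range(n)]
    flat = [v for row in grid for v in row]
    streamed = stencil_streamed(iter(flat), n)
    direct = [v for row in stencil_direct(grid) for v in row]
    print(all(abs(a - b) < 1e-12 for a, b in zip(streamed, direct)))
```

In hardware the window would typically be a shift register or on-chip memory rather than a software queue; the sketch only shows why the required buffer size depends on the grid width and not on the total number of cells.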


Published in
ACM SIGARCH Computer Architecture News, Volume 42, Issue 4
HEART '14
September 2014
99 pages
ISSN: 0163-5964
DOI: 10.1145/2693714
Editor: Doug DeGroot
Copyright © 2014 Authors
Publisher: Association for Computing Machinery, New York, NY, United States

Author Tags
FPGA, architecture, building cube method, custom computing, fluid simulation