DOI: 10.1145/1342250.1342265

Hardware-aware analysis and optimization of stable fluids

Published: 15 February 2008

Abstract

We perform a detailed flop and bandwidth analysis of Jos Stam's Stable Fluids algorithm on the CPU, GPU, and Cell. On all three architectures, we find that the algorithm is bandwidth bound, with the cores sitting idle up to 96% of the time. Knowing this, we propose two modifications to accelerate the algorithm. The first is a Mehrstellen discretization for the pressure solver, which reduces the running time of the solver by a third. The second is a static caching scheme that eliminates roughly 99% of the random lookups in the advection stage and yields a 2x speedup in that stage. Both modifications apply equally well to all three architectures.
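The abstract does not give the stencil itself, so as background only, here is a minimal sketch of the classic two-dimensional Mehrstellen (compact nine-point) discretization of the Poisson equation, compared against the standard five-point stencil. This is an illustration under stated assumptions, not the paper's implementation: the paper targets a 3D pressure solve, and the test function `u` and the helper names `f`, `lap_f`, and `max_residuals` are hypothetical. The sketch checks truncation error only: the five-point stencil matches the right-hand side to second order in the grid spacing, while the nine-point stencil, paired with a right-hand side corrected by (h²/12)∇²f, matches to fourth order.

```python
import math


def u(x, y):
    """Manufactured solution (an assumption for this sketch)."""
    return math.sin(math.pi * x) * math.sin(math.pi * y)


def f(x, y):
    """Right-hand side of the Poisson equation: f = laplacian(u)."""
    return -2.0 * math.pi ** 2 * u(x, y)


def lap_f(x, y):
    """Analytic Laplacian of f, used by the Mehrstellen right-hand side."""
    return 4.0 * math.pi ** 4 * u(x, y)


def max_residuals(n):
    """Return (five_point, mehrstellen) max truncation errors.

    The unit square is split into n x n cells; only interior nodes are checked.
    """
    h = 1.0 / n
    err5 = err9 = 0.0
    for i in range(1, n):
        for j in range(1, n):
            x, y = i * h, j * h
            center = u(x, y)
            # Sum over the four axis-aligned neighbours.
            axis = u(x - h, y) + u(x + h, y) + u(x, y - h) + u(x, y + h)
            # Sum over the four diagonal neighbours.
            diag = (u(x - h, y - h) + u(x + h, y - h)
                    + u(x - h, y + h) + u(x + h, y + h))
            # Standard five-point Laplacian.
            five = (axis - 4.0 * center) / h ** 2
            # Compact nine-point (Mehrstellen) operator.
            nine = (4.0 * axis + diag - 20.0 * center) / (6.0 * h ** 2)
            # Mehrstellen right-hand side: f plus a correction term.
            rhs9 = f(x, y) + h ** 2 / 12.0 * lap_f(x, y)
            err5 = max(err5, abs(five - f(x, y)))
            err9 = max(err9, abs(nine - rhs9))
    return err5, err9


if __name__ == "__main__":
    for n in (16, 32):
        e5, e9 = max_residuals(n)
        print(f"n={n:3d}  five-point={e5:.3e}  Mehrstellen={e9:.3e}")
```

Halving the grid spacing should shrink the five-point error by roughly a factor of 4 and the Mehrstellen error by roughly a factor of 16. The wider stencil reads more neighbours per node, so how this accuracy gain translates into solver running time on bandwidth-bound hardware is a question for the paper itself rather than this sketch.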

Published In

I3D '08: Proceedings of the 2008 symposium on Interactive 3D graphics and games
February 2008
219 pages
ISBN:9781595939838
DOI:10.1145/1342250

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Research-article

Conference

I3D08: Symposium on Interactive 3D Graphics and Games
February 15 - 17, 2008
Redwood City, California

Acceptance Rates

Overall Acceptance Rate 148 of 485 submissions, 31%

Cited By

  • (2023) Challenges in GPU-Accelerated Nonlinear Dynamic Analysis for Structural Systems. Journal of Structural Engineering 149:3. DOI: 10.1061/JSENDH.STENG-11311. Online publication date: Mar-2023
  • (2019) Deep Fluids: A Generative Network for Parameterized Fluid Simulations. Computer Graphics Forum 38:2 (59-70). DOI: 10.1111/cgf.13619. Online publication date: 7-Jun-2019
  • (2019) Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA. Journal of Parallel and Distributed Computing 69:5 (451-460). DOI: 10.1016/j.jpdc.2009.01.006. Online publication date: 4-Jan-2019
  • (2011) Physis. Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (1-12). DOI: 10.1145/2063384.2063398. Online publication date: 12-Nov-2011
  • (2011) World-highest resolution global atmospheric model and its performance on the Earth Simulator. State of the Practice Reports (1-12). DOI: 10.1145/2063348.2063376. Online publication date: 12-Nov-2011
  • (2010) Calligraphic Video. International Journal of Creative Interfaces and Computer Graphics 1:1 (67-83). DOI: 10.4018/jcicg.2010010106. Online publication date: Jan-2010
  • (2010) Filament-based smoke with vortex shedding and variational reconnection. ACM SIGGRAPH 2010 papers (1-12). DOI: 10.1145/1833349.1778852. Online publication date: 26-Jul-2010
  • (2010) Filament-based smoke with vortex shedding and variational reconnection. ACM Transactions on Graphics 29:4 (1-12). DOI: 10.1145/1778765.1778852. Online publication date: 26-Jul-2010
  • (2010) CUDA-Based Linear Solvers for Stable Fluids. 2010 International Conference on Information Science and Applications (1-8). DOI: 10.1109/ICISA.2010.5480268. Online publication date: Apr-2010
  • (2010) A CUDA-Based Implementation of Stable Fluids in 3D with Internal and Moving Boundaries. Proceedings of the 2010 International Conference on Computational Science and Its Applications (118-128). DOI: 10.1109/ICCSA.2010.43. Online publication date: 23-Mar-2010
