Improving the ratio of memory operations to floating-point operations in loops
|
journal
|
November 1994 |
A framework for hybrid parallel flow simulations with a trillion cells in complex geometries
- Godenschwager, Christian; Schornbaum, Florian; Bauer, Martin
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '13)
https://doi.org/10.1145/2503210.2503273
|
conference
|
January 2013 |
11 PFLOP/s simulations of cloud cavitation collapse
- Rossinelli, Diego; Koumoutsakos, Petros; Hejazialhosseini, Babak
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '13)
https://doi.org/10.1145/2503210.2504565
|
conference
|
January 2013 |
Understanding Application Performance via Micro-benchmarks on Three Large Supercomputers: Intrepid, Ranger and Jaguar
|
journal
|
May 2010 |
Estimating interlock and improving balance for pipelined architectures
|
journal
|
August 1988 |
Multicore/Multi-GPU Accelerated Simulations of Multiphase Compressible Flows Using Wavelet Adapted Grids
|
journal
|
January 2011 |
I/O complexity: The red-blue pebble game
|
conference
|
January 1981 |
A Study on Balancing Parallelism, Data Locality, and Recomputation in Existing PDE Solvers
- Olschanowsky, Catherine; Strout, Michelle Mills; Guzik, Stephen
SC14: International Conference for High Performance Computing, Networking, Storage and Analysis
https://doi.org/10.1109/SC.2014.70
|
conference
|
November 2014 |
Performance evaluations of gyrokinetic Eulerian code GT5D on massively parallel multi-core platforms
|
conference
|
January 2011 |
An Optimizing Code Generator for a Class of Lattice-Boltzmann Computations
- Pananilath, Irshad; Acharya, Aravind; Vasista, Vinay
ACM Transactions on Architecture and Code Optimization, Vol. 12, Issue 2
https://doi.org/10.1145/2739047
|
journal
|
July 2015 |
Comparison of accurate methods for the integration of hyperbolic equations
|
journal
|
January 1972 |
Quantifying Performance Bottlenecks of Stencil Computations Using the Execution-Cache-Memory Model
|
conference
|
January 2015 |
Essentially non-oscillatory and weighted essentially non-oscillatory schemes for hyperbolic conservation laws
|
book
|
January 1998 |
Compiler-Directed Transformation for Higher-Order Stencils
|
conference
|
May 2015 |
Communication lower bounds and optimal algorithms for numerical linear algebra
|
journal
|
May 2014 |
High-order finite-volume methods for hyperbolic conservation laws on mapped multiblock grids
|
journal
|
May 2015 |
On the GPU performance of cell-centered finite volume method over unstructured tetrahedral meshes
|
conference
|
January 2013 |
Weighted Essentially Non-oscillatory Schemes
|
journal
|
November 1994 |
Fully multidimensional flux-corrected transport algorithms for fluids
|
journal
|
June 1979 |
Optimization of a lattice Boltzmann computation on state-of-the-art multicore platforms
|
journal
|
September 2009 |
Solving the compressible Navier-Stokes equations on up to 1.97 million cores and 4.1 trillion grid points
- Bermejo-Moreno, Iván; Bodart, Julien; Larsson, Johan
SC13: International Conference for High Performance Computing, Networking, Storage and Analysis
https://doi.org/10.1145/2503210.2503265
|
conference
|
November 2013 |
A 30 Year Retrospective on Dennard's MOSFET Scaling Paper
|
journal
|
January 2007 |
A high-order finite-volume method for conservation laws on locally refined grids
|
journal
|
January 2011 |
The Piecewise Parabolic Method (PPM) for gas-dynamical simulations
|
journal
|
April 1984 |
Strong Stability-Preserving High-Order Time Discretization Methods
|
journal
|
January 2001 |
Hierarchical N-body Simulations with Autotuning for Heterogeneous Systems
|
journal
|
May 2012 |
A performance analysis framework for identifying potential benefits in GPGPU applications
|
journal
|
September 2012 |
Performance modeling of serial and parallel implementations of the fractional Adams-Bashforth-Moulton method
|
journal
|
June 2014 |
Managing application complexity in the SAMRAI object-oriented framework
|
journal
|
January 2002 |
Efficient Implementation of Weighted ENO Schemes
|
journal
|
June 1996 |
High-order, finite-volume methods in mapped coordinates
|
journal
|
April 2011 |
High throughput software for direct numerical simulations of compressible two-phase flows
- Hejazialhosseini, Babak; Rossinelli, Diego; Conti, Christian
2012 SC - International Conference for High Performance Computing, Networking, Storage and Analysis
https://doi.org/10.1109/SC.2012.66
|
conference
|
November 2012 |
Roofline: an insightful visual performance model for multicore architectures
|
journal
|
April 2009 |
An efficient mixed-precision, hybrid CPU–GPU implementation of a nonlinearly implicit one-dimensional particle-in-cell algorithm
|
journal
|
June 2012 |
Flux-corrected transport. I. SHASTA, a fluid transport algorithm that works
|
journal
|
January 1973 |