Performance comparison of different parallel lattice Boltzmann implementations on multi-core multi-socket systems
Ulrich Rüde | Gerhard Wellein | Thomas Zeiser | Klaus Iglberger | Stefan Donath | Aditya Nitsure
[1] Markus Kowarschik, et al. Data locality optimizations for iterative numerical algorithms and cellular automata on hierarchical memory architectures, 2004, Advances in simulation.
[2] Chau-Wen Tseng, et al. Tiling Optimizations for 3D Scientific Computations, 2000, ACM/IEEE SC 2000 Conference (SC'00).
[3] Gerhard Wellein, et al. Optimizing performance on modern HPC systems: learning from simple kernel benchmarks, 2006.
[4] Harihar Rajaram, et al. Accuracy and Computational Efficiency in 3D Dispersion via Lattice-Boltzmann: Models for Dispersion in Rough Fractures and Double-Diffusive Fingering, 1998.
[5] Ulrich Rüde, et al. Optimization and Profiling of the Cache Performance of Parallel Lattice Boltzmann Codes in 2D and 3D, 2003.
[6] Ernst Rank, et al. Parallelization Strategies and Efficiency of CFD Computations in Complex Geometries Using Lattice Boltzmann Methods on High-Performance Computers, 2002.
[7] Gerhard Wellein, et al. On the single processor performance of simple lattice Boltzmann kernels, 2006.
[8] Jacques Periaux, et al. Parallel Computational Fluid Dynamics 2005: Theory and Applications, 2006.
[9] Volker Strumpen, et al. Cache oblivious stencil computations, 2005, ICS '05.
[10] Gerhard Wellein, et al. Towards Optimal Performance for Lattice Boltzmann Applications on Terascale Computers, 2006.
[11] G. Wellein, et al. Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method, 2008.
[12] Charles E. Leiserson, et al. Cache-Oblivious Algorithms, 2003, CIAC.
[13] J. Boon. The Lattice Boltzmann Equation for Fluid Dynamics and Beyond, 2003.