Performance Analysis of the Lattice Boltzmann Method on x 86-64 Architectures
暂无分享,去创建一个
The Lattice Boltzmann method (LBM) is a well established algorithm to simulate fluid flow. The complexity of todays 3D simulation problems resulting in long computation times together with the fact that a standard implementation of the LBM only achieves a small fraction of the potential of a modern CPU is the motivation for this performance analysis. We show in our paper, that it is crucial to combine new CPU architectural features as software prefetching and SIMD instruction set extensions, with the established cache blocking techniques to utilize the computational power of modern CPUs.
[1] Markus Kowarschik,et al. Data locality optimizations for iterative numerical algorithms and cellular automata on hierarchical memory architectures , 2004, Advances in simulation.
[2] Stefan Donath. On Optimized Implementations of the Lattice Boltzmann Method on Contemporary Architectures , 2004 .
[3] Joachim Wilke,et al. Cache Optimizations for the Lattice Boltzmann Method in 2D , 2003 .
[4] D. Wolf-Gladrow. Lattice-Gas Cellular Automata and Lattice Boltzmann Models: An Introduction , 2000 .