Performance issues on many-core processors: A D2Q37 Lattice Boltzmann scheme as a test-case
暂无分享,去创建一个
Raffaele Tripiccione | Sebastiano Fabio Schifano | Filippo Mantovani | M. Pivanti | F. Mantovani | R. Tripiccione | M. Pivanti | S. Schifano
[1] Alexander Heinecke,et al. An efficient vectorization of linked-cell particle simulations , 2012, CF '12.
[2] L. Biferale,et al. Lattice Boltzmann method with self-consistent thermo-hydrodynamic equilibria , 2009, Journal of Fluid Mechanics.
[3] Federico Toschi,et al. A Multi-GPU Implementation of a D2Q37 Lattice Boltzmann Code , 2011, PPAM.
[4] S. F. Schifano,et al. Implementation and optimization of a thermal Lattice Boltzmann algorithm on a multi-GPU cluster , 2012, 2012 Innovative Parallel Computing (InPar).
[5] Federico Toschi,et al. Optimization of Multi-Phase Compressible Lattice Boltzmann Codes on Massively Parallel Multi-Core Systems , 2011, ICCS.
[6] Geppino Pucci,et al. The Potential of On-Chip Multiprocessing for QCD Machines , 2005, HiPC.
[7] J. Boon. The Lattice Boltzmann Equation for Fluid Dynamics and Beyond , 2003 .
[8] Raffaele Tripiccione,et al. An optimized D2Q37 Lattice Boltzmann code on GP-GPUs , 2013 .
[9] Ulrich Rüde,et al. Optimization and Profiling of the Cache Performance of Parallel Lattice Boltzmann Codes in 2 D and 3 D ∗ , 2003 .
[10] Federico Toschi,et al. Lattice Boltzmann methods for thermal flows: Continuum limit and applications to compressible Rayleigh-Taylor systems , 2010, 1005.3639.
[11] Federico Toschi,et al. Lattice Boltzmann method simulations on massively parallel multi-core architectures , 2011, SpringSim.
[12] Sebastian Szkoda,et al. Accelerating cellular automata simulations using AVX and CUDA , 2012, ArXiv.
[13] Gerhard Wellein,et al. On the single processor performance of simple lattice Boltzmann kernels , 2006 .