Optimized Stencil Computation Using In-Place Calculation on Modern Multicore Systems
暂无分享,去创建一个
Werner Augustin | Jan-Philipp Weiss | Vincent Heuveline | V. Heuveline | Jan-Philipp Weiss | W. Augustin
[1] Ulrich Rüde,et al. Optimization and Profiling of the Cache Performance of Parallel Lattice Boltzmann Codes in 2 D and 3 D ∗ , 2003 .
[2] Katherine Yelick,et al. OSKI: A library of automatically tuned sparse matrix kernels , 2005 .
[3] Yuefan Deng,et al. New trends in high performance computing , 2001, Parallel Computing.
[4] Samuel Williams,et al. Implicit and explicit optimizations for stencil computations , 2006, MSPC '06.
[5] Samuel Williams,et al. Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors , 2007, SIAM Rev..
[6] David G. Wonnacott,et al. Time Skewing for Parallel Computers , 1999, LCPC.
[7] Samuel Williams,et al. The Landscape of Parallel Computing Research: A View from Berkeley , 2006 .
[8] Samuel Williams,et al. Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.