Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures
暂无分享,去创建一个
Samuel Williams | Leonid Oliker | John Shalf | Katherine Yelick | David A. Patterson | Kaushik Datta | Mark Murphy | Vasily Volkov | Jonathan Carter | J. Shalf | Samuel Williams | K. Yelick | J. Carter | V. Volkov | David A. Patterson | M. Murphy | L. Oliker | K. Datta
[1] Samuel Williams,et al. Implicit and explicit optimizations for stencil computations , 2006, MSPC '06.
[2] Michael Gschwind. Chip multiprocessing and the cell broadband engine , 2006, CF '06.
[3] Siddhartha Chatterjee,et al. Cache-Efficient Multigrid Algorithms , 2004, Int. J. High Perform. Comput. Appl..
[4] M. Berger,et al. Adaptive mesh refinement for hyperbolic partial differential equations , 1982 .
[5] Katherine Yelick,et al. OSKI: A library of automatically tuned sparse matrix kernels , 2005 .
[6] Yuefan Deng,et al. New trends in high performance computing , 2001, Parallel Computing.
[7] Leonid Oliker,et al. Impact of modern memory subsystems on cache optimizations for stencil computations , 2005, MSP '05.
[8] Monica S. Lam,et al. Blocking and array contraction across arbitrarily nested loops using affine partitioning , 2001, PPoPP '01.
[9] Samuel Williams,et al. The Landscape of Parallel Computing Research: A View from Berkeley , 2006 .
[10] Samuel Williams,et al. Optimization of sparse matrix-vector multiplication on emerging multicore platforms , 2009, Parallel Comput..
[11] Chau-Wen Tseng,et al. Tiling Optimizations for 3D Scientific Computations , 2000, ACM/IEEE SC 2000 Conference (SC'00).
[12] Samuel Williams,et al. The potential of the cell processor for scientific computing , 2005, CF '06.
[13] Samuel Williams,et al. Lattice Boltzmann simulation optimization on leading multicore platforms , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.