Blocking and array contraction across arbitrarily nested loops using affine partitioning
[1] William Pugh, et al. Iteration Space Slicing for Locality, 1999, LCPC.
[2] Paul Feautrier. Some Efficient Solutions to the Affine Scheduling Problem, Part I: One-dimensional Time, 1993.
[3] Monica S. Lam, et al. An affine partitioning algorithm to maximize parallelism and minimize communication, 1999, ICS '99.
[4] David K. Smith. Theory of Linear and Integer Programming, 1987.
[5] Vivek Sarkar, et al. Optimization of array accesses by collective loop transformations, 1991, ICS '91.
[6] A. Jameson. Solution of the Euler equations for two dimensional transonic flow by a multigrid method, 1983.
[7] Monica S. Lam, et al. Maximizing parallelism and minimizing synchronization with affine transforms, 1997, POPL '97.
[8] Chau-Wen Tseng, et al. Compiler optimizations for improving data locality, 1994, ASPLOS VI.
[9] Amer Diwan, et al. SUIF Explorer: an interactive and interprocedural parallelizer, 1999, PPoPP '99.
[10] Lawrence Snyder, et al. The implementation and evaluation of fusion and contraction in array languages, 1998, PLDI '98.
[11] David H. Bailey, et al. The NAS kernel benchmark program, 1985.
[12] Monica S. Lam, et al. Maximizing Parallelism and Minimizing Synchronization with Affine Partitions, 1998, Parallel Comput.
[13] Monica S. Lam, et al. A data locality optimizing algorithm, 1991, PLDI '91.
[14] Ken Kennedy, et al. Optimizing for parallelism and data locality, 1992.
[15] Michael E. Wolf, et al. Improving locality and parallelism in nested loops, 1992.
[16] Keshav Pingali, et al. Tiling Imperfectly-nested Loop Nests, 2000, ACM/IEEE SC 2000 Conference (SC'00).
[17] Ken Kennedy, et al. Maximizing Loop Parallelism and Improving Data Locality via Loop Fusion and Distribution, 1993, LCPC.
[18] Michael Wolfe, et al. High performance compilers for parallel computing, 1995.
[19] V. Sarkar, et al. Collective Loop Fusion for Array Contraction, 1992, LCPC.
[20] Keshav Pingali, et al. Data-centric multi-level blocking, 1997, PLDI '97.
[21] David F. Bacon, et al. Compiler transformations for high-performance computing, 1994, CSUR.