Tile Selection Algorithms and their Performance Models
暂无分享,去创建一个
[1] Monica S. Lam,et al. The cache performance and optimizations of blocked algorithms , 1991, ASPLOS IV.
[2] Chau-Wen Tseng,et al. Improving data locality with loop transformations , 1996, TOPL.
[3] Philippe Clauss. Counting Solutions to Linear and Nonlinear Constraints Through Ehrhart Polynomials: Applications to Analyze and Transform Scientific Programs , 1996, International Conference on Supercomputing.
[4] David A. Patterson,et al. Computer Architecture: A Quantitative Approach , 1969 .
[5] R. C. Whaley,et al. Automatically Tuned Linear Algebra Software (ATLAS) , 2011, Encyclopedia of Parallel Computing.
[6] Olivier Temam,et al. A quantitative analysis of loop nest locality , 1996, ASPLOS VII.
[7] Chau-Wen Tseng,et al. Data transformations for eliminating conflict misses , 1998, PLDI.
[8] Chau-Wen Tseng,et al. Eliminating conflict misses for high performance architectures , 1998, ICS '98.
[9] Keshav Pingali,et al. An experimental evaluation of tiling and shackling for memory hierarchy management , 1999, ICS '99.
[10] Jacqueline Chame,et al. A tile selection algorithm for data locality and cache interference , 1999, ICS '99.
[11] Norman P. Jouppi,et al. Improving direct-mapped cache performance by the addition of a small fully-associative cache and pre , 1990, ISCA 1990.
[12] Karim Esseghir. Improving data locality for caches , 1993 .
[13] Keshav Pingali,et al. Data-centric multi-level blocking , 1997, PLDI '97.
[14] Graham R. Nudd,et al. Predicting the Cache Miss Ratio of Loop-Nested Array References , 1997 .
[15] Olivier Temam,et al. Cache interference phenomena , 1994, SIGMETRICS.
[16] Vivek Sarkar,et al. On Estimating and Enhancing Cache Effectiveness , 1991, LCPC.
[17] Sharad Malik,et al. Cache miss equations: an analytical representation of cache misses , 1997, ICS '97.
[18] Sharad Malik,et al. Precise miss analysis for program transformations with caches of arbitrary associativity , 1998, ASPLOS VIII.
[19] Kathryn S. McKinley,et al. Tile size selection using cache organization and data layout , 1995, PLDI '95.
[20] William Pugh,et al. Counting solutions to Presburger formulas: how and why , 1994, PLDI '94.
[21] W. Jalby,et al. To copy or not to copy: a compile-time technique for assessing when data copying should be used to eliminate cache conflicts , 1993, Supercomputing '93.
[22] Chau-Wen Tseng,et al. A Comparison of Compiler Tiling Algorithms , 1999, CC.
[23] James Demmel,et al. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology , 1997, ICS '97.
[24] Norman P. Jouppi,et al. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers , 1990, [1990] Proceedings. The 17th Annual International Symposium on Computer Architecture.
[25] Alfons G. Hoekstra,et al. Efficient analytical modelling of multi-level set-associative caches , 1999 .
[26] Graham R. Nudd,et al. Analytical Modeling of Set-Associative Cache Behavior , 1999, IEEE Trans. Computers.
[27] François Bodin,et al. Skewed associativity enhances performance predictability , 1995, Proceedings 22nd Annual International Symposium on Computer Architecture.