Reuse Distance as a Metric for Cache Behavior.
暂无分享,去创建一个
[1] B. Ramakrishna Rau,et al. EPIC: Explicititly Parallel Instruction Computing , 2000, Computer.
[2] Keshav Pingali,et al. Data-centric multi-level blocking , 1997, PLDI '97.
[3] Monica S. Lam,et al. The cache performance and optimizations of blocked algorithms , 1991, ASPLOS IV.
[4] Steven W. K. Tjiang,et al. SUIF: an infrastructure for research on parallelizing and optimizing compilers , 1994, SIGP.
[5] Alan Jay Smith,et al. Evaluating Associativity in CPU Caches , 1989, IEEE Trans. Computers.
[6] Hiroshi Nakamura,et al. Augmenting Loop Tiling with Data Alignment for Improved Cache Performance , 1999, IEEE Trans. Computers.
[7] Kathryn S. McKinley,et al. Tile size selection using cache organization and data layout , 1995, PLDI '95.
[8] Peter J. Denning,et al. Operating Systems Theory , 1973 .
[9] Gyungho Lee,et al. Reference distance as a metric for data locality , 1997, Proceedings High Performance Computing on the Information Superhighway. HPC Asia '97.
[10] W. Jalby,et al. To copy or not to copy: a compile-time technique for assessing when data copying should be used to eliminate cache conflicts , 1993, Supercomputing '93.
[11] François Bodin,et al. Skewed-associative Caches , 1993, PARLE.
[12] Olivier Temam,et al. Quantifying loop nest locality using SPEC'95 and the perfect benchmarks , 1999, TOCS.
[13] Chau-Wen Tseng,et al. Data transformations for eliminating conflict misses , 1998, PLDI.
[14] Norman P. Jouppi,et al. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers , 1990, [1990] Proceedings. The 17th Annual International Symposium on Computer Architecture.
[15] Kristof Beyls,et al. Compiler Generated Multithreading to Alleviate Memory Latency , 2000, J. Univers. Comput. Sci..