Efficient discovery of regular stride patterns in irregular programs and its use in compiler prefetching
暂无分享,去创建一个
[1] Chi-Keung Luk,et al. Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors , 2001, Proceedings 28th Annual International Symposium on Computer Architecture.
[2] Brad Calder,et al. Value profiling , 1997, Proceedings of 30th Annual International Symposium on Microarchitecture.
[3] T. Ozawa,et al. Cache miss heuristics and preloading techniques for general-purpose programs , 1995, Proceedings of the 28th Annual International Symposium on Microarchitecture.
[4] Norman P. Jouppi,et al. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers , 1990, [1990] Proceedings. The 17th Annual International Symposium on Computer Architecture.
[5] Joseph A. Fisher,et al. Predicting conditional branch directions from previous runs of a program , 1992, ASPLOS V.
[6] Rakesh Krishnaiyer,et al. An Advanced Optimizer for the IA-64 Architecture , 2000, IEEE Micro.
[7] Ken Kennedy,et al. Software prefetching , 1991, ASPLOS IV.
[8] Scott A. Mahlke,et al. Using profile information to assist classic code optimizations , 1991, Softw. Pract. Exp..
[9] Scott Mahlke,et al. Effective compiler support for predicated execution using the hyperblock , 1992, MICRO 1992.
[10] Todd C. Mowry,et al. Compiler-based prefetching for recursive data structures , 1996, ASPLOS VII.
[11] Scott A. Mahlke,et al. Effective compiler support for predicated execution using the hyperblock , 1992, MICRO 25.
[12] Craig Zilles,et al. Execution-based prediction using speculative slices , 2001, ISCA 2001.
[13] Per Stenström,et al. Evaluation of Hardware-Based Stride and Sequential Prefetching in Shared-Memory Multiprocessors , 1996, IEEE Trans. Parallel Distributed Syst..
[14] Rakesh Krishnaiyer,et al. Value-Profile Guided Stride Prefetching for Irregular Code , 2002, CC.
[15] Mikko H. Lipasti,et al. Partial resolution in branch target buffers , 1995, Proceedings of the 28th Annual International Symposium on Microarchitecture.
[16] Youfeng Wu,et al. Memory performance analysis of SPEC2000C for the Intel(R) Itanium/sup TM/ processor , 2001 .
[17] James R. Larus,et al. Optimally profiling and tracing programs , 1992, POPL '92.
[18] Brad Calder,et al. Predictor-directed stream buffers , 2000, MICRO 33.
[19] John L. Henning. SPEC CPU2000: Measuring CPU Performance in the New Millennium , 2000, Computer.
[20] Gurindar S. Sohi,et al. Effective jump-pointer prefetching for linked data structures , 1999, ISCA.
[21] Wei-Chung Hsu,et al. Data Prefetching On The HP PA-8000 , 1997, Conference Proceedings. The 24th Annual International Symposium on Computer Architecture.
[22] David C. Sehr,et al. On the importance of points-to analysis and other memory disambiguation methods for C programs , 2001, PLDI '01.
[23] Anoop Gupta,et al. Design and evaluation of a compiler algorithm for prefetching , 1992, ASPLOS V.
[24] Andreas Moshovos,et al. Dependence based prefetching for linked data structures , 1998, ASPLOS VIII.
[25] Christopher Hughes,et al. Speculative precomputation: long-range prefetching of delinquent loads , 2001, ISCA 2001.
[26] Guang R. Gao,et al. Speculative Prefetching of Induction Pointers , 2001, CC.
[27] Richard E. Kessler,et al. Evaluating stream buffers as a secondary cache replacement , 1994, Proceedings of 21 International Symposium on Computer Architecture.
[28] Saumya K. Debray,et al. Code Specialization Based on Value Profiles , 2000, SAS.