Continual flow pipelines: achieving resource-efficient latency tolerance
暂无分享,去创建一个
Haitham Akkary | Ravi Rajwar | Amit Gandhi | Srikanth T. Srinivasan | Michael Upton | M. Upton | Ravi Rajwar | Haitham Akkary | A. Gandhi
[1] Trevor N. Mudge,et al. Author retrospective improving data cache performance by pre-executing instructions under a cache miss , 1997, International Conference on Supercomputing.
[2] Haitham Akkary,et al. Continual flow pipelines , 2004, ASPLOS XI.
[3] Josep Llosa,et al. Out-of-order commit processors , 2004, 10th International Symposium on High Performance Computer Architecture (HPCA'04).
[4] Josep Llosa,et al. Large virtual robs by processor checkpointing , 2002 .
[5] Tejas Karkhanis,et al. A Day in the Life of a Data Cache Miss , 2002 .
[6] Eric Rotenberg,et al. A large, fast instruction window for tolerating cache misses , 2002, ISCA.
[7] Onur Mutlu,et al. Runahead execution: an alternative to very large instruction windows for out-of-order processors , 2003, The Ninth International Symposium on High-Performance Computer Architecture, 2003. HPCA-9 2003. Proceedings..
[8] Haitham Akkary,et al. Checkpoint Processing and Recovery: Towards Scalable Large Instruction Window Processors , 2003, MICRO.
[9] Haitham Akkary,et al. Checkpoint Processing and Recovery: An Efficient, Scalable Alternative to Reorder Buffers , 2003, IEEE Micro.
[10] Alvin M. Despain,et al. The 16-fold way: a microparallel taxonomy , 1993, MICRO 1993.
[11] David J. Sager,et al. The microarchitecture of the Pentium 4 processor , 2001 .
[12] Mateo Valero,et al. Dynamic Register Renaming Through Virtual-Physical Registers , 2000, J. Instr. Level Parallelism.