Serialization-Aware Mini-Graphs: Performance with Fewer Resources
[1] Rastislav Bodík, et al. Focusing processor policies via critical-path prediction, 2001, Proceedings 28th Annual International Symposium on Computer Architecture.
[2] Kurt Keutzer, et al. A text-compression-based method for code size minimization in embedded systems, 1999, TODE.
[3] Mikko H. Lipasti, et al. An approach for implementing efficient superscalar CISC processors, 2006, The Twelfth International Symposium on High-Performance Computer Architecture, 2006.
[4] Gabriel H. Loh, et al. Static strands: Safely exposing dependence chains for increasing embedded power efficiency, 2007, TECS.
[5] Scott A. Mahlke, et al. Application-Specific Processing on a General-Purpose Core via Transparent Instruction Set Customization, 2004, 37th International Symposium on Microarchitecture (MICRO-37'04).
[6] Rastislav Bodík, et al. Slack: maximizing performance under technological constraints, 2002, ISCA.
[7] Sanjay J. Patel, et al. Characterization of Repeating Dynamic Code Fragments, 2002.
[8] John Wawrzynek, et al. Garp: a MIPS processor with a reconfigurable coprocessor, 1997, Proceedings. The 5th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (Cat. No.97TB100186).
[9] Tilman Wolf, et al. CommBench - a telecommunications benchmark for network processors, 2000, 2000 IEEE International Symposium on Performance Analysis of Systems and Software. ISPASS (Cat. No.00EX422).
[10] Miodrag Potkonjak, et al. MediaBench: a tool for evaluating and synthesizing multimedia and communications systems, 1997, Proceedings of 30th Annual International Symposium on Microarchitecture.
[11] James E. Smith, et al. Using dynamic binary translation to fuse dependent instructions, 2004, International Symposium on Code Generation and Optimization, 2004. CGO 2004.
[12] Gabriel H. Loh, et al. Static strands: safely collapsing dependence chains for increasing embedded power efficiency, 2005, LCTES '05.
[13] Mikko H. Lipasti, et al. Macro-op Scheduling: Relaxing Scheduling Loop Constraints, 2003, MICRO.
[14] Scott Mahlke, et al. Processor acceleration through automated instruction set customization, 2003, Proceedings. 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003. MICRO-36.
[15] Olivier Temam, et al. From sequences of dependent instructions to functions: an approach for improving performance without ILP or speculation, 2004, Proceedings. 31st Annual International Symposium on Computer Architecture, 2004.
[16] Ho-Seop Kim, et al. An instruction set and microarchitecture for instruction level distributed processing, 2002, Proceedings 29th Annual International Symposium on Computer Architecture.
[17] Andreas Moshovos, et al. CHIMAERA: a high-performance architecture with a tightly-coupled reconfigurable functional unit, 2000, Proceedings of 27th International Symposium on Computer Architecture (IEEE Cat. No.RS00201).
[18] Gurindar S. Sohi, et al. Characterizing and predicting value degree of use, 2002, 35th Annual IEEE/ACM International Symposium on Microarchitecture, 2002. (MICRO-35). Proceedings.
[19] Peter G. Sassone, et al. Dynamic Strands: Collapsing Speculative Dependence Chains for Reducing Pipeline Communication, 2004, 37th International Symposium on Microarchitecture (MICRO-37'04).
[20] Trevor Mudge, et al. MiBench: A free, commercially representative embedded benchmark suite, 2001.
[21] Amir Roth, et al. Dataflow Mini-Graphs: Amplifying Superscalar Capacity and Bandwidth, 2004, 37th International Symposium on Microarchitecture (MICRO-37'04).