Compiler-assisted Hybrid Operand Communication
暂无分享,去创建一个
[1] S. Winkel. Optimal versus Heuristic Global Code Scheduling , 2007, 40th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2007).
[2] Brad Calder,et al. Basic block distribution analysis to find periodic behavior and simulation points in applications , 2001, Proceedings 2001 International Conference on Parallel Architectures and Compilation Techniques.
[3] Gurindar S. Sohi,et al. Register traffic analysis for streamlining inter-operation communication in fine-grain parallel processors , 1992, MICRO.
[4] Scott A. Mahlke,et al. Effective compiler support for predicated execution using the hyperblock , 1992, MICRO 25.
[5] Ramon Canal,et al. A low-complexity issue logic , 2000, ICS '00.
[6] Kathryn S. McKinley,et al. Strategies for mapping dataflow blocks to distributed hardware , 2008, 2008 41st IEEE/ACM International Symposium on Microarchitecture.
[7] Jack B. Dennis,et al. A preliminary architecture for a basic data-flow processor , 1974, ISCA '98.
[8] Mateo Valero,et al. A new pointer-based instruction queue design and its power-performance evaluation , 2005, 2005 International Conference on Computer Design.
[9] Jaehyuk Huh,et al. Exploiting ILP, TLP, and DLP with the polymorphous TRIPS architecture , 2003, ISCA '03.
[10] Gürhan Küçük,et al. Energy-efficient issue queue design , 2003, IEEE Trans. Very Large Scale Integr. Syst..
[11] Michael C. Huang,et al. Energy-efficient hybrid wakeup logic , 2002, ISLPED '02.
[12] M.A. Ramirez,et al. Direct Instruction Wakeup for Out-of-Order Processors , 2004, Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'04).
[13] Lizy Kurian John,et al. Scaling to the end of silicon with EDGE architectures , 2004, Computer.
[14] Ramon Canal,et al. Reducing the complexity of the issue logic , 2001, ICS '01.
[15] Arvind,et al. Executing a Program on the MIT Tagged-Token Dataflow Architecture , 1990, IEEE Trans. Computers.
[16] Doug Burger,et al. An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches , 2002, ASPLOS X.
[17] Aaron Smith,et al. Compiling for EDGE architectures , 2006, International Symposium on Code Generation and Optimization (CGO'06).
[18] Jeffrey R. Diamond,et al. An evaluation of the TRIPS computer system , 2009, ASPLOS.
[19] Jack B. Dennis,et al. A preliminary architecture for a basic data-flow processor , 1974, ISCA '75.