Triggered instructions: a control paradigm for spatially-programmed architectures
暂无分享,去创建一个
Antonia Zhai | Aamer Jaleel | Daniel Lustig | Michael Adler | Joel S. Emer | Neal Clayton Crago | Vladimir Pavlov | Michael Pellauer | Angshuman Parashar | Bushra Ahsan | Randy L. Allmon | Mohit Gambhir | Stephen Maresh | Rachid Rayess | A. Parashar | J. Emer | A. Jaleel | R. Allmon | Daniel Lustig | Michael Pellauer | Antonia Zhai | Michael Adler | Bushra Ahsan | N. Crago | S. Maresh | Vladimir Pavlov | M. Gambhir | Rachid Rayess
[1] H. T. Kung. The CMU warp processor , 1990 .
[2] Lizy Kurian John,et al. Scaling to the end of silicon with EDGE architectures , 2004, Computer.
[3] Arvind,et al. Executing a Program on the MIT Tagged-Token Dataflow Architecture , 1990, IEEE Trans. Computers.
[4] Henry Hoffmann,et al. The Raw Microprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs , 2002, IEEE Micro.
[5] John Wawrzynek,et al. Garp: a MIPS processor with a reconfigurable coprocessor , 1997, Proceedings. The 5th Annual IEEE Symposium on Field-Programmable Custom Computing Machines Cat. No.97TB100186).
[6] A. Tsai,et al. PipeRench: A virtualized programmable datapath in 0.18 micron technology , 2002, Proceedings of the IEEE 2002 Custom Integrated Circuits Conference (Cat. No.02CH37285).
[7] André DeHon,et al. MATRIX: a reconfigurable computing architecture with configurable instruction distribution and deployable resources , 1996, 1996 Proceedings IEEE Symposium on FPGAs for Custom Computing Machines.
[8] Jack B. Dennis,et al. A preliminary architecture for a basic data-flow processor , 1974, ISCA '75.
[9] Douglas W. Clark,et al. A Characterization of Processor Performance in the vax-11/780 , 1984, ISCA '84.
[10] Scott Hauck,et al. Reconfigurable computing: a survey of systems and software , 2002, CSUR.
[11] Vaughn Betz,et al. Speed and area tradeoffs in cluster-based FPGA architectures , 2000, IEEE Trans. Very Large Scale Integr. Syst..
[12] Steven Swanson,et al. The WaveScalar architecture , 2007, TOCS.
[13] T. Mohsenin,et al. An asynchronous array of simple processors for dsp applications , 2006, 2006 IEEE International Solid State Circuits Conference - Digest of Technical Papers.
[14] Andrew S. Grimshaw,et al. Revisiting sorting for GPGPU stream architectures , 2010, 2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT).
[15] Alan Gray,et al. Deterministic Parallel Processing , 2006, International Journal of Parallel Programming.
[16] Henk Corporaal,et al. Transport-Triggering versus Operation-Triggering , 1994, CC.
[17] K. Mani Chandy,et al. Parallel program design - a foundation , 1988 .
[18] K. Mani Chandy. Parallel program design , 1989 .
[19] Karthikeyan Sankaralingam,et al. Dynamically Specialized Datapaths for energy efficient computing , 2011, 2011 IEEE 17th International Symposium on High Performance Computer Architecture.
[20] Edsger W. Dijkstra,et al. Guarded commands, nondeterminacy and formal derivation of programs , 1975, Commun. ACM.
[21] Samuel Williams,et al. The Landscape of Parallel Computing Research: A View from Berkeley , 2006 .
[22] Rudy Lauwereins,et al. ADRES: An Architecture with Tightly Coupled VLIW Processor and Coarse-Grained Reconfigurable Matrix , 2003, FPL.
[23] Robert A. van de Geijn,et al. SUMMA: Scalable Universal Matrix Multiplication Algorithm , 1995 .
[24] Zhiyi Yu,et al. A 167-Processor Computational Platform in 65 nm CMOS , 2009, IEEE Journal of Solid-State Circuits.
[25] Donald E. Knuth,et al. Fast Pattern Matching in Strings , 1977, SIAM J. Comput..
[26] Robert A. van de Geijn,et al. SUMMA: scalable universal matrix multiplication algorithm , 1995, Concurr. Pract. Exp..
[27] Andreas Moshovos,et al. CHIMAERA: a high-performance architecture with a tightly-coupled reconfigurable functional unit , 2000, ISCA '00.