Energy efficient tiling on a Many-Core Architecture
暂无分享,去创建一个
[1] K. Yee. Numerical solution of initial boundary value problems involving maxwell's equations in isotropic media , 1966 .
[2] Allan Porterfield,et al. Data cache performance of supercomputer applications , 1990, Proceedings SUPERCOMPUTING '90.
[3] F. Frances Yao,et al. A scheduling model for reduced CPU energy , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.
[4] Hiroshi Nakamura,et al. SCIMA: a novel processor architecture for high performance computing , 2000, Proceedings Fourth International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region.
[5] Vikas Agarwal,et al. Static energy reduction techniques for microprocessor caches , 2001, Proceedings 2001 IEEE International Conference on Computer Design: VLSI in Computers and Processors. ICCD 2001.
[6] Sang Lyul Min,et al. An Accurate Instruction-Level Energy Consumption Model for Embedded RISC Processors , 2001 .
[7] Guang R. Gao,et al. Optimization of Dense Matrix Multiplication on IBM Cyclops-64: Challenges and Experiences , 2006, Euro-Par.
[8] Uday Bondhugula,et al. Effective automatic parallelization of stencil computations , 2007, PLDI '07.
[9] Guang R. Gao,et al. Optimizing the Fast Fourier Transform on a Multi-core Architecture , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.
[10] Petru Eles,et al. Energy Optimization of Multiprocessor Systems on Chip by Voltage Selection , 2007, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.
[11] Guang R. Gao,et al. Mapping the LU decomposition on a many-core architecture: challenges and solutions , 2009, CF '09.
[12] Josep Torrellas. Architectures for Extreme-Scale Computing , 2009, Computer.
[13] Guang R. Gao,et al. Locality Optimization of Stencil Applications Using Data Dependency Graphs , 2010, LCPC.
[14] Guang R. Gao,et al. Optimized Dense Matrix Multiplication on a Many-Core Architecture , 2010, Euro-Par.
[15] Guang R. Gao,et al. Computer Architecture and Parallel Systems Laboratory Dynamic Percolation-Mapping Dense Matrix Multiplication on a Many-Core Architecture , 2010 .