A Compiler for Automatic Selection of Suitable Processing-in-Memory Instructions
暂无分享,去创建一个
Luigi Carro | Marco A. Z. Alves | Hameeza Ahmed | Paulo C. Santos | João P. C. Lima | Rafael F. Moura | Antônio C. S. Beck | Rafael Fão de Moura | L. Carro | M. Alves | A. C. S. Beck | P. C. Santos | Hameeza Ahmed | J. P. C. Lima
[1] Luigi Carro,et al. Operand size reconfiguration for big data processing in memory , 2017, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017.
[2] Kiyoung Choi,et al. A scalable processing-in-memory accelerator for parallel graph processing , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).
[3] Ramyad Hadidi,et al. CAIRO , 2017, ACM Trans. Archit. Code Optim..
[4] Krishna M. Kavi,et al. Exploring the Processing-in-Memory design space , 2017, J. Syst. Archit..
[5] Babak Falsafi,et al. The mondrian data engine , 2017, 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA).
[6] Luigi Carro,et al. Saving memory movements through vector processing in the DRAM , 2015, 2015 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES).
[7] Barbara M. Chapman,et al. Towards Automatic HBM Allocation Using LLVM: A Case Study with Knights Landing , 2016, 2016 Third Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC).
[8] Ravi Nair,et al. Evolution of Memory Architecture , 2015, Proceedings of the IEEE.
[9] Luigi Carro,et al. Design space exploration for PIM architectures in 3D-stacked memories , 2018, CF.
[10] J. Thomas Pawlowski,et al. Hybrid memory cube (HMC) , 2011, 2011 IEEE Hot Chips 23 Symposium (HCS).
[11] Timothy M. Jones,et al. Throttling Automatic Vectorization: When Less is More , 2015, 2015 International Conference on Parallel Architecture and Compilation (PACT).
[12] Franz Franchetti,et al. Data reorganization in memory using 3D-stacked DRAM , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).
[13] Gabriel H. Loh,et al. Thermal Feasibility of Die-Stacked Processing in Memory , 2014 .
[14] Luca Benini,et al. Design and Evaluation of a Processing-in-Memory Architecture for the Smart Memory Cube , 2016, ARCS.
[15] Tejas Karkhanis,et al. Active Memory Cube: A processing-in-memory architecture for exascale systems , 2015, IBM J. Res. Dev..
[16] Somayeh Sardashti,et al. The gem5 simulator , 2011, CARN.
[17] Onur Mutlu,et al. Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems , 2016, 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA).