Large vector extensions inside the HMC
Luigi Carro | Matthias Diener | Marco Antonio Zanata Alves | Paulo C. Santos
[1] Steven Swanson, et al. Near-Data Processing: Insights from a MICRO-46 Workshop, 2014, IEEE Micro.
[2] Khaled Salah, et al. Memory controller architectures: A comparative study, 2013, 2013 8th IEEE Design and Test Symposium.
[3] Christoforos E. Kozyrakis, et al. A case for intelligent RAM, 1997, IEEE Micro.
[4] David A. Padua, et al. An Evaluation of Vectorizing Compilers, 2011, 2011 International Conference on Parallel Architectures and Compilation Techniques.
[5] Kiyoung Choi, et al. A scalable processing-in-memory accelerator for parallel graph processing, 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).
[6] Luigi Carro, et al. Saving memory movements through vector processing in the DRAM, 2015, 2015 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES).
[7] J. Jeddeloh, et al. Hybrid memory cube new DRAM architecture increases density and performance, 2012, 2012 Symposium on VLSI Technology (VLSIT).
[8] Jung Ho Ahn, et al. The McPAT Framework for Multicore and Manycore Architectures: Simultaneously Modeling Power, Area, and Timing, 2013, TACO.
[9] Bruce Jacob, et al. Memory Systems: Cache, DRAM, Disk, 2007.
[10] Fong Pong, et al. Missing the Memory Wall: The Case for Processor/Memory Integration, 1996, 23rd Annual International Symposium on Computer Architecture (ISCA'96).
[11] Neal Cardwell, et al. Evaluation of Existing Architectures in IRAM Systems, 1998.
[12] Tejas Karkhanis, et al. Active Memory Cube: A processing-in-memory architecture for exascale systems, 2015, IBM J. Res. Dev.
[13] Franz Franchetti, et al. A 3D-stacked logic-in-memory accelerator for application-specific data intensive computing, 2013, 2013 IEEE International 3D Systems Integration Conference (3DIC).
[14] Duncan G. Elliott, et al. Computational RAM: Implementing Processors in Memory, 1999, IEEE Des. Test Comput.
[15] Philippe Olivier Alexandre Navaux, et al. SiNUCA: A Validated Micro-Architecture Simulator, 2015, 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems.
[16] Feifei Li, et al. NDC: Analyzing the impact of 3D-stacked memory+logic devices on MapReduce workloads, 2014, 2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).
[17] A. Jourdain, et al. 3D stacked IC demonstration using a through Silicon Via First approach, 2008, 2008 IEEE International Electron Devices Meeting.
[18] Jung Ho Ahn, et al. DRAMA: An Architecture for Accelerated Processing Near Memory, 2015, IEEE Computer Architecture Letters.
[19] Kiyoung Choi, et al. PIM-enabled instructions: A low-overhead, locality-aware processing-in-memory architecture, 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).
[20] Luigi Carro, et al. Opportunities and Challenges of Performing Vector Operations inside the DRAM, 2015, MEMSYS.
[21] Franz Franchetti, et al. Accelerating sparse matrix-matrix multiplication with 3D-stacked logic-in-memory hardware, 2013, 2013 IEEE High Performance Extreme Computing Conference (HPEC).
[22] Paul Rosenfeld, et al. Performance Exploration of the Hybrid Memory Cube, 2014.
[23] Josep Torrellas, et al. A Near-Memory Processor for Vector, Streaming and Bit Manipulation Workloads, 2005.