A HLS-Based Toolflow to Design Next-Generation Heterogeneous Many-Core Platforms with Shared Memory
[1] Paolo Ienne, et al. Speculative DMA for architecturally visible storage in instruction set extensions, 2008, CODES+ISSS '08.
[2] Luca Benini, et al. Supporting OpenMP on a multi-cluster embedded MPSoC, 2011, Microprocess. Microsystems.
[3] Jason Helge Anderson, et al. From software threads to parallel hardware in high-level synthesis for FPGAs, 2013, 2013 International Conference on Field-Programmable Technology (FPT).
[4] Luca Benini, et al. Tightly-coupled hardware support to dynamic parallelism acceleration in embedded shared memory clusters, 2014, 2014 Design, Automation & Test in Europe Conference & Exhibition (DATE).
[5] Luca Benini, et al. P2012: Building an ecosystem for a scalable, modular and high-efficiency embedded computing accelerator, 2012, 2012 Design, Automation & Test in Europe Conference & Exhibition (DATE).
[6] Luca Benini, et al. Synthesis-friendly techniques for tightly-coupled integration of hardware accelerators into shared-memory multi-core clusters, 2013, 2013 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS).
[7] Paolo Ienne, et al. Way Stealing: Cache-assisted automatic Instruction Set Extensions, 2009, 2009 46th ACM/IEEE Design Automation Conference.
[8] Nikil D. Dutt, et al. Introduction of Architecturally Visible Storage in Instruction Set Extensions, 2007, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.
[9] Luca Benini, et al. VirtualSoC: A Full-System Simulation Environment for Massively Parallel Heterogeneous System-on-Chip, 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.
[10] Steven Swanson, et al. Greendroid: Exploring the next evolution in smartphone application processors, 2011, IEEE Communications Magazine.
[11] Eduard Ayguadé, et al. OpenMP extensions for FPGA accelerators, 2009, 2009 International Symposium on Systems, Architectures, Modeling, and Simulation.
[12] Zhen Fang, et al. Buffer-Integrated-Cache: A cost-effective SRAM architecture for handheld and embedded platforms, 2011, 2011 48th ACM/EDAC/IEEE Design Automation Conference (DAC).
[13] Luca Benini, et al. Architecture and programming model support for efficient heterogeneous computing on tigthly-coupled shared-memory clusters, 2013, 2013 Conference on Design and Architectures for Signal and Image Processing.
[14] Weng-Fai Wong, et al. Generating hardware from OpenMP programs, 2006, 2006 IEEE International Conference on Field Programmable Technology.
[15] Steven Swanson, et al. Conservation cores: reducing the energy of mature computations, 2010, ASPLOS XV.
[16] Paul A. Viola, et al. Rapid object detection using a boosted cascade of simple features, 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.
[17] Luca Benini, et al. A tightly-coupled hardware controller to improve scalability and programmability of shared-memory heterogeneous clusters, 2014, DATE 2014.
[18] Luca Benini, et al. A tightly-coupled multi-core cluster with shared-memory HW accelerators, 2012, 2012 International Conference on Embedded Computer Systems (SAMOS).
[19] Piotr Dziurzanski, et al. A system for transforming an ANSI C code with OpenMP directives into a SystemC description, 2006, 2006 IEEE Design and Diagnostics of Electronic Circuits and systems.
[20] Eric E. Aubanel, et al. An OpenMP-based circuit design tool: Customizable bit-width, 2009, 2009 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing.