Hardware reuse in modern application-specific processors and accelerators

Effective exploitation of the application-specific parallel patterns and computation operations through their direct implementation in hardware is the base for construction of high-quality application-specific (re-) configurable application specific instruction set processors (ASIPs) and hardware accelerators for modern highly-demanding applications. Although it receives a lot of attention from the researchers and practitioners, a very important problem of hardware reuse in ASIP and accelerator synthesis is clearly underestimated and does not get enough attention in the published research. This paper is an effect of an industry and academic collaborative research. It analyses the problem of hardware sharing, shows its high practical relevance, as well as a big influence of hardware sharing on the major circuit and system parameters, and its importance for the multi-objective optimization and tradeoff exploitation. It also demonstrates that the state-of-the-art synthesis tools do not sufficiently address this problem and gives several guidelines related to enhancement of the hardware reuse.

[1]  Paolo Bonzini,et al.  Recurrence-Aware Instruction Set Selection for Extensible Embedded Processors , 2008, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[2]  Jim Hurley,et al.  Ray Tracing Goes Mainstream , 2005 .

[3]  Behrooz Parhami,et al.  Computer arithmetic - algorithms and hardware designs , 1999 .

[4]  Nigel P. Topham,et al.  Exploring the unified design-space of custom-instruction selection and resource sharing , 2010, 2010 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation.

[5]  Hai Lin,et al.  Resource sharing of pipelined custom hardware extension for energy-efficient application-specific instruction set processor design , 2009, ICCD.

[6]  Román Hermida,et al.  A global approach to improve conditional hardware reuse in high-level synthesis , 2002, J. Syst. Archit..

[7]  Nigel P. Topham,et al.  Design-Space Exploration of Resource-Sharing Solutions for Custom Instruction Set Extensions , 2009, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[8]  L. Jozwiak,et al.  Static Power Reduction in Nano CMOS Circuits Through an Adequate Circuit Synthesis , 2007, 2007 14th International Conference on Mixed Design of Integrated Circuits and Systems.

[9]  Pong P. Chu RTL Hardware Design Using VHDL: Coding for Efficiency, Portability, and Scalability , 2006 .

[10]  Nadia Nedjah,et al.  Modern Architectures for Embedded Reconfigurable Systems - a Survey , 2009, J. Circuits Syst. Comput..

[11]  Nadia Nedjah,et al.  Modern development methods and tools for embedded reconfigurable systems: A survey , 2010, Integr..

[12]  Scott A. Mahlke,et al.  Automated custom instruction generation for domain-specific processor acceleration , 2005, IEEE Transactions on Computers.

[13]  Giovanni De Micheli,et al.  Automatic instruction set extension and utilization for embedded processors , 2003, Proceedings IEEE International Conference on Application-Specific Systems, Architectures, and Processors. ASAP 2003.

[14]  Paolo Faraboschi,et al.  Embedded Computing: A VLIW Approach to Architecture, Compilers and Tools , 2004 .

[15]  Craig Ulmer,et al.  Floating-Point Unit Reuse in an FPGA Implementation of a Ray-Triangle Intersection Algorithm , 2006, ERSA.

[16]  Israel Koren Computer arithmetic algorithms , 1993 .

[17]  Steve Kilts Control System Components , 2008 .

[18]  Lech Józwiak,et al.  Quality-driven methodology for demanding accelerator design , 2010, 2010 11th International Symposium on Quality Electronic Design (ISQED).

[19]  Paolo Ienne,et al.  Introducing control-flow inclusion to support pipelining in custom instruction set extensions , 2009, 2009 IEEE 7th Symposium on Application Specific Processors.

[20]  Cid C. de Souza,et al.  Efficient datapath merging for partially reconfigurable architectures , 2005, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[21]  Nigel P. Topham,et al.  Resource Sharing in Custom Instruction Set Extensions , 2008, 2008 Symposium on Application Specific Processors.