SimRPU: A Simulation Environment for Reconfigurable Architecture Exploration

To assist the system architects with fast exploration and performance evaluation of the reconfigurable software/hardware architectures, this paper presents a system-level simulator, named after SimRPU, for the reconfigurable processing unit (RPU), which is the major computing engine in reconfigurable processor. The proposed simulator consists of a simulation kernel, a software compiler, a system profiler providing performance, area and power information for the desired architectures, and a system debugger supporting inspecting and modification of the internal state of the RPU. Object-oriented hierarchical and parameterized architecture modeling techniques are proposed to satisfy the requirements for a fast and comprehensive evaluation. Cycle-accurate simulation mechanisms are developed to improve the accuracy of the profiled performance data. Compared with the traditional register transfer level (RTL) based simulation scheme, the proposed simulator could achieve an average speedup of 18.5× with only 3.5% reduction on performance estimation accuracy. One reconfigurable processor targeted at high-definition multimedia decoding applications (such as H.264, MPEG2, AVS, etc.) is implemented with Taiwan Semiconductor Manufacturing Company 65-nm process using the proposed exploration and design flow. The measured results show that the implemented architecture has obvious advantages in terms of both performance and power consumption than the reference designs in multimedia decoding applications.

[1]  Roberto Guerrieri,et al.  A Heterogeneous Digital Signal Processor for Dynamically Reconfigurable Computing , 2010, IEEE Journal of Solid-State Circuits.

[2]  Leibo Liu,et al.  Compiler Framework for Reconfigurable Computing Architecture , 2009, IEICE Trans. Electron..

[3]  F. Tobajas,et al.  Survey of reconfigurable architectures for multimedia applications , 2009, Microtechnologies.

[4]  Yutaka Arakawa,et al.  A novel traffic engineering method using on-chip diorama network on dynamically reconfigurable processor DAPDNA-2 , 2009, 2009 International Conference on High Performance Switching and Routing.

[5]  Bjorn De Sutter,et al.  Implementation of a Coarse-Grained Reconfigurable Media Processor for AVC Decoder , 2008, J. Signal Process. Syst..

[6]  George Michelogiannakis,et al.  An analysis of on-chip interconnection networks for large-scale chip multiprocessors , 2010, TACO.

[7]  Leibo Liu,et al.  Reducing configuration contexts for coarse-grained reconfigurable architecture , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[8]  Jürgen Becker,et al.  A General Purpose Partially Reconfigurable Processor Simulator (PReProS) , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[9]  Frank Vahid,et al.  Warp Processing: Dynamic Translation of Binaries to FPGA Circuits , 2008, Computer.

[10]  Hui Xu,et al.  A low power many-core SoC with two 32-core clusters connected by tree based NoC for multimedia applications , 2012, 2012 Symposium on VLSI Circuits (VLSIC).

[11]  Jürgen Becker,et al.  Retargeting, Evaluating, and Generating Reconfigurable Array-Based Architectures , 2008, 2008 Symposium on Application Specific Processors.

[12]  Fadi J. Kurdahi,et al.  MorphoSys: An Integrated Reconfigurable System for Data-Parallel and Computation-Intensive Applications , 2000, IEEE Trans. Computers.

[13]  Jürgen Becker,et al.  H. 264 Decoder at HD Resolution on a Coarse Grain Dynamically Reconfigurable Architecture , 2007, 2007 International Conference on Field Programmable Logic and Applications.

[14]  Leibo Liu,et al.  Parallelization of Computing-Intensive Tasks of the H.264 High Profile Decoding Algorithm on a Reconfigurable Multimedia System , 2010, IEICE Trans. Inf. Syst..

[15]  Kostas Masselos,et al.  SystemC and OCAPI-xl Based System-Level Design for Reconfigurable Systems-on-Chip , 2004, FDL.

[16]  Nikil D. Dutt,et al.  IDAP: a tool for high-level power estimation of custom array structures , 2003, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[17]  Bertil Svensson,et al.  Evolution in architectures and programming methodologies of coarse-grained reconfigurable computing , 2009, Microprocess. Microsystems.

[18]  Pao-Ann Hsiung,et al.  Perfecto: A systemc-based design-space exploration framework for dynamically reconfigurable architectures , 2008, TRETS.