Buffer Controller-Based Multiple Processing Element Utilization for Dataflow Synthesis

This paper presents an effective design methodology which maps a complex system represented as a dataflow graph to a reconfigurable target architecture having multi-core processors and programmable logics. In order to synchronize data transfers between two processing blocks mapped to different processors (alternatively, one block is mapped to a processor and the other is realized as a hardware), we propose a mapping methodology that exploits the buffer-based dataflow, a new representation technique for realizing data-centric applications in reconfigurable platforms. From the buffer-based dataflow and estimated execution times of functional blocks and data transfers, the proposed methodology creates a mapped partition and generates the template code which runs on the processors of the target platform. We also use a processor initiation scheme to prevent wrong operations from happening when actual execution takes longer than estimated. Our proposed mapping methodology and the generated template code are evaluated with the SystemC model and Xilinx ISE.

[1]  Massimo Poncino,et al.  A cosimulation methodology for HW/SW validation and performance estimation , 2009, TODE.

[2]  Soonhoi Ha,et al.  PeaCE: A hardware-software codesign environment for multimedia embedded systems , 2008, TODE.

[3]  Soonhoi Ha,et al.  A Systematic Design Space Exploration of MPSoC Based on Synchronous Data Flow Specification , 2010, J. Signal Process. Syst..

[4]  Xin Zhao,et al.  An ILP formulation for task mapping and scheduling on multi-core architectures , 2009, 2009 Design, Automation & Test in Europe Conference & Exhibition.

[5]  Kurt Keutzer,et al.  Scheduling task dependence graphs with variable task execution times onto heterogeneous multiprocessors , 2008, EMSOFT '08.

[6]  Sangjin Hong,et al.  Flexible Controller Design and Its Application for Concurrent Execution of Buffer Centric Dataflows , 2007, J. VLSI Signal Process..

[7]  Julien Delorme An Automatic Design Flow for Mapping Application onto a 2D Mesh NoC Architecture , 2007, PATMOS.

[8]  Ahmed Amine Jerraya,et al.  Fast and accurate timed execution of high level embedded software using HW/SW interface simulation model , 2004, ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753).

[9]  Shuvra S. Bhattacharyya,et al.  High-Performance Buffer Mapping to Exploit DRAM Concurrency in Multiprocessor DSP Systems , 2009, 2009 IEEE/IFIP International Symposium on Rapid System Prototyping.

[10]  Alberto L. Sangiovanni-Vincentelli,et al.  Software timing analysis using HW/SW cosimulation and instruction set simulator , 1998, Proceedings of the Sixth International Workshop on Hardware/Software Codesign. (CODES/CASHE'98).

[11]  Wayne H. Wolf,et al.  Communication synthesis for distributed embedded systems , 1995, ICCAD.

[12]  Petar M. Djuric,et al.  Design Methodology for Domain Specific Parameterizable Particle Filter Realizations , 2007, IEEE Transactions on Circuits and Systems I: Regular Papers.

[13]  Soonhoi Ha,et al.  Optimized RTL Code Generation from Coarse-Grain Dataflow Specification for Fast HW/SW Cosynthesis , 2008, J. Signal Process. Syst..

[14]  Edward A. Lee,et al.  A global criticality/local phase driven algorithm for the constrained hardware/software partitioning problem , 1994, CODES.

[15]  M. Engels,et al.  Grape-II: A System-Level Prototyping Environment for DSP Applications , 1995, Computer.