The Chameleon Architecture for Streaming DSP Applications

We focus on architectures for streaming DSP applications such as wireless baseband processing and image processing. We aim at a single generic architecture that is capable of dealing with different DSP applications. This architecture has to be energy efficient and fault tolerant. We introduce a heterogeneous tiled architecture and present the details of a domain-specific reconfigurable tile processor called Montium. This reconfigurable processor has a small footprint (1.8 mm2 in a 130 nm process), is power efficient and exploits the locality of reference principle. Reconfiguring the device is very fast, for example, loading the coefficients for a 200 tap FIR filter is done within 80 clock cycles. The tiles on the tiled architecture are connected to a Network-on-Chip (NoC) via a network interface (NI). Two NoCs have been developed: a packet-switched and a circuit-switched version. Both provide two types of services: guaranteed throughput (GT) and best effort (BE). For both NoCs estimates of power consumption are presented. The NI synchronizes data transfers, configures and starts/stops the tile processor. For dynamically mapping applications onto the tiled architecture, we introduce a run-time mapping tool.

[1]  M. Latva-aho,et al.  Reconfigurable adaptive RAKE receiver for wideband CDMA systems , 1998, VTC '98. 48th IEEE Vehicular Technology Conference. Pathway to Global Wireless Revolution (Cat. No.98CH36151).

[2]  Omar Mansour,et al.  High level synthesis for non-manifest digital signal processing applications , 2006 .

[3]  G.J.M. Smit,et al.  Run-time Mapping of Applications to a Heterogeneous SoC , 2005, 2005 International Symposium on System-on-Chip.

[4]  Eberhard Schüler,et al.  Overview of the 4 S Project , 2005 .

[5]  P. M. Heysters Coarse-Grained Reconfigurable Processors - Flexibility meets Efficiency , 2004 .

[6]  Gerard J. M. Smit,et al.  A virtual channel network-on-chip for GT and BE traffic , 2006, IEEE Computer Society Annual Symposium on Emerging VLSI Technologies and Architectures (ISVLSI'06).

[7]  Dake Liu,et al.  SoCBUS: switched network on chip for hard real time embedded systems , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[8]  Paul J.M. Havinga,et al.  Max-Log-MAP Mapping on an FPFA , 2002 .

[9]  Ramjee Prasad,et al.  Wideband CDMA For Third Generation Mobile Communications: Universal Personal Communications , 1998 .

[10]  Ramjee Prasad,et al.  Wideband CDMA for third generation mobile communications , 1998 .

[11]  Gerard J. M. Smit,et al.  Hydra: An Energy-efficient and Reconfigurable Network Interface , 2006, ERSA.

[12]  Kees Goossens,et al.  Concepts and Implementation of the Philips Network-on-Chip , 2003 .

[13]  Heiko Kalte,et al.  Task placement for heterogeneous reconfigurable architectures , 2005, Proceedings. 2005 IEEE International Conference on Field-Programmable Technology, 2005..

[14]  Katherine Compton,et al.  An execution environment for reconfigurable computing , 2005, 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM'05).

[15]  Gerard J. M. Smit,et al.  Overview of the 4S Project , 2005, 2005 International Symposium on System-on-Chip.

[16]  Rajeev Motwani,et al.  The load rebalancing problem , 2006, J. Algorithms.

[17]  William J. Dally,et al.  Stream Processors: Progammability and Efficiency , 2004, ACM Queue.

[18]  Kang G. Shin,et al.  Support for Multiple Classes of Traffic in Multicomputer Routers , 1994, PCRCW.

[19]  Heather M. Quinn,et al.  Dynamo: A Runtime Partitioning System , 2004, ERSA.

[20]  Kees G. W. Goossens,et al.  Networks on silicon: combining best-effort and guaranteed services , 2002, Proceedings 2002 Design, Automation and Test in Europe Conference and Exhibition.

[21]  Gerard J. M. Smit,et al.  Energy efficient NoC for best effort communication , 2005, International Conference on Field Programmable Logic and Applications, 2005..

[22]  Gerard J. M. Smit,et al.  An energy-efficient reconfigurable circuit-switched network-on-chip , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[23]  Jan M. Rabaey,et al.  Interconnect architecture exploration for low-energy reconfigurable single-chip DSPs , 1999, Proceedings. IEEE Computer Society Workshop on VLSI '99. System Design: Towards System-on-a-Chip Paradigm.

[24]  Yuanqing Guo,et al.  Mapping Applications to a Coarse-Grained Reconfigurable Architecture , 2006 .

[25]  P.K. Lala,et al.  An on-line reconfigurable FPGA architecture , 2000, Proceedings IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems.

[26]  Russell Tessier,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. Reconfigurable Computing for Digital Signal Processing: A Survey ∗ , 1999 .

[27]  Gerard J. M. Smit,et al.  Mapping of DSP algorithms on the MONTIUM architecture , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[28]  Ivo Bolsens Challenges and Opportunities for FPGA Platforms , 2002, FPL.

[29]  Markus Weinhardt,et al.  PACT XPP—A Self-Reconfigurable Data Processing Architecture , 2004, The Journal of Supercomputing.