Efficient Runtime Management of Reconfigurable Hardware Resources

Runtime reconfigurable systems built upon devices with partial reconfiguration can provide reduction in overall hardware area, power efficiency, and economic cost in addition to the performance improvements due to better customization. However, the users of such systems have to be able to afford some additional costs compared to hardwired application specific circuits. More precisely reconfigurable devices have higher power consumption, occupy larger silicon area and operate at lower speeds. Higher power consumption requires additional packaging cost, shortens chip lifetimes, requires expensive cooling systems, decreases system reliability and prohibits battery operation. The less efficient usage of silicon real estate is usually compensated by the runtime hardware reconfiguration and functional units relocation. The available configuration data paths, however, have limited bandwidth that introduces overheads that may eclipse the dynamic reconfiguration benefits. In this dissertation, we address three major problems related to hardware resources runtime management: efficient online hardware task scheduling and placement, power consumption reduction and reconfiguration overhead minimization. Since hardware tasks are allocated and deallocated dynamically at runtime, the reconfigurable fabric can suffer of fragmentation. This can lead to the undesirable situation that tasks cannot be allocated even if there would be sufficient free area available. As a result, the overall system performance is degraded. Therefore, efficient hardware management of resources is very important. To manage hardware resources efficiently, we propose novel online hardware task scheduling and placement algorithms on partially reconfigurable devices with higher quality and faster execution compared to related proposals. To cope with the high power consumption in field programmable devices, we propose a novel logic element with lower power consumption compared to current approaches. To reduce runtime overhead, we augment the FPGA configuration circuit architecture and allow faster reconfiguration and relocation compared to current reconfigurable devices.

[1]  Morteza Saheb Zamani,et al.  Reducing reconfiguration time of reconfigurable computing systems in integrated temporal partitioning and physical design framework , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[2]  Cid C. de Souza,et al.  Efficient datapath merging for partially reconfigurable architectures , 2005, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[3]  Deming Chen,et al.  Low-power high-level synthesis for FPGA architectures , 2003, Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03..

[4]  Martin Middendorf,et al.  Hyperreconfigurable architectures for fast run time reconfiguration , 2004, 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines.

[5]  Diederik Verkest,et al.  Run-Time Minimization of Reconfiguration Overhead in Dynamically Reconfigurable Systems , 2003, FPL.

[6]  Kiyoung Choi,et al.  Hardware-software cosynthesis for run-time incrementally reconfigurable FPGAs , 2000, Proceedings 2000. Design Automation Conference. (IEEE Cat. No.00CH37106).

[7]  Scott Hauck,et al.  Configuration prefetch for single context reconfigurable coprocessors , 1998, FPGA '98.

[8]  Chau-Shen Chen,et al.  Low Power Fpga Design - A Re-engineering Approach , 1997, Proceedings of the 34th Design Automation Conference.

[9]  Viktor K. Prasanna,et al.  Configuration compression for FPGA-based embedded systems , 2005, IEEE Trans. Very Large Scale Integr. Syst..

[10]  Heiko Kalte,et al.  REPLICA2Pro: task relocation by bitstream manipulation in virtex-II/Pro FPGAs , 2006, CF '06.

[11]  Seda Ogrenci Memik,et al.  A low power FPGA routing architecture , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[12]  Tulika Mitra,et al.  Configuration bitstream compression for dynamically reconfigurable FPGAs , 2004, IEEE/ACM International Conference on Computer Aided Design, 2004. ICCAD-2004..

[13]  André DeHon,et al.  Dynamically Programmable Gate Arrays: A Step Toward Increased Computational Density , 1996 .

[14]  Kurt Keutzer,et al.  Automatic Replacement of Flip-Flops by Latches in ASICs , 2004 .

[15]  Viktor K. Prasanna,et al.  Loop Pipelining and Optimization for Run Time Reconfiguration , 2000, IPDPS Workshops.

[16]  Scott Hauck,et al.  Runlength compression techniques for FPGA configurations , 1999, Seventh Annual IEEE Symposium on Field-Programmable Custom Computing Machines (Cat. No.PR00375).

[17]  Jason Cong,et al.  Architecture evaluation for power-efficient FPGAs , 2003, FPGA '03.

[18]  Gustavo Sutter,et al.  FSM Decomposition for Low Power in FPGA , 2002, FPL.

[19]  Oliver Diessel,et al.  A configuration memory architecture for fast run-time reconfiguration of FPGAs , 2005, International Conference on Field Programmable Logic and Applications, 2005..

[20]  Russell Tessier,et al.  Power-aware FPGA logic synthesis using binary decision diagrams , 2007, FPGA '07.

[21]  Oliver Diessel,et al.  ACS: An Addressless Configuration Support for efficient partial reconfigurations , 2008, 2008 International Conference on Field-Programmable Technology.

[22]  Zhiyuan Li,et al.  Configuration compression for the Xilinx XC6200 FPGA , 1998, Proceedings. IEEE Symposium on FPGAs for Custom Computing Machines (Cat. No.98TB100251).

[23]  Roman L. Lysecky,et al.  Configuration Locking and Schedulability Estimation for Reduced Reconfiguration Overheads of Reconfigurable Systems , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[24]  Fei Li,et al.  Vdd programmability to reduce FPGA interconnect power , 2004, ICCAD 2004.

[25]  Oliver Diessel,et al.  On the placement and granularity of FPGA configurations , 2004, Proceedings. 2004 IEEE International Conference on Field- Programmable Technology (IEEE Cat. No.04EX921).

[26]  Massoud Pedram,et al.  Low-power sequential circuit design using T flip-flops , 2001 .

[27]  Zhiyuan Li,et al.  Configuration Compression for Virtex FPGAs , 2001, The 9th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM'01).

[28]  Gustavo Sutter,et al.  Low-Power FSMs in FPGA: Encoding Alternatives , 2002, PATMOS.

[29]  Sorin Cotofana,et al.  Bitstream compression techniques for Virtex 4 FPGAs , 2008, 2008 International Conference on Field Programmable Logic and Applications.

[30]  Malgorzata Marek-Sadowska,et al.  Efficient circuit clustering for area and power reduction in FPGAs , 2002, TODE.

[31]  Yan Lin,et al.  Dual-Vdd Interconnect With Chip-Level Time Slack Allocation for FPGA Power Reduction , 2006, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[32]  Reiner W. Hartenstein Coarse grain reconfigurable architectures , 2001, Proceedings of the ASP-DAC 2001. Asia and South Pacific Design Automation Conference 2001 (Cat. No.01EX455).

[33]  Michael J. Alexander Power optimization for FPGA look-up tables , 1997, ISPD '97.

[34]  Eduardo de la Torre,et al.  Straight method for reallocation of complex cores by dynamic reconfiguration in Virtex II FPGAs , 2005, 16th IEEE International Workshop on Rapid System Prototyping (RSP'05).

[35]  Andrew A. Kennings,et al.  A technique for minimizing power during FPGA placement , 2008, 2008 International Conference on Field Programmable Logic and Applications.

[36]  Steven J. E. Wilton,et al.  Clock-Aware Placement for FPGAs , 2007, 2007 International Conference on Field Programmable Logic and Applications.

[37]  Ali Akoglu,et al.  Partial Bitstream 2-D Core Relocation for Reconfigurable Architectures , 2009, 2009 NASA/ESA Conference on Adaptive Hardware and Systems.

[38]  Hua Wang,et al.  Variable tapered pareto buffer design and implementation allowing run-time configuration for low-power embedded SRAMs , 2005, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[39]  Azadeh Davoodi,et al.  FPGA Dynamic Power Minimization through Placement and Routing Constraints , 2006, EURASIP J. Embed. Syst..

[40]  Axel Jantsch,et al.  Run-time Partial Reconfiguration speed investigation and architectural design space exploration , 2009, 2009 International Conference on Field Programmable Logic and Applications.

[41]  Srinivas Katkoori,et al.  Efficient LUT-based FPGA technology mapping for power minimization , 2003, ASP-DAC '03.

[42]  Herman Schmit Incremental reconfiguration for pipelined applications , 1997, Proceedings. The 5th Annual IEEE Symposium on Field-Programmable Custom Computing Machines Cat. No.97TB100186).

[43]  Yohei Matsumoto,et al.  Low‐power FPGA using partially low swing routing architecture , 2005 .

[44]  Mahmut T. Kandemir,et al.  Design of power-aware FPGA fabrics , 2007, Int. J. Embed. Syst..

[45]  Farid N. Najm,et al.  Look-up table leakage reduction for FPGAs , 2005, Proceedings of the IEEE 2005 Custom Integrated Circuits Conference, 2005..

[46]  Ying Wang,et al.  A New Placement Approach to Minimizing FPGA Reconfiguration Data , 2008, 2008 International Conference on Embedded Software and Systems.

[47]  Christos A. Papachristou,et al.  High-level low power FPGA design methodology , 2000, Proceedings of the IEEE 2000 National Aerospace and Electronics Conference. NAECON 2000. Engineering Tomorrow (Cat. No.00CH37093).

[48]  Stamatis Vassiliadis,et al.  Compiler-driven FPGA-area allocation for reconfigurable computing , 2006, Proceedings of the Design Automation & Test in Europe Conference.

[49]  Yao-Wen Chang,et al.  Post-placement leakage optimization for partially dynamically reconfigurable FPGAs , 2007, Proceedings of the 2007 international symposium on Low power electronics and design (ISLPED '07).

[50]  Jenny Yi-Chun Kuo,et al.  A Novel Network Architecture Support for Fast Reconfiguration , 2007, 2007 International Conference on Field-Programmable Technology.

[51]  Qiang Wang,et al.  CAD Techniques for Power Optimization in Virtex-5 FPGAs , 2007, 2007 IEEE Custom Integrated Circuits Conference.

[52]  Oliver Diessel,et al.  Module Graph Merging and Placement to Reduce Reconfiguration Overheads in Paged FPGA Devices , 2007, 2007 International Conference on Field Programmable Logic and Applications.

[53]  Jason Helge Anderson,et al.  Power-aware technology mapping for LUT-based FPGAs , 2002, 2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings..

[54]  Majid Sarrafzadeh,et al.  FPGA Technology Mapping for Power Minimization , 1994, FPL.

[55]  Zhiyuan Li,et al.  Configuration Caching Techniques for FPGA , 2000 .

[56]  Robert W. Brodersen,et al.  Analysis and design of low-energy flip-flops , 2001, ISLPED '01.

[57]  Wayne Luk,et al.  Pipeline morphing and virtual pipelines , 1997, FPL.

[58]  Yu Hu,et al.  Physical synthesis for FPGA interconnect power reduction by dual-Vdd budgeting and retiming , 2008, TODE.

[59]  Zhiyuan Li,et al.  Don't Care discovery for FPGA configuration compression , 1999, FPGA '99.

[60]  Chirag Ravishankar,et al.  FPGA power reduction by guarded evaluation , 2010, FPGA '10.

[61]  Neil W. Bergmann,et al.  QUKU: a two-level reconfigurable architecture , 2006, IEEE Computer Society Annual Symposium on Emerging VLSI Technologies and Architectures (ISVLSI'06).

[62]  Mohamed I. Elmasry,et al.  Input Vector Reordering for Leakage Power Reduction in FPGAs , 2008, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[63]  Ronald F. DeMara,et al.  A Physical Resource Management Approach to Minimizing FPGA Partial Reconfiguration Overhead , 2006, 2006 IEEE International Conference on Reconfigurable Computing and FPGA's (ReConFig 2006).

[64]  Wayne Luk,et al.  Enhancing Relocatability of Partial Bitstreams for Run-Time Reconfiguration , 2007, 15th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM 2007).

[65]  Wayne Luk,et al.  Automating production of run-time reconfigurable designs , 1998, Proceedings. IEEE Symposium on FPGAs for Custom Computing Machines (Cat. No.98TB100251).

[66]  Youn-Long Lin,et al.  Storage optimization by replacing some flip-flops with latches , 1996, Proceedings EURO-DAC '96. European Design Automation Conference with EURO-VHDL '96 and Exhibition.

[67]  Jason Cong,et al.  Delay optimal low-power circuit clustering for FPGAs with dual supply voltages , 2004, Proceedings of the 2004 International Symposium on Low Power Electronics and Design (IEEE Cat. No.04TH8758).

[68]  Francky Catthoor,et al.  A configuration memory hierarchy for fast reconfiguration with reduced energy consumption overhead , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[69]  George Varghese,et al.  The design of a low energy FPGA , 1999, Proceedings. 1999 International Symposium on Low Power Electronics and Design (Cat. No.99TH8477).

[70]  Balakrishna Kumthekar,et al.  Power optimization of FPGA-based designs without rewiring , 2000 .

[71]  Saibal Mukhopadhyay,et al.  Leakage current mechanisms and leakage reduction techniques in deep-submicrometer CMOS circuits , 2003, Proc. IEEE.

[72]  Luca Benini,et al.  Minimization of the reconfiguration latency for the mapping of applications on FPGA-based systems , 2009, CODES+ISSS '09.

[73]  Ulrich Rückert,et al.  Study on column wise design compaction for reconfigurable systems , 2004, Proceedings. 2004 IEEE International Conference on Field- Programmable Technology (IEEE Cat. No.04EX921).

[74]  Yan Meng,et al.  Leakage power reduction of embedded memories on FPGAs through location assignment , 2006, 2006 43rd ACM/IEEE Design Automation Conference.

[75]  Fei Li,et al.  FPGA power reduction using configurable dual-Vdd , 2004, Proceedings. 41st Design Automation Conference, 2004..

[76]  D. Howland,et al.  RTL dynamic power optimization for FPGAs , 2008, 2008 51st Midwest Symposium on Circuits and Systems.

[77]  Steven J. E. Wilton,et al.  Detailed routing architectures for embedded programmable logic IP cores , 2001, FPGA '01.

[78]  Majid Sarrafzadeh,et al.  An optimal algorithm for minimizing run-time reconfiguration delay , 2004, TECS.