In-Place FPGA Retiming for Mitigation of Variational Single-Event Transient Faults

For anti-fuse or flash-memory-based field-programmable gate arrays (FPGAs), single-event transient (SET)-induced faults are significantly more pronounced than single-event upsets (SEUs). While most existing work studies SEUs, this paper proposes a retiming algorithm for mitigating variational SETs (i.e., SETs with different durations and strengths). Because an SET pulse is reshaped by broadening and attenuation as it propagates, SET-aware retiming (SaR) redistributes combinational paths via post-layout retiming to minimize the probability that an SET pulse is latched. The SaR problem is formulated as an integer linear programming (ILP) problem and solved efficiently by a progressive ILP approach. In contrast to existing SET-mitigation techniques, the proposed SaR changes neither the FPGA architecture nor the layout of an FPGA application. Instead, it reconfigures the connection between a flip-flop and a LUT within a programmable logic block. Experimental results on ISCAS89 benchmark circuits show that, with a 10-min runtime limit, SaR increases mean-time-to-failure (MTTF) by 78% for variational SETs while preserving the clock frequency. To the best of our knowledge, this paper is the first in-depth study on FPGA retiming for SET mitigation.
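As background for the retiming step, the sketch below illustrates the classical retiming rule of Leiserson and Saxe, in which each node v receives an integer label r(v) and an edge u -> v carrying w flip-flops ends up with w + r(v) - r(u) flip-flops. It is not the SaR formulation itself: the abstract does not give the ILP objective or constraints, so the names `retime` and `is_legal` and the three-node example circuit are assumptions made purely for illustration.

```python
from typing import Dict, List, Tuple

# (source node, sink node, number of flip-flops on the edge)
Edge = Tuple[str, str, int]


def retime(edges: List[Edge], r: Dict[str, int]) -> List[Edge]:
    """Apply retiming labels: w_r(u, v) = w(u, v) + r[v] - r[u].

    Nodes missing from r are treated as having label 0.
    """
    return [(u, v, w + r.get(v, 0) - r.get(u, 0)) for u, v, w in edges]


def is_legal(edges: List[Edge]) -> bool:
    """A retiming is legal only if no edge has a negative flip-flop count."""
    return all(w >= 0 for _, _, w in edges)


# Hypothetical three-node ring a -> b -> c -> a with two flip-flops.
edges = [("a", "b", 1), ("b", "c", 0), ("c", "a", 1)]

# r[b] = -1 moves one flip-flop from the input of b to its output.
moved = retime(edges, {"b": -1})

# r[b] = +1 would need a flip-flop on b -> c that does not exist.
illegal = retime(edges, {"b": 1})
```

Here `moved` relocates the flip-flop across node b while the total flip-flop count on the cycle stays at two, which is why retiming preserves circuit function. An ILP-based method such as the one the abstract describes would search over the labels r under legality and timing constraints; the objective it optimizes is not reproduced here.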
