Adaptive single-event effect mitigation for dependable processing systems

For application in radiation-harsh environments, designers apply mitigation techniques according the worst-case (solar) condition to achieve a dependable design. This results in a resource overhead, which is most of the time unnecessary. To overcome this problem, adaptive mitigation techniques are used. This technique is a trade-off between two parameters, such as performance and reliability, according to different operating modes by toggling between these modes. In this context, we propose an Adaptive Single-Event Effect Mitigation (ASEEM) method. It is based on adaptive reconfiguration of an FPGA between two modes, specifically a performance mode and a high reliability mode. The performance mode offers high processing power and thus higher signal processing throughput. We evaluate ASEEM by calculating results with particle data from 2010 until 2016 for one space-grade and two commercial-grade FPGAs. Based on radiation data, we calculate upset rates, availability, performance and performability. We discuss one realization of ASEEM in detail with fixed upset rates. The examples presented in this paper show a reduction of the upset rate form a sixth to a ninth (compared with the performance mode) and the availability of the high processing power over 90 % in the considered time interval. We conclude that the investigated ASEEM realization is optimal for moderate and long mean times to repair. In a processing case study, with a fixed mean time to repair of one hour, we obtain a performability improvement of 14% and an availability improvement of 21 % over the performance mode for an FPGA using the latest semiconductor technology.

[1]  Michael J. Wirthlin,et al.  On-Orbit Flight Results from the Reconfigurable Cibola Flight Experiment Satellite (CFESat) , 2009, 2009 17th IEEE Symposium on Field Programmable Custom Computing Machines.

[2]  M. Shea,et al.  CREME96: A Revision of the Cosmic Ray Effects on Micro-Electronics Code , 1997 .

[3]  Harry C. Koons,et al.  THE IMPACT OF THE SPACE ENVIRONMENT ON SPACE SYSTEMS , 1999 .

[4]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.

[5]  R Turner,et al.  Solar Particle Events from a risk management perspective. , 2000, IEEE transactions on plasma science. IEEE Nuclear and Plasma Sciences Society.

[6]  J. Barth,et al.  Space, atmospheric, and terrestrial radiation environments , 2003 .

[7]  Wenhai Li,et al.  A Self-Adaptive SEU Mitigation System for FPGAs with an Internal Block RAM Radiation Particle Sensor , 2013, FCCM 2013.

[8]  Florian Rittner,et al.  Detection of solar particle events inside FPGAs , 2016, 2016 16th European Conference on Radiation and Its Effects on Components and Systems (RADECS).

[9]  Michael J. Wirthlin,et al.  Predicting On-Orbit Static Single Event Upset Rates in Xilinx Virtex FPGAs , 2006 .

[10]  Chiara Sandionigi,et al.  A Novel Design Methodology for Implementing Reliability-Aware Systems on SRAM-Based FPGAs , 2011, IEEE Transactions on Computers.

[11]  Elena Dubrova,et al.  Fault-Tolerant Design , 2013 .

[12]  Jürgen Teich,et al.  Reliability of space-grade vs. COTS SRAM-based FPGA in N-modular redundancy , 2015, 2015 NASA/ESA Conference on Adaptive Hardware and Systems (AHS).

[13]  Alan D. George,et al.  Reconfigurable Fault Tolerance: A Comprehensive Framework for Reliable and Adaptive FPGA-Based Space Computing , 2012, TRETS.

[14]  John F. Meyer,et al.  Closed-Form Solutions of Performability , 1982, IEEE Transactions on Computers.