Adaptive single-event effect mitigation for dependable processing systems based on FPGAs

Abstract: For FPGA-based applications in harsh radiation environments, designers apply mitigation techniques according to the worst-case (solar) condition to achieve a dependable design. This results in a resource overhead that is unnecessary most of the time. To overcome this problem, an adaptive mitigation technique can be used that trades off two parameters, such as performance and reliability. We propose an Adaptive Single-Event Effect Mitigation (ASEEM) method, based on FPGA reconfiguration between a performance mode and a reliability mode. The performance mode offers high processing power, and the reliability mode offers high dependability. We evaluate ASEEM using particle flux data from 2010 until 2017 for one space-grade and two commercial-grade FPGAs. Based on the radiation data, we calculate upset rates, availability, performance, and performability. We optimize the performability, which is the benchmark parameter, as a function of the Mean Time to Repair (MTTR) and the fraction of time configured in the performance mode. We conclude that, for the space-grade FPGA, the performability of ideal ASEEM (without implementation losses) reaches its maximum at an MTTR of 19.8 h and 91% of the time in the performance mode, an improvement of 132% over the compared static modes (performance and reliability). ASEEM is also implemented as an FPGA design to determine the implementation impact.
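
The chain from upset rates to availability and performability can be illustrated with a minimal sketch. It is not the model used in the paper. It assumes a steady-state availability of MTTF / (MTTF + MTTR) with MTTF = 1 / upset rate, and it assumes performability is performance multiplied by availability. The `Mode` class, the flux scaling factor, the threshold policy, and all numbers are hypothetical placeholders chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass(frozen=True)
class Mode:
    """One FPGA configuration. All values are hypothetical placeholders."""
    performance: float            # relative processing throughput
    base_upset_rate_per_h: float  # failure-causing upsets per hour at nominal flux


def availability(upset_rate_per_h: float, mttr_h: float) -> float:
    """Assumed steady-state availability: MTTF / (MTTF + MTTR), MTTF = 1 / rate."""
    if upset_rate_per_h < 0 or mttr_h < 0:
        raise ValueError("rate and MTTR must be non-negative")
    if upset_rate_per_h == 0.0:
        return 1.0
    mttf_h = 1.0 / upset_rate_per_h
    return mttf_h / (mttf_h + mttr_h)


def mean_performability(
    intervals: Sequence[Tuple[float, float]],
    choose_mode: Callable[[float], Mode],
    mttr_h: float,
) -> float:
    """Time-weighted mean of performance * availability (assumed definition).

    Each interval is (duration_h, flux_scale), where flux_scale multiplies the
    base upset rate of whichever mode choose_mode selects for that interval.
    """
    total_h = sum(duration_h for duration_h, _ in intervals)
    if total_h <= 0:
        raise ValueError("intervals must cover a positive duration")
    weighted = 0.0
    for duration_h, flux_scale in intervals:
        mode = choose_mode(flux_scale)
        rate = mode.base_upset_rate_per_h * flux_scale
        weighted += duration_h * mode.performance * availability(rate, mttr_h)
    return weighted / total_h


# Hypothetical modes and environment: 90 h quiet, 10 h of strongly elevated flux.
PERF_MODE = Mode(performance=3.0, base_upset_rate_per_h=1e-2)
REL_MODE = Mode(performance=1.0, base_upset_rate_per_h=1e-4)
INTERVALS = [(90.0, 1.0), (10.0, 1000.0)]
MTTR_H = 1.0

static_perf = mean_performability(INTERVALS, lambda flux: PERF_MODE, MTTR_H)
static_rel = mean_performability(INTERVALS, lambda flux: REL_MODE, MTTR_H)
# Adaptive policy (assumed): switch to the reliability mode above a flux threshold.
adaptive = mean_performability(
    INTERVALS, lambda flux: PERF_MODE if flux < 10.0 else REL_MODE, MTTR_H
)

if __name__ == "__main__":
    print(f"static performance mode: {static_perf:.3f}")
    print(f"static reliability mode: {static_rel:.3f}")
    print(f"adaptive switching:      {adaptive:.3f}")
```

In this toy setting the adaptive policy can exceed both static modes only because the upset rate varies with the environment. With a constant rate, the time-weighted mix would never beat the better static mode.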
