Fault Tolerance Analysis and Self-Healing Strategy of Autonomous, Evolvable Hardware Systems

This paper presents an analysis of the fault tolerance achieved by an autonomous, fully embedded evolvable hardware system, which uses a combination of partial dynamic reconfiguration and an evolutionary algorithm (EA). It demonstrates that the system may self-recover from both transient and cumulative permanent faults. This self-adaptive system, based on a 2D array of 16 (4×4) Processing Elements (PEs), is tested with an image filtering application. Results show that it may properly recover from faults in up to 3 PEs, that is, more than 18% cumulative permanent faults. Two fault models are used for testing purposes, at PE and CLB levels. Two self-healing strategies are also introduced, depending on whether fault diagnosis is available or not. They are based on scrubbing, fitness evaluation, dynamic partial reconfiguration and in-system evolutionary adaptation. Since most of these adaptability features are already available on the system for its normal operation, resource cost for self-healing is very low (only some code additions in the internal microprocessor core).

[1]  Vu Duong,et al.  Experimental results in evolutionary fault-recovery for field programmable analog devices , 2003, NASA/DoD Conference on Evolvable Hardware, 2003. Proceedings..

[2]  Andrew M. Tyrrell,et al.  Evolutionary strategies and intrinsic fault tolerance , 2001, Proceedings Third NASA/DoD Workshop on Evolvable Hardware. EH-2001.

[3]  Eduardo de la Torre,et al.  A fast Reconfigurable 2D HW core architecture on FPGAs for evolvable Self-Adaptive Systems , 2011, 2011 NASA/ESA Conference on Adaptive Hardware and Systems (AHS).

[4]  Carthik A. Sharma,et al.  Expediting GA-Based Evolution Using Group Testing Techniques for Reconfigurable Hardware , 2006, 2006 IEEE International Conference on Reconfigurable Computing and FPGA's (ReConFig 2006).

[5]  Adrian Stoica,et al.  Self-Reconfigurable Mixed-Signal Integrated Circuits Architecture Comprising a Field Programmable Analog Array and a General Purpose Genetic Algorithm IP Core , 2008, ICES.

[6]  Jooheung Lee,et al.  Dynamic Partial Reconfiguration Approach to the Design of Sustainable Edge Detectors , 2010, ERSA.

[7]  Vu Duong,et al.  Circuit self-recovery experiments in extreme environments , 2004, Proceedings. 2004 NASA/DoD Conference on Evolvable Hardware, 2004..

[8]  Ronald F. DeMara,et al.  Layered Approach to Intrinsic Evolvable Hardware using Direct Bitstream Manipulation of Virtex II Pro Devices , 2007, 2007 International Conference on Field Programmable Logic and Applications.

[9]  Miguel Garvie Reliable electronics through artificial evolution , 2005 .

[10]  Ronald F. DeMara,et al.  Autonomous FPGA fault handling through competitive runtime reconfiguration , 2005, 2005 NASA/DoD Conference on Evolvable Hardware (EH'05).

[11]  Lukás Sekanina Virtual Reconfigurable Circuits for Real-World Applications of Evolvable Hardware , 2003, ICES.

[12]  Ronald F. DeMara,et al.  Evolutionary fault recovery in a Virtex FPGA using a representation that incorporates routing , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[13]  Adrian Thompson Evolving fault tolerant systems , 1995 .

[14]  Matthew Parris,et al.  Progress in autonomous fault recovery of field programmable gate arrays , 2011, CSUR.

[15]  Sung-Bae Cho,et al.  Evolutionary Image Enhancement for Impulsive Noise Reduction , 2006, ICIC.

[16]  Sverre Vigander Evolutionary Fault Repair of Electronics in Space Applications , 2001 .

[17]  Eduardo de la Torre,et al.  Evolvable 2D computing matrix model for intrinsic evolution in commercial FPGAs with native reconfiguration support , 2011, 2011 NASA/ESA Conference on Adaptive Hardware and Systems (AHS).

[18]  Lukás Sekanina,et al.  Evolutionary functional recovery in virtual reconfigurable circuits , 2007, JETC.

[19]  Adrian Stoica,et al.  Fault-tolerant evolvable hardware using field-programmable transistor arrays , 2000, IEEE Trans. Reliab..

[20]  D. Du,et al.  Combinatorial Group Testing and Its Applications , 1993 .