Adjustable self-healing methodology for accelerated functions in heterogeneous systems

Self-healing is a promising approach for designing reliable digital systems. It refers to the ability of a system to detect faults and automatically fixing them to avoid total failure. With the development of digital systems, heterogeneous systems, in which some parts of the system are executed on the programmable logic, and some other parts run on the processing elements (CPU), are becoming more prevalent. In this work, we propose an adjustable self-healing method that is applicable to heterogeneous systems with accelerated functions and enables the designers to add the self-healing feature to the design. In this method, by manipulating the software codes that are being executed on the processing element, we add the ability to verify the accelerated functions on the programmable logic and heal the possible failures to the system. This is done not only in a straightforward manner but also without being forced to choose a specific reliability-overhead point. The designer will have the option to select the optimum configuration for a desired reliability level. Experimental results on a large design including several accelerated functions are provided and show 42% improvement of reliability by having 27% overhead, as an example of the reliability-overhead point.

[1]  Heinrich Theodor Vierhaus,et al.  On the feasibility of combining on-line-test and self repair for logic circuits , 2013, 2013 IEEE 16th International Symposium on Design and Diagnostics of Electronic Circuits & Systems (DDECS).

[2]  Onn Shehory,et al.  PANACEA Towards a Self-healing Development Framework , 2007, 2007 10th IFIP/IEEE International Symposium on Integrated Network Management.

[3]  Nenavath Srinivas Naik,et al.  Self-Healing model for software application , 2014, International Conference on Recent Advances and Innovations in Engineering (ICRAIE-2014).

[4]  Kunle Olukotun,et al.  Hardware/software co-design for high performance computing: Challenges and opportunities , 2010, 2010 IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS).

[5]  Magdy Bayoumi,et al.  A novel approach towards less area overhead self-healing hardware systems , 2017, 2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS).

[6]  Zhisheng Qian,et al.  Characteristic analysis and modeling for the near field radiation of FPGA , 2017, 2017 IEEE 5th International Symposium on Electromagnetic Compatibility (EMC-Beijing).

[7]  Pauline C. Haddow,et al.  Evolvable Hardware Challenges: Past, Present and the Path to a Promising Future , 2018 .

[8]  Patrick Schaumont,et al.  The Impact of Aging on an FPGA-Based Physical Unclonable Function , 2011, 2011 21st International Conference on Field Programmable Logic and Applications.

[9]  Geeta Sikka,et al.  Exploration in adaptiveness to achieve automated fault recovery in self-healing software systems: A review , 2019, International Journal of Intelligent Decision Technologies.

[10]  Narayanan Vijaykrishnan,et al.  FLAW: FPGA lifetime awareness , 2006, 2006 43rd ACM/IEEE Design Automation Conference.

[11]  Debanjan Ghosh,et al.  Self-healing systems - survey and synthesis , 2007, Decis. Support Syst..

[12]  Onn Shehory,et al.  SHADOWS: Self-healing complex software systems , 2008, 2008 23rd IEEE/ACM International Conference on Automated Software Engineering - Workshops.

[13]  Holger Giese,et al.  Evaluation of Self-Healing Systems: An Analysis of the State-of-the-Art and Required Improvements , 2020, Comput..

[14]  Kasem Khalil,et al.  Self-healing hardware systems: A review , 2019, Microelectron. J..

[15]  Magdy A. Bayoumi,et al.  On on-chip intelligence paradigms , 2017, 2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE).

[16]  Paul Pop,et al.  Application-aware optimization of redundant resources for the reconfigurable self-healing eDNA hardware architecture , 2011, 2011 NASA/ESA Conference on Adaptive Hardware and Systems (AHS).

[17]  Vignesh Thangavel Cascaded Digital Refinement for Intrinsic Evolvable Hardware , 2015 .

[18]  Zhou,et al.  Fault Tolerant Reconfigurable System with Dual-Module Redundancy and Dynamic Reconfiguration , 2011 .

[19]  Bradley R. Schmerl,et al.  Model-based adaptation for self-healing systems , 2002, WOSS '02.

[20]  Hiroyuki Tomiyama,et al.  CHStone: A benchmark program suite for practical C-based high-level synthesis , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[21]  Wilfried Elmenreich,et al.  SEAHORSE: Generalizing an artificial hormone system algorithm to a middleware for search and delivery of information units , 2015, Comput. Networks.

[22]  Thomas Vogel,et al.  Improving Scalability and Reward of Utility-Driven Self-Healing for Large Dynamic Architectures , 2020, ACM Trans. Auton. Adapt. Syst..

[23]  Lauren M. Huyett,et al.  Closed-Loop Artificial Pancreas Systems: Engineering the Algorithms , 2014, Diabetes Care.

[24]  Peter J. Fleming,et al.  Genetic Algorithms for Multiobjective Optimization: FormulationDiscussion and Generalization , 1993, ICGA.

[25]  Ashutosh Tiwari,et al.  Evolvable Embryonics: 2-in-1 Approach to Self-healing Systems , 2013 .