Symbolic Reliability Analysis of Self-healing Networked Embedded Systems

In recent years, several network online algorithms have been studied that exhibit self-x properties such as self-healing or self-adaption. These properties are used to improve systems characteristics like, e.g., fault-tolerance, reliability, or load-balancing. In this paper, a symbolic reliability analysis of self-healing networked embedded systems that rely on self-reconfiguration and self-routing is presented. The proposed analysis technique respects resource constraints such as the maximum computational load or the maximum memory size, and calculates the achievable reliability of a given system. This analytical approach considers the topology of the system, the properties of the resources, and the executed applications. Moreover, it is independent of the used online algorithms that implement the self-healing properties, but determines the achievable upper bound for the systems reliability. Since this analysis is not tailored to a specific online algorithm, it allows a reasonable decision making on the used algorithm by enabling a rating of different self-healing strategies. Experimental results show the effectiveness of the introduced technique even for large networked embedded systems.

[1]  B. Heile,et al.  Self-organizing, self-healing wireless networks , 2000, 2000 IEEE Aerospace Conference. Proceedings (Cat. No.00TH8484).

[2]  H. C. Cankaya,et al.  Reliability and availability evaluation of self-healing SONET mesh networks , 1997, GLOBECOM 97. IEEE Global Telecommunications Conference. Conference Record.

[3]  Martin Lukasiewycz,et al.  Reliability-Aware System Synthesis , 2007 .

[4]  Antoine Rauzy,et al.  New algorithms for fault trees analysis , 1993 .

[5]  Niklas Sörensson,et al.  Translating Pseudo-Boolean Constraints into SAT , 2006, J. Satisf. Boolean Model. Comput..

[6]  R. BurchJ.,et al.  Symbolic model checking , 1992 .

[7]  A. Satyanarayana,et al.  Efficient algorithms for reliability analysis of planar networks - a survey , 1986, IEEE Transactions on Reliability.

[8]  Bradley R. Schmerl,et al.  Model-based adaptation for self-healing systems , 2002, WOSS '02.

[9]  Randal E. Bryant,et al.  Graph-Based Algorithms for Boolean Function Manipulation , 1986, IEEE Transactions on Computers.

[10]  Suku Nair,et al.  Accelerated reliability analysis for self-healing SONET networks , 1998, SIGCOMM '98.

[11]  B. Heile,et al.  Self-organizing, self-healing wireless networks , 2000, 2000 IEEE International Conference on Personal Wireless Communications. Conference Proceedings (Cat. No.00TH8488).

[12]  Martin Lukasiewycz,et al.  Interactive presentation: Reliability-aware system synthesis , 2007 .

[13]  Petru Eles,et al.  Synthesis of Fault-Tolerant Schedules with Transparency/Performance Trade-offs for Distributed Embedded Systems , 2006, Proceedings of the Design Automation & Test in Europe Conference.

[14]  John Lee Reliability models of a class of self-healing rings , 1997 .

[15]  Andrew M. Tyrrell,et al.  Reliability analysis in self-repairing embryonic systems , 1999, Proceedings of the First NASA/DoD Workshop on Evolvable Hardware.

[16]  Falko Dressler,et al.  Lifetime Analysis in Heterogeneous Sensor Networks , 2006, 9th EUROMICRO Conference on Digital System Design (DSD'06).

[17]  Leslie G. Valiant,et al.  The Complexity of Enumeration and Reliability Problems , 1979, SIAM J. Comput..

[18]  Ken-ichi Sato,et al.  Self-healing ATM networks based on virtual path concept , 1994, IEEE J. Sel. Areas Commun..

[19]  Edmund M. Clarke,et al.  Symbolic Model Checking: 10^20 States and Beyond , 1990, Inf. Comput..

[20]  Yuan-Shun Dai Autonomic Computing and Reliability Improvement , 2005, ISORC.

[21]  Christian Haubelt,et al.  An Operating System Infrastructure for Fault-Tolerant Reconfigurable Networks , 2006, ARCS.

[22]  Klaus Waldschmidt,et al.  Architecture of Computing Systems - ARCS 2006 , 2006, Lecture Notes in Computer Science.

[23]  Michael Glaß,et al.  Topology-Aware Replica Placement in Fault-Tolerant Embedded Networks , 2008, ARCS.

[24]  Michael Glaß,et al.  Design space exploration of reliable networked embedded systems , 2007, J. Syst. Archit..