Dependability analysis for fault-tolerant computer systems using dynamic fault graphs

Dependability analysis is an important step in designing and analyzing safety computer systems and protection systems. Introducing multi-processor and virtual machine increases the system faults' complexity, diversity and dynamic, in particular for software-induced failures, with an impact on the overall dependability. Moreover, it is very different for safety system to operate successfully at any active phase, since there is a huge difference in failure rate between hardware-induced and softwareinduced failures. To handle these difficulties and achieve accurate dependability evaluation, consistently reflecting the construct it measures, a new formalism derived from dynamic fault graphs (DFG) is developed in this paper. DFG exploits the concept of system event as fault state sequences to represent dynamic behaviors, which allows us to execute probabilistic measures at each timestamp when change occurs. The approach automatically combines the reliability analysis with the system dynamics. In this paper, we describe how to use the proposed methodology drives to the overall system dependability analysis through the phases of modeling, structural discovery and probability analysis, which is also discussed using an example of a virtual computing system.

[1]  田村 芳明,et al.  Kemari: Virtual Machine Synchronization for Fault Tolerance , 2010 .

[2]  Giuliana Franceschinis,et al.  Parametric Fault Tree for the Dependability Analysis of Redundant Systems and Its High-Level Petri Net Semantics , 2003, IEEE Trans. Software Eng..

[3]  Dharma P. Agrawal,et al.  Dependability analysis for large systems: A hierarchical modeling approach , 1993, Proceedings of 1993 5th IEEE Symposium on Parallel and Distributed Processing.

[4]  Salvatore J. Bavuso,et al.  Dynamic fault-tree models for fault-tolerant computer systems , 1992 .

[5]  Luigi Portinale,et al.  Automatically translating dynamic fault trees into dynamic Bayesian networks by means of a software tool , 2006, First International Conference on Availability, Reliability and Security (ARES'06).

[6]  Marco Vallini,et al.  Planning Dependability of Virtualised Networks , 2009, 2009 Second International Conference on Dependability.

[7]  Joanne Bechta Dugan,et al.  Approximate sensitivity analysis for acyclic Markov reliability models , 2003, IEEE Trans. Reliab..

[8]  Antonio Puliafito,et al.  Dependability Evaluation with Dynamic Reliability Block Diagrams and Dynamic Fault Trees , 2009, IEEE Transactions on Dependable and Secure Computing.

[9]  Tal Garfinkel,et al.  When Virtual Is Harder than Real: Security Challenges in Virtual Machine Based Computing Environments , 2005, HotOS.

[10]  John D. Andrews,et al.  Dependability analysis of systems with on-demand and active failure modes, using dynamic fault trees , 2002, IEEE Trans. Reliab..

[11]  Karen Scarfone,et al.  Common Vulnerability Scoring System , 2006, IEEE Security & Privacy.

[12]  Gorazd Kandus,et al.  Virtual multiple input multiple output in multiple high-altitude platform constellations , 2009, IET Commun..

[13]  Corinna Cortes,et al.  Computational Methods for Dynamic Graphs , 2003 .

[14]  Krishna R. Pattipati,et al.  Dynamic Multiple Fault Diagnosis: Mathematical Formulations and Solution Techniques , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[15]  Wojciech Zamojski,et al.  Dependability Analysis of Information Systems with Hierarchical Reconfiguration of Services , 2008, 2008 Second International Conference on Emerging Security Information, Systems and Technologies.

[16]  Kishor S. Trivedi,et al.  Dependability analysis of distributed computer systems with imperfect coverage , 1999, Digest of Papers. Twenty-Ninth Annual International Symposium on Fault-Tolerant Computing (Cat. No.99CB36352).

[17]  Edward P. F. Chan,et al.  Shortest Path Tree Computation in Dynamic Graphs , 2009, IEEE Transactions on Computers.

[18]  Shu-Chin Wang,et al.  Reaching Agreement among Virtual Subnets in Hybrid Failure Mode , 2008, IEEE Transactions on Parallel and Distributed Systems.

[19]  Alexander Chatzigeorgiou,et al.  Architectural Risk Analysis of Software Systems Based on Security Patterns , 2008, IEEE Transactions on Dependable and Secure Computing.

[20]  Chuang Lin,et al.  Model-Driven Dependability Analysis of Virtualization Systems , 2009, 2009 Eighth IEEE/ACIS International Conference on Computer and Information Science.

[21]  Richard F. Paige,et al.  Fault trees for security system design and analysis , 2003, Comput. Secur..

[22]  P. Reviriego,et al.  Reliability Analysis of Memories Suffering Multiple Bit Upsets , 2007, IEEE Transactions on Device and Materials Reliability.

[23]  Victor P. Nelson Fault-tolerant computing: fundamental concepts , 1990, Computer.

[24]  W. Salem Combining VR Technology and Human Factors Methods for Supporting Risk Analysis , 2008, 2008 3rd International Conference on Information and Communication Technologies: From Theory to Applications.