Reliability Evaluation of Network Systems with Dependent Propagated Failures Using Decision Diagrams

In a network system, a propagated failure (PF) is a failure originating from a network component that can cause extensive damage to other network components or even the failure of the entire system. Existing work on PFs has mostly assumed that a component PF has a deterministic effect, i.e., a fixed subset of system components is affected whenever the PF occurs. However, in many real-world systems, components may have different levels of protection, and the damage caused by a component PF can depend on the status of other components within the same system or on the order in which component failures occur. This paper proposes a new analytical method based on multi-valued decision diagrams (MDDs) for the reliability analysis of network systems with dependent propagation effects. In particular, new MDD modeling procedures are proposed to account for the different types of dependent PF effects introduced by different protection levels. The system MDD is generated using a new MDD combination algorithm that efficiently handles the dependent PF effects, and methods for computing the network reliability and component importance measures are then presented. A detailed analysis of an example network system subject to dependent PFs illustrates the basics and application of the proposed method. The proposed MDD-based method is shown to generate a smaller model and thus to have lower computational complexity in model generation and evaluation than the existing Markov method and separable method.
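To make the idea of evaluating reliability on an MDD concrete, the following is a minimal sketch of the generic evaluation step only: a recursive, memoized traversal that weights each outgoing edge by the probability of the corresponding component state. It is not the paper's modeling or combination algorithm. The `Node` class, the `evaluate` function, the two-component example, the three states (working, local failure, propagated failure), and all probability values are assumptions chosen for illustration, and the example uses a simple deterministic global PF rather than the dependent effects the paper addresses.

```python
from dataclasses import dataclass
from typing import Dict, Tuple, Union


@dataclass(frozen=True)
class Node:
    """Hypothetical MDD node: one child per state of the component `var`."""
    var: str
    children: Tuple[Union["Node", int], ...]  # int terminals: 1 = works, 0 = fails


def evaluate(root: Union[Node, int],
             state_probs: Dict[str, Tuple[float, ...]]) -> float:
    """Return the probability of reaching terminal 1 from `root`."""
    memo: Dict[int, float] = {}

    def visit(node: Union[Node, int]) -> float:
        if isinstance(node, int):          # terminal node
            return float(node)
        key = id(node)                     # shared subgraphs are visited once
        if key not in memo:
            probs = state_probs[node.var]
            memo[key] = sum(p * visit(c) for p, c in zip(probs, node.children))
        return memo[key]

    return visit(root)


# Illustrative system (assumed): components A and B in parallel.
# States per component: 0 = working, 1 = local failure, 2 = propagated failure.
# Assumed rule: a PF from either component brings down the whole system.
b_when_a_works = Node("B", (1, 1, 0))   # system survives unless B has a PF
b_when_a_local = Node("B", (1, 0, 0))   # system survives only if B works
root = Node("A", (b_when_a_works, b_when_a_local, 0))

state_probs = {
    "A": (0.90, 0.06, 0.04),  # assumed state probabilities at mission time
    "B": (0.80, 0.15, 0.05),
}

reliability = evaluate(root, state_probs)
print(f"System reliability: {reliability:.3f}")
```

Under these assumed numbers the traversal computes 0.9 × (0.8 + 0.15) + 0.06 × 0.8 = 0.903. Memoizing on node identity keeps the cost proportional to the number of MDD nodes rather than the number of paths, which is why a smaller diagram translates directly into cheaper evaluation.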
