Effect of Failure Propagation on Cold vs. Hot Standby Tradeoff in Heterogeneous 1-Out-of-$N$:G Systems

This paper considers 1-out-of- N:G heterogeneous fault-tolerant systems that are designed with a mix of hot and cold standby redundancies to achieve the tradeoff between restoration and operation costs of standby elements. In such systems, the way in which the elements are distributed between hot and cold standby groups and the initiation sequence of all the cold standby elements can greatly affect the system reliability and mission cost. Therefore, it is significant to solve the optimal standby element distributing and sequencing problem (SE-DSP). The failure that occurs in a system element can propagate, causing the outage of other system elements, which complicates the solution to the SE-DSP problem. In this paper, we first propose a numerical method for evaluating the reliability and expected mission cost of 1-out-of- N:G systems with mixed hot and cold redundancy types and propagated failures. Two different failure propagation modes are considered: an element failure causing the outage of all the system elements, and an element failure causing the outage of only working or hot standby elements but not cold standby elements. A genetic algorithm is utilized as an optimization tool for solving the formulated SE-DSP problem, leading to a solution that can minimize the expected mission cost of the system while providing a desired level of the system reliability. Effects of the failure propagation probability on the system reliability, expected mission cost, as well as the optimization results are investigated. The suggested methodology can facilitate a reliability-cost tradeoff study of the considered systems, thus assisting in optimal decision making regarding the system's standby policy. Examples are provided for illustrating the considered problem as well as the proposed solution methodology.

[1]  Juan Eloy Ruiz-Castro,et al.  A complex discrete warm standby system with loss of units , 2012, Eur. J. Oper. Res..

[2]  Liudong Xing,et al.  Reliability evaluation and optimization of series–parallel systems with k-out-of-n: G subsystems and mixed redundancy types , 2013 .

[3]  J. Onishi,et al.  Solving the Redundancy Allocation Problem With a Mix of Components Using the Improved Surrogate Constraint Method , 2007, IEEE Transactions on Reliability.

[4]  Liudong Xing,et al.  Reliability Analysis of Nonrepairable Cold-Standby Systems Using Sequential Binary Decision Diagrams , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[5]  Gregory Levitin,et al.  Cold-standby sequencing optimization considering mission cost , 2013, Reliab. Eng. Syst. Saf..

[6]  Alice E. Smith,et al.  An ant colony optimization algorithm for the redundancy allocation problem (RAP) , 2004, IEEE Transactions on Reliability.

[7]  Amir Abbas Najafi,et al.  A bi-objective model to optimize reliability and cost of system with a choice of redundancy strategies , 2012, Comput. Ind. Eng..

[8]  Vanderlei da Costa Bueno,et al.  Active redundancy allocation for a k , 2007, Eur. J. Oper. Res..

[9]  R. Ramaswami,et al.  Book Review: Design and Analysis of Fault-Tolerant Digital Systems , 1990 .

[10]  D. Coit Cold-standby redundancy optimization for nonrepairable systems , 2001 .

[11]  Ta-Cheng Chen,et al.  Immune algorithms-based approach for redundant reliability problems with multiple component choices , 2005, Comput. Ind..

[12]  David W. Coit,et al.  System reliability optimization with k-out-of-n subsystems and changing k , 2011, The Proceedings of 2011 9th International Conference on Reliability, Maintainability and Safety.

[13]  David W. Coit,et al.  Genetic algorithm to maximize a lower-bound for system time-to-failure with uncertain component Weibull parameters , 2002 .

[14]  Gregory Levitin,et al.  Cold vs. hot standby mission operation cost minimization for 1-out-of-N systems , 2014, Eur. J. Oper. Res..

[15]  Peng Zhao,et al.  Optimal allocation of redundancies in series systems , 2012, Eur. J. Oper. Res..

[16]  Barry W. Johnson Design & analysis of fault tolerant digital systems , 1988 .

[17]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[18]  D. Pandey,et al.  Reliability analysis of a powerloom plant with cold standby for its strategic unit , 1996 .

[19]  David W. Coit,et al.  Maximization of System Reliability with a Choice of Redundancy Strategies , 2003 .

[20]  Baoding Liu,et al.  Redundancy optimization problems with uncertainty of combining randomness and fuzziness , 2004, Eur. J. Oper. Res..

[21]  Gregory Levitin,et al.  Optimization of predetermined standby mode transfers in 1-out-of-N: G systems , 2014, Comput. Ind. Eng..

[22]  K. Shen,et al.  The effectiveness of adding standby redundancy at system and component levels , 1991 .

[23]  Baoding Liu,et al.  Standby redundancy optimization problems with fuzzy lifetimes , 2005, Comput. Ind. Eng..

[24]  Way Kuo,et al.  Recent Advances in Optimal Reliability Allocation , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[25]  Gregory Levitin,et al.  Reliability and performance of multi-state systems with propagated failures having selective effect , 2010, Reliab. Eng. Syst. Saf..

[26]  K. Misra,et al.  An efficient algorithm to solve integer-programming problems arising in system-reliability design , 1991 .

[27]  Mitsuo Gen,et al.  Soft computing approach for reliability optimization: State-of-the-art survey , 2006, Reliab. Eng. Syst. Saf..

[28]  Nam Kee Lee,et al.  System Reliability Allocation and a Computational Algorithm , 1968 .

[29]  David W. Coit,et al.  Reliability optimization of series-parallel systems using a genetic algorithm , 1996, IEEE Trans. Reliab..

[30]  Gregory Levitin Optimal Structure of Multi-State Systems With Uncovered Failures , 2008, IEEE Transactions on Reliability.