Co-optimization of state dependent loading and mission abort policy in heterogeneous warm standby systems

Abstract This paper models and optimizes a 1-out-of-N: G heterogeneous warm standby system subject to state-dependent component loading and mission abort policies. For circumstances where a system's survival may have a higher priority than continuing a specified mission due to cost or safety reasons, when a certain incident or malfunction condition is met based on a predefined decision rule, the mission objective is aborted and a rescue procedure is performed for system survival. The mission abort policy implemented can affect the system performance significantly. In addition, load levels of system components affect their performances (e.g., productivity) and time-to-failure distributions, and further affect the overall system performance. This paper makes new contributions by formulating and solving a co-optimization problem that finds the optimal combination of mission abort and component loading policies, maximizing the mission success probability while providing a desired level of system survivability. As demonstrated through examples, the co-optimization results can facilitate optimal decisions on effective and safe operations management of warm standby systems.

[1]  Gregory Levitin,et al.  Non-Homogeneous 1-Out-of-${N}$ Warm Standby Systems With Random Replacement Times , 2015, IEEE Transactions on Reliability.

[2]  Jonathan L. Bell,et al.  Reliability growth planning based on essential function failures , 2014, 2014 Reliability and Maintainability Symposium.

[3]  A. Myers Probability of Loss Assessment of Critical $k$-Out-of-$n$:G Systems Having a Mission Abort Policy , 2009, IEEE Transactions on Reliability.

[4]  Francesco Longo,et al.  Availability Assessment of HA Standby Redundant Clusters , 2010, 2010 29th IEEE Symposium on Reliable Distributed Systems.

[5]  Kuo-Hsiung Wang,et al.  Simulation inferences for an availability system with general repair distribution and imperfect fault coverage , 2010, Simul. Model. Pract. Theory.

[6]  Marvin Rausand,et al.  System Reliability Theory: Models, Statistical Methods, and Applications , 2003 .

[7]  Gregory Levitin,et al.  Optimal loading of system with random repair time , 2015, Eur. J. Oper. Res..

[8]  Dong Liu,et al.  Quantification of Cut Sequence Set for Fault Tree Analysis , 2007, HPCC.

[9]  George Kokolakis,et al.  Reliability analysis of a two-unit general parallel system with (n-2) warm standbys , 2010, Eur. J. Oper. Res..

[10]  Marc Bouissou,et al.  A new formalism that combines advantages of fault-trees and Markov models: Boolean logic driven Markov processes , 2003, Reliab. Eng. Syst. Saf..

[11]  Gregory Levitin,et al.  Mission Cost and Reliability of 1-out-of- $N$ Warm Standby Systems With Imperfect Switching Mechanisms , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[12]  Liudong Xing,et al.  Reliability Analysis of Nonrepairable Cold-Standby Systems Using Sequential Binary Decision Diagrams , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[13]  Gregory Levitin,et al.  Optimization of Full versus Incremental Periodic Backup Policy , 2016, IEEE Transactions on Dependable and Secure Computing.

[14]  Liudong Xing,et al.  Redundancy allocation for series-parallel warm-standby systems , 2011, 2011 IEEE International Conference on Industrial Engineering and Engineering Management.

[15]  Gregory Levitin,et al.  Optimal task partition and state-dependent loading in heterogeneous two-element work sharing system , 2016, Reliab. Eng. Syst. Saf..

[16]  Way Kuo,et al.  Recent Advances in Optimal Reliability Allocation , 2007, IEEE Trans. Syst. Man Cybern. Part A.

[17]  D. Coit Cold-standby redundancy optimization for nonrepairable systems , 2001 .

[18]  Gregory Levitin,et al.  Effect of Failure Propagation on Cold vs. Hot Standby Tradeoff in Heterogeneous 1-Out-of-$N$:G Systems , 2015, IEEE Transactions on Reliability.

[19]  Gregory Levitin,et al.  Optimal sequencing of warm standby elements , 2013, Comput. Ind. Eng..

[20]  Antoine Rauzy,et al.  Sequence Algebra, Sequence Decision Diagrams and Dynamic Fault Trees , 2011, Reliab. Eng. Syst. Saf..

[21]  Jun Yang,et al.  Reliability of demand-based warm standby systems subject to fault level coverage , 2015 .

[22]  Liudong Xing,et al.  Binary decision diagram-based reliability evaluation of k-out-of-(n + k) warm standby systems subject to fault-level coverage , 2013 .

[23]  Liudong Xing,et al.  Fault-Tolerance and Reliability Analysis for Wireless Sensor Networks , 2009 .

[24]  Gregory Levitin,et al.  Cold vs. hot standby mission operation cost minimization for 1-out-of-N systems , 2014, Eur. J. Oper. Res..

[25]  Pham Hoang,et al.  Tampered Failure Rate Load-Sharing Systems: Status and Perspectives , 2008 .

[26]  Gregory Levitin,et al.  Optimal Design of Hybrid Redundant Systems With Delayed Failure-Driven Standby Mode Transfer , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[27]  Gregory Levitin Genetic algorithms in reliability engineering , 2006, Reliab. Eng. Syst. Saf..

[28]  Gregory Levitin,et al.  Sequencing Optimization in k-out-of-n Cold-Standby Systems Considering Mission Cost , 2013, Int. J. Gen. Syst..

[29]  Liudong Xing,et al.  Reliability of warm-standby systems subject to imperfect fault coverage , 2014 .

[30]  Suprasad V. Amari,et al.  Redundancy optimization problem with warm-standby redundancy , 2010, 2010 Proceedings - Annual Reliability and Maintainability Symposium (RAMS).

[31]  Kishor S. Trivedi,et al.  Investigating dynamic reliability and availability through state-space models , 2012, Comput. Math. Appl..

[32]  Ravishankar K. Iyer,et al.  Effect of System Workload on Operating System Reliability: A Study on IBM 3081 , 1985, IEEE Transactions on Software Engineering.

[33]  Gregory Levitin,et al.  Optimal component loading in 1-out-of-N cold standby systems , 2014, Reliab. Eng. Syst. Saf..

[34]  Baoding Liu,et al.  Redundancy optimization problems with uncertainty of combining randomness and fuzziness , 2004, Eur. J. Oper. Res..

[35]  Min Xie,et al.  Availability and reliability of k-out-of-(M+N): G warm standby systems , 2006, Reliab. Eng. Syst. Saf..