Availability analysis of repairable redundant system with three types of failures subject to common cause failure

This paper deals with the availability analysis of a redundant system consisting of N-non-identical components and S warm standby components under the care of single repair facility. The system may fail due to three reasons: 1) human error such as wrong adjustments in the design or development phase of the system or system may undergo damage during maintenance; 2) hardware failure; and 3) software failure. The incorporation of common cause failure gives the major impact on the system performance since it may cause dependant failures of some or all components of the system. An irreducible Markov process is used to construct the system of governing differential equations associated with three types of failure subject to common cause failure. The system availability for different configurations under transient condition is examined by using a numerical approach based on Runge-Kutta method of fourth order. Further, the sensitivity analysis is facilitated to demonstrate the validity of analytical results by taking a numerical illustration. The effect of different parameters on system performance indices are demonstrated graphically.

[1]  Lixuan Lu,et al.  Reliability evaluation of standby safety systems due to independent and common cause failures , 2006, 2006 IEEE International Conference on Automation Science and Engineering.

[2]  Tim Bedford,et al.  Human reliability analysis: A critique and review for managers , 2011 .

[3]  Mohammed Hajeeh,et al.  Multiple component series systems with imperfect repair , 2009 .

[4]  Mohammed Hajeeh Reliability and availability of a standby system with common cause failure , 2011 .

[5]  Bent Natvig,et al.  Using Expert Opinions in Bayesian Prediction of Component Lifetimes in a Shock Model , 1995, Math. Oper. Res..

[6]  I. B. Gertsbakh Reliability Characteristics of Alternative Checkup Schedules for Detecting Hidden Failures , 1978 .

[7]  Guozhi Xu,et al.  Modeling and analysis of software aging and software failure , 2007, J. Syst. Softw..

[8]  Xiang Li,et al.  A generic data-driven software reliability model with model mining technique , 2010, Reliab. Eng. Syst. Saf..

[9]  Mohammed A. Hajeeh Availability of a system with different repair options , 2012, Int. J. Math. Oper. Res..

[10]  Mendus Jacob,et al.  Reliability analysis of a complex system with a deteriorating standby unit under common-cause failure and critical human error , 1996 .

[11]  Elja Arjas,et al.  The Failure and Hazard Processes in Multivariate Reliability Systems , 1981, Math. Oper. Res..

[12]  D. Pandey,et al.  Cost analysis, availability and MTTF of a three state standby complex system under common cause and human failures , 1995 .

[13]  A Case Study in Human Error Modeling Based on a Hybrid Dynamical Systems Approach , 2005 .

[14]  Barry Kirwan,et al.  Human Reliability Assessment , 2008 .

[15]  Smail Adjabi,et al.  terminal reliability Evaluation of a Telecommunications Network represented by a Discret and a Dynamic model , 2011 .

[16]  Kishor S. Trivedi,et al.  Combining Performance and Availability Analysis in Practice , 2012, Adv. Comput..

[17]  Moshe Shaked,et al.  On the First Failure Time of Dependent Multicomponent Reliability Systems , 1988, Math. Oper. Res..

[18]  Jussi K. Vaurio Human factors, human reliability and risk assessment in license renewal of a nuclear power plant , 2009, Reliab. Eng. Syst. Saf..

[19]  Pierre Le Bot Human reliability data, human error and accident models - illustration through the Three Mile Island accident analysis , 2004, Reliab. Eng. Syst. Saf..

[20]  Mohammed Hajeeh Availability of deteriorated system with inspection subject to common-cause failure and human error , 2011 .

[21]  René J. Chevance Hardware and Software Solutions for High Availability , 2005 .

[22]  Jussi K. Vaurio Common cause failure probabilities in standby safety system fault tree analysis with testing - scheme and timing dependencies , 2003, Reliab. Eng. Syst. Saf..

[23]  Yuan-Shun Dai,et al.  A model for availability analysis of distributed software/hardware systems , 2002, Inf. Softw. Technol..

[24]  Madhu Jain,et al.  Load sharing M-out of-N: G system with non-identical components subject to common cause failure , 2012, Int. J. Math. Oper. Res..

[25]  Mark Tweeddale Chapter 11 – Introduction to “Software” or the “Human Factor”: Including Safety Culture, Safety Climate, and Human Error , 2003 .

[26]  M. A. W. Mahmoud,et al.  On a two-unit cold standby system considering hardware, human error failures and preventive maintenance , 2010, Math. Comput. Model..

[27]  Gregory Levitin Reliability and performance analysis of hardware-software systems with fault-tolerant software components , 2006, Reliab. Eng. Syst. Saf..

[28]  Swapna S. Gokhale,et al.  System Availability Analysis Considering Hardware/Software Failure Severities , 2005, 29th Annual IEEE/NASA Software Engineering Workshop.

[29]  T. M. Mak,et al.  Coping with Physical Failures, Soft Errors, and Reliability Issues , 2008 .

[30]  Pooja Mohan,et al.  Reliability Analysis of Consecutive-k, r-Out-Of-n: DFM System using GERT , 2007 .

[31]  Yong Bai,et al.  Human Reliability Assessment , 2003 .

[32]  Mohamed A. Ahmed,et al.  Redundancy Optimization of Reliability Models Subject to Imperfect Fault Coverage , 2008 .

[33]  Daniel R. Jeske,et al.  Reliability Modeling of Hardware and Software Interactions, and Its Applications , 2006, IEEE Transactions on Reliability.

[34]  Liudong Xing,et al.  Reliability analysis of hierarchical computer-based systems subject to common-cause failures , 2007, Reliab. Eng. Syst. Saf..