Important factors affecting fault detection coverage in probabilistic safety assessment of digital instrumentation and control systems

As digital instrumentation and control (I&C) systems are gradually introduced into nuclear power plants (NPPs), concerns about the I&C systems’ reliability and safety are growing. Fault detection coverage is one of the most critical factors in the probabilistic safety assessment (PSA) of digital I&C systems. To correctly estimate the fault detection coverage, it is first necessary to identify important factors affecting it. From experimental results found in the literature and the authors’ experience in fault injection experiments on digital systems, four system-related factors and four fault-related factors are identified as important factors affecting the fault detection coverage. A fault injection experiment is performed to demonstrate the dependency of fault detection coverage on some of the identified important factors. The implications of the experimental results on the estimation of fault detection coverage for the PSA of digital I&C systems are also explained. The set of four system-related factors and four fault-related factors is expected to provide a framework for systematically comparing and analyzing various fault injection experiments and the resultant estimations on fault detection coverage of digital I&C systems in NPPs.

[1]  Jan-Erik Holmberg,et al.  RELIABILITY ANALYSIS OF DIGITAL SYSTEMS IN A PROBABILISTIC RISK ANALYSIS FOR NUCLEAR POWER PLANTS , 2012 .

[2]  Y. Savaria,et al.  Software detection mechanisms providing full coverage against single bit-flip faults , 2004, IEEE Transactions on Nuclear Science.

[3]  Lixuan Lu,et al.  Probabilistic Safety Assessment for Instrumentation and Control Systems in Nuclear Power Plants: An Overview , 2004 .

[4]  Cristian Constantinescu,et al.  Experimental evaluation of error-detection mechanisms , 2003, IEEE Trans. Reliab..

[5]  J. FitzPatrick,et al.  UNITED STATES NUCLEAR REGULATORY COMMISSION REGION II , 1987 .

[6]  Barry W. Johnson,et al.  A method to determine equivalent fault classes for permanent and transient faults , 1995, Annual Reliability and Maintainability Symposium 1995 Proceedings.

[7]  Hyun Gook Kang,et al.  An analysis of safety-critical digital systems for risk-informed design , 2002, Reliab. Eng. Syst. Saf..

[8]  Suku Nair,et al.  Algorithm-Based Fault Tolerance on a Hypercube Multiprocessor , 1990, IEEE Trans. Computers.

[9]  Technical Systems,et al.  Digital Instrumentation and Control Systems in Nuclear Power Plants: Safety and Reliability Issues , 1997 .

[10]  C. Constantinescu Using multi-stage and stratified sampling for inferring fault-coverage probabilities , 1995 .

[11]  Jan Torin,et al.  Evaluating processor-behavior and three error-detection mechanisms using physical fault-injection , 1995 .

[12]  Jacob A. Abraham,et al.  Evaluation of integrated system-level checks for on-line error detection , 1996, Proceedings of IEEE International Computer Performance and Dependability Symposium.

[13]  John P. Hayes,et al.  Low-cost on-line fault detection using control flow assertions , 2003, 9th IEEE On-Line Testing Symposium, 2003. IOLTS 2003..

[14]  Guo-Chang Gu,et al.  An Improved CFCSS Control Flow Checking Algorithm , 2007, 2007 International Workshop on Anti-Counterfeiting, Security and Identification (ASID).

[15]  Seung Jun Lee,et al.  AN OVERVIEW OF RISK QUANTIFICATION ISSUES FOR DIGITALIZED NUCLEAR POWER PLANTS USING A STATIC FAULT TREE , 2009 .

[16]  C. Constantinescu Estimation of coverage probabilities for dependability validation of fault-tolerant computing systems , 1994, Proceedings of COMPASS'94 - 1994 IEEE 9th Annual Conference on Computer Assurance.

[17]  Seyed Ghassem Miremadi,et al.  CFCET: A hardware-based control flow checking technique in COTS processors using execution tracing , 2006, Microelectron. Reliab..

[18]  Raphael R. Some,et al.  Experimental evaluation of a COTS system for space applications , 2002, Proceedings International Conference on Dependable Systems and Networks.