Transient Fault Tolerance and System Safety Enhancement Based on System Theory

Transient faults are hard to detect and locate because they are unpredictable and short-lived, and they are the dominant cause of system failures, so transient fault-tolerant design must be considered when developing modern safety-critical industrial systems. This paper proposes an approach based on system theory to tolerate transient faults in tunnel construction wireless monitoring and control systems (TCWMCS), in which the effects of transient faults are expressed as dysfunctional interactions among software applications. After the dysfunctional interactions of the system are analyzed with the operational process model and the causes of dysfunction are derived from the functional control diagram, a safety enhancement method is proposed for designers, in which effective safety constraints are established to tolerate the transient faults. The experimental evaluation indicated that the effects of transient faults could be exposed through the causal factors of dysfunctional interactions and that system safety could be enhanced by enforcing appropriate constraints.
