LIFT: Learning Fault Trees from Observational Data

Industries with safety-critical systems increasingly collect data on events occurring at the level of system components, thus capturing instances of system failure or malfunction. Once such data is available, it becomes possible to automatically learn a model describing the failure modes of the system, i.e., how the states of individual components combine to cause a system failure. We present LIFT, a machine learning method that learns static fault trees directly from observational datasets. The fault trees model probabilistic causal chains of events ending in a global system failure. Our method uses the Mantel-Haenszel statistical test to narrow down possible causal relationships between events. We evaluate LIFT on synthetic case studies, show how its performance varies with the quality of the data, and discuss practical variants of LIFT.
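To illustrate the kind of test the abstract refers to, the following is a minimal sketch of a Mantel-Haenszel chi-square statistic computed over stratified 2 × 2 contingency tables. It is not the LIFT implementation: the function name, the table layout (a, b, c, d), the continuity correction, and the example counts are all assumptions made for illustration. Each stratum is taken to cross-tabulate a candidate cause event against an effect event for one setting of the other variables.

```python
from typing import Iterable, Tuple

# One 2x2 table per stratum, laid out as (a, b, c, d):
#                 effect=1  effect=0
#   cause=1          a         b
#   cause=0          c         d
Table = Tuple[int, int, int, int]

CHI2_CRITICAL_1DF_05 = 3.841  # chi-square critical value, 1 d.o.f., alpha = 0.05


def mantel_haenszel_statistic(strata: Iterable[Table],
                              continuity: bool = True) -> float:
    """Mantel-Haenszel chi-square statistic over stratified 2x2 tables."""
    diff_sum = 0.0  # sum over strata of (observed a - expected a)
    var_sum = 0.0   # sum over strata of the hypergeometric variance of a
    for a, b, c, d in strata:
        n = a + b + c + d
        if n < 2:
            continue  # a stratum this small carries no information
        diff_sum += (a * d - b * c) / n
        var_sum += (a + b) * (c + d) * (a + c) * (b + d) / (n * n * (n - 1))
    if var_sum == 0:
        return 0.0
    num = abs(diff_sum)
    if continuity:
        num = max(0.0, num - 0.5)  # continuity correction (an assumption here)
    return num * num / var_sum


# Hypothetical counts: two strata in which cause and effect co-occur often.
associated = [(8, 2, 2, 8), (9, 1, 1, 9)]
# Hypothetical counts: one stratum with no association at all.
independent = [(5, 5, 5, 5)]

stat = mantel_haenszel_statistic(associated)
print(stat > CHI2_CRITICAL_1DF_05)             # association is significant
print(mantel_haenszel_statistic(independent))  # no evidence of association
```

A statistic above the critical value suggests the two events remain associated after stratifying on the other variables, which is the sense in which such a test can narrow down candidate causal relationships.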
