Learning fault patterns from data in streaming applications

In sensor-based systems, spatio-temporal data streams are often related in non-trivial ways. For example in avionics, while the airspeed that an aircraft attains in cruise phase depends on the weight it carries, it also depends on many other factors such as engine inputs, angle of attack and air density. It is therefore a challenge to develop failure models that can help recognize errors in the data, such as an incorrect fuel quantity or an incorrect airspeed. In this paper, we learn failure models from data streams using two approaches: (1) create static parameterized functional models using regression and use them to detect error modes based on error signatures, and (2) create statistical models based on the näıve Bayes classifier that dynamically update themselves to detect new error modes online. We extend our own PILOTS system to evaluate the accuracy, response time, and adaptability of these learning techniques. While error signatures can be more accurate and responsive than Bayesian learning, the latter method adapts better due to its data-driven nature.

[1]  Carlos A. Varela,et al.  Towards Learning Spatio-Temporal Data Stream Relationships for Failure Detection in Avionics , 2018, Handbook of Dynamic Data Driven Applications Systems.

[2]  Alessandro Galli,et al.  Dynamic Data-driven Avionics Systems: Inferring Failure Modes from Data Streams , 2015, ICCS.

[3]  Mogens Blanke,et al.  Diagnosis of Airspeed Measurement Faults for Unmanned Aerial Vehicles , 2014, IEEE Transactions on Aerospace and Electronic Systems.

[4]  Erik Frisk,et al.  Data-driven and adaptive statistical residual evaluation for fault detection with an automotive application , 2014 .

[5]  Karen Willcox,et al.  Multifidelity DDDAS Methods with Application to a Self-aware Aerospace Vehicle , 2014, ICCS.

[6]  Carlos A. Varela,et al.  Self-Healing Spatio-temporal Data Streams Using Error Signatures , 2013, 2013 IEEE 16th International Conference on Computational Science and Engineering.

[7]  J. Tinsley Oden,et al.  Virtual model validation of complex multiscale systems: Applications to nonlinear elastostatics , 2013 .

[8]  Daniel Mills,et al.  MillWheel: Fault-Tolerant Stream Processing at Internet Scale , 2013, Proc. VLDB Endow..

[9]  Carlos A. Varela,et al.  Autonomous Data Error Detection and Recovery in Streaming Applications , 2013, ICCS.

[10]  James Llinas,et al.  High Level Information Fusion (HLIF): Survey of models, issues, and grand challenges , 2012, IEEE Aerospace and Electronic Systems Magazine.

[11]  Carlos A. Varela,et al.  Programming spatio-temporal data streaming applications with high-level specifications , 2012, QUeST '12.

[12]  Ali Zolghadri,et al.  Advanced model-based FDIR techniques for aerospace systems: Today challenges and opportunities , 2012 .

[13]  Matthias Jarke,et al.  An evaluation framework for traffic information systems based on data streams , 2012 .

[14]  Scott Shenker,et al.  Discretized Streams: An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters , 2012, HotCloud.

[15]  Julien Marzat,et al.  Model-based fault diagnosis for aerospace systems: a survey , 2012 .

[16]  Carlos A. Varela,et al.  A Programming Model for Spatio-temporal Data Streaming Applications , 2012, ICCS.

[17]  Badrish Chandramouli,et al.  Spatio-Temporal Stream Processing in Microsoft StreamInsight , 2010, IEEE Data Eng. Bull..

[18]  Inseok Hwang,et al.  A Survey of Fault Detection, Isolation, and Reconfiguration Methods , 2010, IEEE Transactions on Control Systems Technology.

[19]  Luc De Raedt,et al.  ProbLog: A Probabilistic Prolog and its Application in Link Discovery , 2007, IJCAI.

[20]  Walid G. Aref,et al.  Continuous Query Processing of Spatio-Temporal Data Streams in PLACE , 2005, GeoInformatica.

[21]  Juwan Kim,et al.  Moving objects management system supporting location data stream , 2005 .

[22]  R. Baierlein Probability Theory: The Logic of Science , 2004 .

[23]  Philip S. Yu,et al.  Active Mining of Data Streams , 2004, SDM.

[24]  Frederica Darema,et al.  Dynamic Data Driven Applications Systems: A New Paradigm for Application Simulations and Measurements , 2004, International Conference on Computational Science.

[25]  Philip S. Yu,et al.  A Regression-Based Temporal Pattern Mining Scheme for Data Streams , 2003, VLDB.

[26]  Yixin Chen,et al.  Multi-Dimensional Regression Analysis of Time-Series Data Streams , 2002, VLDB.

[27]  Irina Rish,et al.  An empirical study of the naive Bayes classifier , 2001 .

[28]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[29]  Peter S. Maybeck,et al.  Sensor/actuator failure detection in the Vista F-16 by multiple model adaptive estimation , 1995, IEEE Transactions on Aerospace and Electronic Systems.

[30]  Stephen Grossberg,et al.  ARTMAP: supervised real-time learning and classification of nonstationary data by a self-organizing neural network , 1991, [1991 Proceedings] IEEE Conference on Neural Networks for Ocean Engineering.

[31]  J. Anderson,et al.  Fundamentals of Aerodynamics , 1984 .