Mining abnormal patterns from heterogeneous time-series with irrelevant features for fault event detection

We address the issue of detecting fault events in multivariate time series. We suppose the following realistic and challenging situations: (A) the features to which multivariate time series correspond are heterogeneous; (B) only a small number of examples of fault events are available in advance relative to a large number of normal examples; and (C) many features irrelevant to fault events are included. To resolve these situations, detecting faults specifically in machine systems such as automobile, train, etc. is usually required. We propose an algorithm to resolve the issue. Key ideas in it include: (1) transforming the time-series for each feature into a sequence of anomaly scores, in order to map heterogeneous features to homogeneous ones (an anomaly score indicates the degree of anomalousness relative to an ordinal sequence) and then representing the pattern of a fault event in terms of fault anomaly score vectors; (2) selecting features specifying a fault event by means of iterative optimization using both normal and fault anomaly score vectors. We then monitor the degree of anomalousness with respect to target anomaly score vectors by comparing them with the abnormal patterns. We demonstrate the effectiveness of our proposed algorithm through an application to an actual automobile fault diagnosis data set as well as an artificial dataset. © 2009 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 2: 1-17, 2009

[1]  Chunsheng Yang,et al.  Learning to predict train wheel failures , 2005, KDD '05.

[2]  Kenji Yamanishi,et al.  A unifying framework for detecting outliers and change points from time series , 2006 .

[3]  Jaideep Srivastava,et al.  Event detection from time series data , 1999, KDD '99.

[4]  Padhraic Smyth,et al.  Knowledge Discovery and Data Mining: Towards a Unifying Framework , 1996, KDD.

[5]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[6]  Graham J. Williams,et al.  On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms , 2000, KDD '00.

[7]  Keisuke Inoue,et al.  Knowledge Discovery from Heterogeneous Dynamic Systems using Change-Point Correlations , 2005, SDM.

[8]  Sameer Singh,et al.  Novelty detection: a review - part 1: statistical approaches , 2003, Signal Process..

[9]  J. Ma,et al.  Time-series novelty detection using one-class support vector machines , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[10]  Sameer Singh,et al.  Novelty detection: a review - part 2: : neural network based approaches , 2003, Signal Process..

[11]  Takehisa Yairi,et al.  An approach to spacecraft anomaly detection problem using kernel feature space , 2005, KDD '05.

[12]  Kevin P. Murphy,et al.  Modeling changing dependency structure in multivariate time series , 2007, ICML '07.

[13]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[14]  Michael Fink,et al.  Object Classification from a Single Example Utilizing Class Relevance Metrics , 2004, NIPS.

[15]  Vipin Kumar,et al.  Feature bagging for outlier detection , 2005, KDD '05.

[16]  Michael J. Pont,et al.  Improving the performance of radial basis function classifiers in condition monitoring and fault diagnosis applications where 'unknown' faults may occur , 2002, Pattern Recognit. Lett..

[17]  S. Thrun,et al.  Particle Filters for Rover Fault Diagnosis , 2004 .

[18]  Bo-Suk Yang,et al.  Fault Diagnosis System of Induction Motors Using Feature Extraction, Feature Selection and Classification Algorithm , 2006 .

[19]  Peter McBurney,et al.  Chance Discovery , 2003, Advanced Information Processing.

[20]  Uri Lerner,et al.  Hybrid Bayesian networks for reasoning about complex systems , 2002 .

[21]  Michael I. Jordan,et al.  Failure diagnosis using decision trees , 2004 .

[22]  Keming Wang Neural Network Approach to Vibration Feature Selection and Multiple Fault Detection for Mechanical Systems , 2006, First International Conference on Innovative Computing, Information and Control - Volume I (ICICIC'06).

[23]  X. Wang,et al.  Catalytic Asymmetric Epoxidation of Chalcones , 2006 .

[24]  Chris Jermaine,et al.  Outlier detection by sampling with accuracy guarantees , 2006, KDD '06.

[25]  Ye Li,et al.  Fault diagnosis based on support vector machine ensemble , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[26]  Philip S. Yu,et al.  Outlier detection for high dimensional data , 2001, SIGMOD '01.

[27]  Bianca Zadrozny,et al.  Outlier detection by active learning , 2006, KDD '06.