A LOF-Based Method for Abnormal Segment Detection in Machinery Condition Monitoring

Machinery condition monitoring has entered the era of big data and some research has been done based on big data. Abnormal segments, such as missing segments and drift segments, are inevitable in big data acquired from harsh industrial environment due to temporary sensor failures, network segment transmission delays, or accidental loss of some collected data and so on. Being independent of the machinery condition, the abnormal segments not only reduce the quality of the data for condition monitoring and big data analysis, but also bring a heavy computation load. However, there are few reports to address abnormal segment detection for further data cleaning in the field of machinery condition monitoring. Therefore, an abnormal segment detection method is proposed to improve the quality of big data. First, a sliding window is used to separate the data into different segments. Then, 14 kinds of time-domain features are extracted from each segment and principle component analysis (PCA) is employed to extract the principle components from these features. In addition, local outlier factor (LOF) is calculated based on the principle components to evaluate the degree of being an outlier for each segment. Finally, the data, including a drift segment from a real wind turbine, are used to verify the effectiveness of the proposed method.

[1]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[2]  Robert Babuska,et al.  Railway Track Circuit Fault Diagnosis Using Recurrent Neural Networks , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Christophe Ley,et al.  Detecting outliers: Do not use standard deviation around the mean, use absolute deviation around the median , 2013 .

[4]  Klaus-Robert Müller,et al.  From outliers to prototypes: Ordering data , 2006, Neurocomputing.

[5]  Xin Zhou,et al.  Deep neural networks: A promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data , 2016 .

[6]  Fuwen Yang,et al.  Robust finite-horizon filtering for stochastic systems with missing measurements , 2005, IEEE Signal Processing Letters.

[7]  Ian R. Petersen,et al.  Model validation and state estimation for uncertain continuous-time systems with missing discrete-continuous data , 1996, Proceedings of 35th IEEE Conference on Decision and Control.

[8]  Heiko Paulheim,et al.  A decomposition of the outlier detection problem into a set of supervised learning problems , 2015, Machine Learning.

[9]  Ke Zhang,et al.  A New Local Distance-Based Outlier Detection Approach for Scattered Real-World Data , 2009, PAKDD.

[10]  Xueying Zhang,et al.  Robust support vector data description for outlier detection with noise or uncertain data , 2015, Knowl. Based Syst..

[11]  Ada Wai-Chee Fu,et al.  Enhancements on local outlier detection , 2003, Seventh International Database Engineering and Applications Symposium, 2003. Proceedings..

[12]  Aleksandar Lazarevic,et al.  Incremental Local Outlier Detection for Data Streams , 2007, 2007 IEEE Symposium on Computational Intelligence and Data Mining.

[13]  Douglas M. Hawkins Identification of Outliers , 1980, Monographs on Applied Probability and Statistics.

[14]  Feng Jia,et al.  An Intelligent Fault Diagnosis Method Using Unsupervised Feature Learning Towards Mechanical Big Data , 2016, IEEE Transactions on Industrial Electronics.

[15]  Daniel W. C. Ho,et al.  Variance-constrained filtering for uncertain stochastic systems with missing measurements , 2003, IEEE Trans. Autom. Control..