Anomaly detection and prediction of sensors faults in a refinery using data mining techniques and fuzzy logic

Like all manufacturing companies, refineries use many sensors to monitor and control the process of refining, therefore it is very crucial to detect any sensor faults or anomalies as early as possible, and to be able to replace or repair a sensor well in advance of any fault. Objective of this paper is to present a method for detecting anomalies in a sensor data, as well as to predict next occurance of a sensor failure. Data mining techniques to detect anomaly in sensor data and predict the occurrence of next faulty event were introduced. For anomaly detection, this research used MATLAB’s fuzzy logic toolbox tools to find clusters which uses subtractive fuzzy clustering algorithm and generates a model, a Sugeno-type fuzzy inference system. The same toolbox was used to evaluate the model with a promising result. To predict sensor fault, the original time series were used to create a new ‘derived time series’. Two prediction models known as ‘auto regressive integrated moving average’ and ‘autoregressive tree models’ were used against the new time series to predict next occurrence of sensor failure. The results of these models were compared. The model developed and introduced in this paper serves as an additional tool, which helps not only engineers and operators of oil refineries, but also other engineers of other disciplines to work more efficiently.

[1]  Takehisa Yairi,et al.  An approach to spacecraft anomaly detection problem using kernel feature space , 2005, KDD '05.

[2]  Stephen Ogaji,et al.  Artificial Neural Networks in Fault Diagnosis: A Gas Turbine Scenario , 2006 .

[3]  Selim Gullulu,et al.  Signal based approach for data mining in fault detection of induction motor , 2011 .

[4]  Sanaz Karimkhani Asl Outlier detection in Wireless Sensor Networks , 2013 .

[5]  Rajarshi Roy,et al.  Using synchronization for prediction of high-dimensional chaotic dynamics. , 2008, Physical review letters.

[6]  D. Janakiram,et al.  Outlier Detection in Wireless Sensor Networks using Bayesian Belief Networks , 2006, 2006 1st International Conference on Communication Systems Software & Middleware.

[7]  V. M. Karyampudi,et al.  The Effect of Assimilating Rain Rates Derived from Satellites and Lightning on Forecasts of the 1993 Superstorm , 1999 .

[8]  W. Kling,et al.  Estimation of variability and predictability of large‐scale wind energy in The Netherlands , 2009 .

[9]  Bernard C. Jiang,et al.  Using data mining techniques for multi-diseases prediction modeling of hypertension and hyperlipidemia by common risk factors , 2011, Expert Syst. Appl..

[10]  Thomas M. Smith,et al.  A new merged analysis of precipitation utilizing satellite and reanalysis data , 2008 .

[11]  Ebrahim H. Mamdani,et al.  An Experiment in Linguistic Synthesis with a Fuzzy Logic Controller , 1999, Int. J. Hum. Comput. Stud..

[12]  Ran Wolff,et al.  In-Network Outlier Detection in Wireless Sensor Networks , 2006, ICDCS.

[13]  Philip S. Yu,et al.  Outlier detection for high dimensional data , 2001, SIGMOD '01.

[14]  Michio Sugeno,et al.  Industrial Applications of Fuzzy Control , 1985 .

[15]  André R. S. Marçal,et al.  Very early prediction of wine yield based on satellite data from VEGETATION , 2010 .

[16]  M.M. Deris,et al.  A Comparative Study for Outlier Detection Techniques in Data Mining , 2006, 2006 IEEE Conference on Cybernetics and Intelligent Systems.

[17]  Teh Ying Wah,et al.  Applied Data Mining Approach in Ubiquitous World of Air Transportation , 2009, 2009 Fourth International Conference on Computer Sciences and Convergence Information Technology.

[18]  Hamido Fujita,et al.  Proceedings of the 10th WSEAS international conference on Applied computer science , 2010 .

[19]  Teh Ying Wah,et al.  Data mining and data gathering in a refinery , 2010 .

[20]  D. Heckerman,et al.  Autoregressive Tree Models for Time-Series Analysis , 2002, SDM.

[21]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[22]  Xue Wang,et al.  Time Series Forecasting for Energy-efficient Organization of Wireless Sensor Networks , 2007, Sensors.

[23]  C. Köpken,et al.  Validation of Integrated Water Vapor from Numerical Models Using Ground-Based GPS, SSM/I, and Water Vapor Radiometer Measurements , 2001 .

[24]  Shahab D. Mohaghegh A new methodology for the identification of best practices in the oil and gas industry, using intelligent systems , 2005 .

[25]  Peter J. Rousseeuw,et al.  Robust Regression and Outlier Detection , 2005, Wiley Series in Probability and Statistics.

[26]  P. Sajda,et al.  Detection, synthesis and compression in mammographic image analysis with a hierarchical image probability model , 2001, Proceedings IEEE Workshop on Mathematical Methods in Biomedical Image Analysis (MMBIA 2001).

[27]  Mark Schwabacher,et al.  Unsupervised Anomaly Detection for Liquid-Fueled Rocket Propulsion Health Monitoring , 2007, J. Aerosp. Comput. Inf. Commun..

[28]  Sameer Singh,et al.  Novelty detection: a review - part 2: : neural network based approaches , 2003, Signal Process..

[29]  F. E. Grubbs Procedures for Detecting Outlying Observations in Samples , 1969 .

[30]  Feng Lin,et al.  Dynamic sensor activation for event diagnosis , 2009, 2009 American Control Conference.

[31]  Molly E. Brown,et al.  Evaluation of the consistency of long-term NDVI time series derived from AVHRR,SPOT-vegetation, SeaWiFS, MODIS, and Landsat ETM+ sensors , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Reda Alhajj,et al.  A comprehensive survey of numeric and symbolic outlier mining techniques , 2006, Intell. Data Anal..

[33]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[34]  Dimitrios Gunopulos,et al.  Online outlier detection in sensor data using non-parametric models , 2006, VLDB.

[35]  Anastasios Kementsietsidis,et al.  Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data , 2001, SIGMOD 2011.

[36]  Young-Koo Lee,et al.  An Anomaly Detection Algorithm for Detecting Attacks in Wireless Sensor Networks , 2006, ISI.

[37]  Jung-Min Park,et al.  An overview of anomaly detection techniques: Existing solutions and latest technological trends , 2007, Comput. Networks.

[38]  A. Kulkarni,et al.  Impact of climatic parameters on statistical stream flow sensitivity analysis for hydro power , 2009 .

[39]  Vipin Kumar,et al.  Parallel and Distributed Computing for Cybersecurity , 2005, IEEE Distributed Syst. Online.

[40]  Lotfi A. Zadeh,et al.  Outline of a New Approach to the Analysis of Complex Systems and Decision Processes , 1973, IEEE Trans. Syst. Man Cybern..

[41]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[42]  H. S. Teng,et al.  Adaptive real-time anomaly detection using inductively generated sequential patterns , 1990, Proceedings. 1990 IEEE Computer Society Symposium on Research in Security and Privacy.

[43]  Ladislav Hluchý,et al.  Data mining and integration for predicting significant meteorological phenomena , 2010, ICCS.

[44]  H. D. Ardakani,et al.  Application of data mining techniques in stock markets: A survey , 2010 .

[45]  Chris Chatfield,et al.  Time‐series forecasting , 2000 .

[46]  Sameer Singh,et al.  Novelty detection: a review - part 1: statistical approaches , 2003, Signal Process..

[47]  Jianzhong Li,et al.  Unsupervised Outlier Detection in Sensor Networks Using Aggregation Tree , 2007, ADMA.

[48]  Craig B. Borkowf,et al.  Time-Series Forecasting , 2002, Technometrics.

[49]  Fabio Castelli,et al.  A Predictive Multidimensional Model for Vegetation Anomalies Derived From Remote-Sensing Observations , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[50]  Douglas J. Hunsaker,et al.  Combining Remotely Sensed Data and Ground-Based Radiometers to Estimate Crop Cover and Surface Temperatures at Daily Time Steps , 2010 .

[51]  João Gama,et al.  A system for analysis and prediction of electricity-load streams , 2009, Intell. Data Anal..