An efficient approach for outlier detection in big sensor data of health care

Summary In recent years, wireless sensor networks are pervasive and are generating tons of data every second. Performing outlier detection to detect faulty sensors from such a large amount of data becomes a challenging task. Most of the existing techniques for outlier detection in wireless sensor networks concentrate only on contents of the data source without considering correlation among different data attributes. Moreover, these methods are not scalable to big data. To address these 2 limitations, this paper proposes an outlier detection approach based on correlation and dynamic SMO (sequential minimal optimization) regression that is scalable to big data. Initially, correlation is used to find out strongly correlated attributes and then the point anomalous nodes are detected using dynamic SMO regression. For fast processing of big data, Hadoop MapReduce framework is used. The experimental analysis demonstrates that the proposed approach efficiently detects the point and contextual anomalies and reduces the number of false alarms. For experiments, real data of sensors used in body sensor networks are taken from Physionet database.

[1]  Fang Liu,et al.  Insider Attacker Detection in Wireless Sensor Networks , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[2]  Marimuthu Palaniswami,et al.  Combined multiclass classification and anomaly detection for large-scale Wireless Sensor Networks , 2013, 2013 IEEE Eighth International Conference on Intelligent Sensors, Sensor Networks and Information Processing.

[3]  Borko Furht,et al.  Sensor fault and patient anomaly detection and classification in medical wireless sensor networks , 2013, 2013 IEEE International Conference on Communications (ICC).

[4]  Miriam A. M. Capretz,et al.  Contextual Anomaly Detection in Big Sensor Data , 2014, 2014 IEEE International Congress on Big Data.

[5]  Yuan Yao,et al.  Online anomaly detection for sensor systems: A simple and efficient approach , 2010, Perform. Evaluation.

[6]  Jian Pei,et al.  Hierarchical distributed data classification in wireless sensor networks , 2010, Comput. Commun..

[7]  Yaning Liu,et al.  Anomaly Detection in Medical Wireless Sensor Networks , 2013, J. Comput. Sci. Eng..

[8]  Nasser M. Nasrabadi,et al.  A nonlinear kernel-based joint fusion/detection of anomalies using Hyperspectral and SAR imagery , 2008, 2008 15th IEEE International Conference on Image Processing.

[9]  David J. Hill,et al.  Anomaly detection in streaming environmental sensor data: A data-driven modeling approach , 2010, Environ. Model. Softw..

[10]  Chang-Tien Lu,et al.  Spatial Weighted Outlier Detection , 2006, SDM.

[11]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[12]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[13]  Qi Hao,et al.  Mobile Target Scenario Recognition Via Low-Cost Pyroelectric Sensing System: Toward a Context-Enhanced Accurate Identification , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[14]  Nasser M. Nasrabadi Kernel-based joint fusion/detection of anomalies using hyperspectral and SAR imagery , 2009, Defense + Commercial Sensing.

[15]  Borko Furht,et al.  Anomaly Detection in Medical Wireless Sensor Networks using SVM and Linear Regression Models , 2014, Int. J. E Health Medical Commun..

[16]  Shen-Min Song,et al.  Alternative framework of the Gaussian filter for non-linear systems with synchronously correlated noises , 2016 .

[17]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[18]  Zengyou He,et al.  Mining Class Outliers: Concepts, Algorithms and Applications , 2004, WAIM.

[19]  Syed Mahfuzul Aziz,et al.  Sensor Anomaly Detection in Wireless Sensor Networks for Healthcare , 2015, Sensors.

[20]  Jong-Suk Ruth Lee,et al.  Detecting Anomaly Teletraffic Using Stochastic Self-Similarity Based on Hadoop , 2013, 2013 16th International Conference on Network-Based Information Systems.