Ensemble based sensing anomaly detection in wireless sensor networks

Highlights? We developed an efficient ensemble of five classifiers to detect sensor anomalies. ? We model the correct sensor behavior in complex/dynamic environments using five models. ? We solve the problems related to training, testing and evaluation of the ensemble. ? Our ensemble offers reliable estimations to replace the erroneous measurements. ? We present a case study to show the preciseness of our method. Wireless sensor networks are often used to monitor and measure physical characteristics from remote and sometimes hostile environments. In these circumstances the sensing data accuracy is a crucial attribute for the way these applications complete their objectives, requiring efficient solutions to discover sensor anomalies. Such solutions are hard to be found mainly because the intricate defining of the correct sensor behavior in a complex and dynamic environment. This paper tackles the sensing anomaly detection from a new perspective by modeling the correct operation of sensors not by one, but by five different dynamical models, acting synergically to provide a reliable solution. Our methodology relies on an ensemble based system composed of a set of diverse binary classifiers, adequately selected, to implement a complex decisional system on network base station. Moreover, every time a sensing anomaly is discovered, our ensemble offers a reliable estimation to replace the erroneous measurement provided by sensor.

[1]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[2]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[3]  Jian Ma,et al.  A comparative assessment of ensemble learning for credit scoring , 2011, Expert Syst. Appl..

[4]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  S. Papavassiliou,et al.  Diagnosing Anomalies and Identifying Faulty Nodes in Sensor Networks , 2007, IEEE Sensors Journal.

[6]  Atsuyoshi Nakamura,et al.  Comparison of Bagging and Boosting Algorithms on Sample and Feature Weighting , 2009, MCS.

[7]  Jyh-Shing Roger Jang,et al.  Fuzzy Modeling Using Generalized Neural Networks and Kalman Filter Algorithm , 1991, AAAI.

[8]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[9]  Supakit Siripanadorn,et al.  Anomaly detection in wireless sensor networks using self-organizing map and wavelets , 2010 .

[10]  Felix Naumann,et al.  Data fusion , 2009, CSUR.

[11]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[12]  Dae-Ki Kang,et al.  Ensemble with neural networks for bankruptcy prediction , 2010, Expert Syst. Appl..

[13]  Alex Simpkins,et al.  System Identification: Theory for the User, 2nd Edition (Ljung, L.; 1999) [On the Shelf] , 2012, IEEE Robotics & Automation Magazine.

[14]  Dan Pescaru,et al.  Redundancy and its applications in wireless sensor networks: a survey , 2009 .

[15]  Oliver Obst,et al.  Spatiotemporal Anomaly Detection in Gas Monitoring Sensor Networks , 2008, EWSN.

[16]  Lennart Ljung,et al.  System identification toolbox for use with MATLAB , 1988 .

[17]  Miao Xie,et al.  Anomaly Detection in Wireless Sensor Networks , 2013 .

[18]  Lennart Ljung,et al.  Theory and Practice of Recursive Identification , 1983 .

[19]  Grigorios Tsoumakas,et al.  Effective Voting of Heterogeneous Classifiers , 2004, ECML.

[20]  Hui Li,et al.  Case-based reasoning ensemble and business application: A computational approach from multiple case representations driven by randomness , 2012, Expert Syst. Appl..

[21]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[22]  Manfred K. Warmuth,et al.  The weighted majority algorithm , 1989, 30th Annual Symposium on Foundations of Computer Science.

[23]  Bin Yang,et al.  Rotation-based RLS algorithms: unified derivations, numerical properties, and parallel implementations , 1992, IEEE Trans. Signal Process..

[24]  Soon Heung Chang,et al.  Detection of process anomalies using an improved statistical learning framework , 2011, Expert Syst. Appl..

[25]  Nicolas Garc ´ õa-Pedrajas Boosting k-Nearest Neighbor Classifier by Means of Input Space Projection , 2008 .

[26]  Lynne E. Parker,et al.  Detecting time-related changes in Wireless Sensor Networks using symbol compression and Probabilistic Suffix Trees , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[27]  Weisong Shi,et al.  Wireless Sensor Network Security: A Survey , 2006 .

[28]  Jean-Michel Morel,et al.  A Theory of Shape Identification , 2008 .

[29]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[30]  Sargur N. Srihari,et al.  Decision Combination in Multiple Classifier Systems , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Jaideep Srivastava,et al.  Diversity in Combinations of Heterogeneous Classifiers , 2009, PAKDD.

[32]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[33]  Berkman Sahiner,et al.  Dual system approach to computer-aided detection of breast masses on mammograms. , 2006, Medical physics.