Spatial anomaly detection in sensor networks using neighborhood information

A method of neighborhood data fusion in decentralized anomaly detection is proposed.The effects of neighborhood size and spatio-temporal correlation are explored.Performance increases when the system is deployed in a correlated environment.Fusing small neighborhoods is preferred over larger neighborhoods. The field of wireless sensor networks (WSNs), embedded systems with sensing and networking capability, has now matured after a decade-long research effort and technological advances in electronics and networked systems. An important remaining challenge now is to extract meaningful information from the ever-increasing amount of sensor data collected by WSNs. In particular, there is strong interest in algorithms capable of automatic detection of patterns, events or other out-of-the order, anomalous system behavior. Data anomalies may indicate states of the system that require further analysis or prompt actions. Traditionally, anomaly detection techniques are executed in a central processing facility, which requires the collection of all measurement data at a central location, an obvious limitation for WSNs due to the high data communication costs involved. In this paper we explore the extent by which one may depart from this classical centralized paradigm, looking at decentralized anomaly detection based on unsupervised machine learning. Our aim is to detect anomalies at the sensor nodes, as opposed to centrally, to reduce energy and spectrum consumption. We study the information gain coming from aggregate neighborhood data, in comparison to performing simple, in-node anomaly detection. We evaluate the effects of neighborhood size and spatio-temporal correlation on the performance of our new neighborhood-based approach using a range of real-world network deployments and datasets. We find the conditions that make neighborhood data fusion advantageous, identifying also the cases in which this approach does not lead to detectable improvements. Improvements are linked to the diffusive properties of data (spatio-temporal correlations) but also to the type of sensors, anomalies and network topological features. Overall, when a dataset stems from a similar mixture of diffusive processes precision tends to benefit, particularly in terms of recall. Our work paves the way towards understanding how distributed data fusion methods may help managing the complexity of wireless sensor networks, for instance in massive Internet of Things scenarios.

[1]  Antonio Liotta,et al.  Online Fusion of Incremental Learning for Wireless Sensor Networks , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[2]  Daniel Curiac,et al.  Ensemble based sensing anomaly detection in wireless sensor networks , 2012, Expert Syst. Appl..

[3]  Magnus Löfstrand,et al.  Data stream forecasting for system fault prediction , 2012, Comput. Ind. Eng..

[4]  A. Liotta The cognitive NET is coming , 2013, IEEE Spectrum.

[5]  Ran Wolff,et al.  Noname manuscript No. (will be inserted by the editor) In-Network Outlier Detection in Wireless Sensor Networks , 2022 .

[6]  M. Palaniswami,et al.  Distributed Anomaly Detection in Wireless Sensor Networks , 2006, 2006 10th IEEE Singapore International Conference on Communication Systems.

[7]  Ramesh Govindan,et al.  On the Prevalence of Sensor Faults in Real-World Deployments , 2007, 2007 4th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks.

[8]  Antonio Liotta,et al.  Machine Learning Approach for Quality of Experience Aware Networks , 2010, 2010 International Conference on Intelligent Networking and Collaborative Systems.

[9]  Louis G. Birta,et al.  Modelling and Simulation , 2013, Simulation Foundations, Methods and Applications.

[10]  Ting Wang,et al.  Adaptive Routing for Sensor Networks using Reinforcement Learning , 2006, The Sixth IEEE International Conference on Computer and Information Technology (CIT'06).

[11]  Simon A. Dobson,et al.  Data Collection with In-network Fault Detection Based on Spatial Correlation , 2014, 2014 International Conference on Cloud and Autonomic Computing.

[12]  G. Maciá-Fernández,et al.  Anomaly-based network intrusion detection: Techniques, systems and challenges , 2009, Comput. Secur..

[13]  K. Romer,et al.  Aggregating sensor data from overlapping multi-hop network neighborhoods: Push or pull? , 2008, 2008 5th International Conference on Networked Sensing Systems.

[14]  Raman K. Mehra,et al.  Ensemble methods for anomaly detection and distributed intrusion detection in Mobile Ad-Hoc Networks , 2008, Inf. Fusion.

[15]  Kah Phooi Seng,et al.  Classical and swarm intelligence based routing protocols for wireless sensor networks: A survey and comparison , 2012, J. Netw. Comput. Appl..

[16]  George Pavlou,et al.  Exploiting agent mobility for large-scale network monitoring , 2002, IEEE Netw..

[17]  Dejan J. Sobajic,et al.  Learning and generalization characteristics of the random vector Functional-link net , 1994, Neurocomputing.

[18]  Louis G. Birta,et al.  Modelling and Simulation: Exploring Dynamic System Behaviour , 2007 .

[19]  Wen-Zhan Song,et al.  Volcanic earthquake timing using wireless sensor networks , 2013, 2013 ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).

[20]  XuLi,et al.  The Internet of Things--A survey of topics and trends , 2015 .

[21]  Antonio Liotta,et al.  Anomaly Detection in Sensor Systems Using Lightweight Machine Learning , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[22]  T. C. Aysal,et al.  Distributed Average Consensus With Dithered Quantization , 2008, IEEE Transactions on Signal Processing.

[23]  Arnold P. Boedihardjo,et al.  GLS-SOD: a generalized local statistical approach for spatial outlier detection , 2010, KDD '10.

[24]  Toufik Ahmed,et al.  On Energy Efficiency in Collaborative Target Tracking in Wireless Sensor Network: A Review , 2013, IEEE Communications Surveys & Tutorials.

[25]  Karsten Steinhaeuser,et al.  Motivating Complex Dependence Structures in Data Mining: A Case Study with Anomaly Detection in Climate , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[26]  Maria E. Orlowska,et al.  On the Optimal Robot Routing Problem in Wireless Sensor Networks , 2007 .

[27]  N. C. Silver,et al.  Averaging Correlation Coefficients: Should Fishers z Transformation Be Used? , 1987 .

[28]  Roozbeh Jafari,et al.  Enabling Effective Programming and Flexible Management of Efficient Body Sensor Network Applications , 2013, IEEE Transactions on Human-Machine Systems.

[29]  Fabienne Gaillard,et al.  Quality Control of Large Argo Datasets , 2009 .

[30]  N. Chitradevi,et al.  Efficient Density Based Techniques for Anomalous Data Detection in Wireless Sensor Networks , 2013 .

[31]  Shreyas Sundaram,et al.  Consensus of multi-agent networks in the presence of adversaries using only local information , 2012, HiCoNS '12.

[32]  David E. Culler,et al.  TOSSIM: accurate and scalable simulation of entire TinyOS applications , 2003, SenSys '03.

[33]  Guang-Bin Huang,et al.  Extreme learning machine: a new learning scheme of feedforward neural networks , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[34]  Giancarlo Fortino,et al.  A framework for collaborative computing and multi-sensor data fusion in body sensor networks , 2015, Inf. Fusion.

[35]  D. Powers Evaluation: From Precision, Recall and F-Factor to ROC, Informedness, Markedness & Correlation , 2008 .

[36]  Kate Smith-Miles,et al.  A Comprehensive Survey of Data Mining-based Fraud Detection Research , 2010, ArXiv.

[37]  Michael Batty,et al.  Entropy, complexity, and spatial information , 2014, Journal of Geographical Systems.

[38]  George Pavlou,et al.  Effective management through prediction-based clustering approach in the next-generation ad hoc networks , 2004, 2004 IEEE International Conference on Communications (IEEE Cat. No.04CH37577).

[39]  Nirvana Meratnia,et al.  Distributed online outlier detection in wireless sensor networks using ellipsoidal support vector machine , 2013, Ad Hoc Networks.

[40]  Marcus Chang,et al.  Mote-Based Online Anomaly Detection Using Echo State Networks , 2009, DCOSS.

[41]  Richard M. Murray,et al.  DISTRIBUTED SENSOR FUSION USING DYNAMIC CONSENSUS , 2005 .

[42]  HyungJune Lee,et al.  Improving Wireless Simulation Through Noise Modeling , 2007, 2007 6th International Symposium on Information Processing in Sensor Networks.

[43]  Ananthram Swami,et al.  Achieving Consensus in Self-Organizing Wireless Sensor Networks: The Impact of Network Topology on Energy Consumption , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[44]  F. Massey The Kolmogorov-Smirnov Test for Goodness of Fit , 1951 .

[45]  Muttukrishnan Rajarajan,et al.  A survey of intrusion detection techniques in Cloud , 2013, J. Netw. Comput. Appl..

[46]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[47]  Ling Li,et al.  Distributed data mining: a survey , 2012, Inf. Technol. Manag..

[48]  Antonio Liotta,et al.  A survey on networks for smart-metering systems , 2012, Int. J. Pervasive Comput. Commun..

[49]  Hans-Peter Kriegel,et al.  Local outlier detection reconsidered: a generalized view on locality with applications to spatial, video, and network outlier detection , 2012, Data Mining and Knowledge Discovery.

[50]  Jun Luo,et al.  Energy efficient routing with adaptive data fusion in sensor networks , 2005, DIALM-POMC '05.

[51]  J. Rodgers,et al.  Thirteen ways to look at the correlation coefficient , 1988 .

[52]  Weili Wu,et al.  Localized Outlying and Boundary Data Detection in Sensor Networks , 2007, IEEE Transactions on Knowledge and Data Engineering.

[53]  Koen Langendoen,et al.  Murphy loves potatoes: experiences from a pilot sensor network deployment in precision agriculture , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[54]  Ashok N. Srivastava,et al.  Anomaly Detection and Diagnosis Algorithms for Discrete Symbol Sequences with Applications to Airline Safety , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[55]  Giancarlo Fortino,et al.  A flexible building management framework based on wireless sensor and actuator networks , 2012, J. Netw. Comput. Appl..

[56]  Doina Bucur,et al.  Applying time series analysis and neighbourhood voting in a decentralised approach for fault detection and classification in WSNs , 2013, SoICT.

[57]  Jing Li,et al.  A Hierarchical Framework Using Approximated Local Outlier Factor for Efficient Anomaly Detection , 2013, ANT/SEIT.

[58]  Gabriel Maciá-Fernández,et al.  Anomaly-based network intrusion detection: Techniques, systems and challenges , 2009, Comput. Secur..

[59]  Yunhao Liu,et al.  Does Wireless Sensor Network Scale? A Measurement Study on GreenOrbs , 2011, IEEE Transactions on Parallel and Distributed Systems.

[60]  Antonio Liotta,et al.  Ensembles of incremental learners to detect anomalies in ad hoc sensor networks , 2015, Ad Hoc Networks.

[61]  Edwin Lughofer,et al.  Fault detection in multi-sensor networks based on multivariate time-series models and orthogonal transformations , 2014, Inf. Fusion.

[62]  Sanjay Chawla,et al.  SLOM: a new measure for local spatial outliers , 2006, Knowledge and Information Systems.

[63]  Mohd Fauzi Othman,et al.  Wireless Sensor Network Applications: A Study in Environment Monitoring System , 2012 .

[64]  Bernhard Sick,et al.  Online Segmentation of Time Series Based on Polynomial Least-Squares Approximations , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[65]  Giancarlo Fortino,et al.  Discovery of Hidden Correlations between Heterogeneous Wireless Sensor Data Streams , 2014, IDCS.

[66]  Weiming Shen,et al.  Collaborative Wireless Sensor Networks: Architectures, Algorithms and Applications , 2015, Inf. Fusion.

[67]  Vanish Talwar,et al.  Statistical techniques for online anomaly detection in data centers , 2011, 12th IFIP/IEEE International Symposium on Integrated Network Management (IM 2011) and Workshops.

[68]  Sanjay Kumar Madria,et al.  A Survey of Methods for Finding Outliers in Wireless Sensor Networks , 2013, Journal of Network and Systems Management.

[69]  Sriparna Basu,et al.  Modelling and Simulation of Diffusive Processes , 2014, Simulation Foundations, Methods and Applications.

[70]  Hwee Pink Tan,et al.  Machine Learning in Wireless Sensor Networks: Algorithms, Strategies, and Applications , 2014, IEEE Communications Surveys & Tutorials.

[71]  Antonio Liotta,et al.  Online Extreme Learning on Fixed-Point Sensor Networks , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[72]  Xiuzhen Cheng,et al.  Localized Outlying and Boundary Data Detection in Sensor Networks , 2007 .

[73]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[74]  Zahir Tari,et al.  Distributed anomaly detection for industrial wireless sensor networks based on fuzzy data modelling , 2013, J. Parallel Distributed Comput..

[75]  Baltasar Beferull-Lozano,et al.  Distributed consensus algorithms for SVM training in wireless sensor networks , 2008, 2008 16th European Signal Processing Conference.