Multivariate Statistical Approach for Anomaly Detection and Lost Data Recovery in Wireless Sensor Networks

Data loss due to integrity attacks or malfunction constitutes a principal concern in wireless sensor networks (WSNs). The present paper introduces a novel data loss/modification detection and recovery scheme in this context. Both elements, detection and data recovery, rely on a multivariate statistical analysis approach that exploits spatial density, a common feature in network environments such as WSNs. To evaluate the proposal, we consider WSN scenarios based on temperature sensors, both simulated and real. Furthermore, we consider three different routing algorithms, showing the strong interplay among (a) the routing strategy, (b) the negative effect of data loss on the network performance, and (c) the data recovering capability of the approach. We also introduce a novel data arrangement method to exploit the spatial correlation among the sensors in a more efficient manner. In this data arrangement, we only consider the nearest nodes to a given affected sensor, improving the data recovery performance up to 99%. According to the results, the proposed mechanisms based on multivariate techniques improve the robustness of WSNs against data loss.

[1]  Christos Georgakis,et al.  Disturbance detection and isolation by dynamic principal component analysis , 1995 .

[2]  Biswanath Mukherjee,et al.  Wireless sensor network survey , 2008, Comput. Networks.

[3]  Di Guo,et al.  Sparsity-Based Online Missing Data Recovery Using Overcomplete Dictionary , 2012, IEEE Sensors Journal.

[4]  José Camacho,et al.  Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical aspects , 2014 .

[5]  Thomas G. Dietterich,et al.  Spatiotemporal Models for Data-Anomaly Detection in Dynamic Environmental Monitoring Campaigns , 2011, TOSN.

[6]  Lynne E. Parker,et al.  Nearest neighbor imputation using spatial-temporal correlations in wireless sensor networks , 2014, Inf. Fusion.

[7]  J. Macgregor,et al.  Control of batch product quality by trajectory manipulation using latent variable models , 2004 .

[8]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[9]  Jesús Picó,et al.  Bilinear modelling of batch processes. Part II: a comparison of PLS soft‐sensors , 2008 .

[10]  Wei Wang,et al.  Distributed H∞ filtering in sensor networks with randomly occurred missing measurements and communication link failures , 2013, Inf. Sci..

[11]  J.A. Stankovic,et al.  Denial of Service in Sensor Networks , 2002, Computer.

[12]  Jesús Picó,et al.  Bilinear modelling of batch processes. Part I: theoretical discussion , 2008 .

[13]  Lynne E. Parker,et al.  A spatial-temporal imputation technique for classification with missing data in a wireless sensor network , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  Jong Chern Lim,et al.  Robust Data Collection and Lifetime Improvement in Wireless Sensor Networks through Data Imputation , 2010, 2010 Fifth International Conference on Systems and Networks Communications.

[15]  Kang Yen,et al.  Sensor network security: a survey , 2009, IEEE Communications Surveys & Tutorials.

[16]  J. Edward Jackson,et al.  A User's Guide to Principal Components. , 1991 .

[17]  K. Baskaran,et al.  Designing an efficient PCA based data model for wireless sensor networks , 2011, ACWR '11.

[18]  J. Macgregor,et al.  Image texture analysis: methods and comparisons , 2004 .

[19]  Theodora Kourti,et al.  Multivariate SPC Methods for Process and Product Monitoring , 1996 .

[20]  Rekha Jain,et al.  Wireless Sensor Network -A Survey , 2013 .

[21]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[22]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[23]  Jesús Picó,et al.  Multi-phase principal component analysis for batch processes modelling , 2006 .

[24]  Elias S. Manolakos,et al.  Temperature field modeling and simulation of wireless sensor network behavior during a spreading wildfire , 2008, 2008 16th European Signal Processing Conference.

[25]  Mahdi Abadi,et al.  A PCA-based distributed approach for intrusion detection in wireless sensor networks , 2011, 2011 International Symposium on Computer Networks and Distributed Systems (CNDS).

[26]  JAMAL N. AL-KARAKI,et al.  Routing techniques in wireless sensor networks: a survey , 2004, IEEE Wireless Communications.

[27]  H. Hotelling,et al.  Multivariate Quality Control , 1947 .

[28]  Di Guo,et al.  Sparsity-Based Spatial Interpolation in Wireless Sensor Networks , 2011, Sensors.

[29]  Sajal K. Das,et al.  Data Collection in Wireless Sensor Networks with Mobile Elements: A Survey , 2011, TOSN.

[30]  Kay Smarsly,et al.  Decentralized fault detection and isolation in wireless structural health monitoring systems using analytical redundancy , 2014, Adv. Eng. Softw..

[31]  José Camacho,et al.  Cross‐validation in PCA models with the element‐wise k‐fold (ekf) algorithm: theoretical aspects , 2012 .

[32]  Le Gruenwald,et al.  DEMS: a data mining based technique to handle missing data in mobile sensor network applications , 2010, DMSN '10.

[33]  S. Wold,et al.  PLS-regression: a basic tool of chemometrics , 2001 .

[34]  Alberto Ferrer,et al.  Framework for regression‐based missing data imputation methods in on‐line MSPC , 2005 .

[35]  A. Ferrer,et al.  Dealing with missing data in MSPC: several methods, different interpretations, some examples , 2002 .