A Spatial Correlation Based Adaptive Missing Data Estimation Algorithm in Wireless Sensor Networks

In wireless sensor networks, loss of sensor data is inevitable because of the inherent characteristics of such networks, and it causes difficulties for many applications. To address this problem, the missing data should be estimated as accurately as possible. This paper proposes an adaptive missing data estimation algorithm based on the spatial correlation of sensor data. The algorithm adopts a multiple regression model that estimates the missing data from the data of multiple neighbor nodes jointly rather than independently, which makes its estimation performance stable and reliable. In addition, it adjusts the estimation equation adaptively for each missing value to capture the dynamic correlation of sensor data, and can therefore estimate the missing data more accurately. Furthermore, it provides a confidence interval for each missing value at a given confidence level, which is very helpful for users. Experimental results on two real-world datasets show that the proposed algorithm estimates the missing data accurately.
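
The general idea of regressing a node's reading on several neighbors jointly and attaching an interval to the estimate can be illustrated with a minimal sketch. It is not the paper's algorithm: the function names (`solve_linear`, `estimate_missing`), the sample readings, the use of ordinary least squares on a fixed history window, and the normal-quantile approximation of the interval are all assumptions made for illustration, and the adaptive adjustment of the estimation equation is not shown.

```python
from statistics import NormalDist


def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def estimate_missing(neighbor_history, target_history, neighbor_now,
                     confidence=0.95):
    """Estimate a missing reading from neighbor readings (hypothetical helper).

    neighbor_history: past readings, one tuple of neighbor values per time step
    target_history:   past readings of the node whose value is now missing
    neighbor_now:     neighbor readings at the time of the missing value
    Returns (estimate, (lower, upper)).
    """
    rows = [[1.0] + list(r) for r in neighbor_history]  # intercept column
    n, p = len(rows), len(rows[0])
    # Normal equations (X^T X) beta = X^T y for ordinary least squares.
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * y for r, y in zip(rows, target_history)) for i in range(p)]
    beta = solve_linear(xtx, xty)

    residuals = [y - sum(b * v for b, v in zip(beta, r))
                 for r, y in zip(rows, target_history)]
    sigma2 = sum(e * e for e in residuals) / (n - p)  # residual variance

    x0 = [1.0] + list(neighbor_now)
    estimate = sum(b * v for b, v in zip(beta, x0))
    # Prediction variance: sigma2 * (1 + x0^T (X^T X)^{-1} x0).
    leverage = sum(a * w for a, w in zip(x0, solve_linear(xtx, x0)))
    # Assumption: normal quantile instead of Student's t, to stay stdlib-only.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    half_width = z * (sigma2 * (1.0 + leverage)) ** 0.5
    return estimate, (estimate - half_width, estimate + half_width)


# Made-up temperature readings from two neighbor nodes and the target node.
neighbors = [(20.0, 21.0), (21.0, 21.5), (22.0, 23.0),
             (23.0, 22.5), (24.0, 25.0), (25.0, 24.0)]
target = [19.5, 20.0, 21.4, 21.3, 23.2, 22.9]
value, (low, high) = estimate_missing(neighbors, target, (22.5, 23.5))
print(f"estimate={value:.2f}, 95% interval=({low:.2f}, {high:.2f})")
```

With short history windows, a Student's t quantile with n - p degrees of freedom would give a wider and more appropriate interval than the normal approximation used here.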
