Trend Cluster Based Kriging Interpolation in Sensor Data Networks

Spatio-temporal data collected in sensor networks are often affected by faults due to power outage at nodes, wrong time synchronizations, interference, network transmission failures, sensor hardware issues or high energy consumption during communications. Therefore, acquisition of information by wireless sensor networks is a challenging step in monitoring physical ubiquitous phenomena (e.g. weather, pollution, traffic). This issue gives raise to a fundamental trade-off: higher density of sensors provides more data, higher resolution and better accuracy, but requires more communications and processing. A data mining approach to reduce communication and energy requirements is investigated: the number of transmitting sensors is decreased as much as possible, even keeping a reasonable degree of data accuracy. Kriging techniques and trend cluster discovery are employed to estimate unknown data in any un-sampled location of the space and at any time point of the past. Kriging is a statistical interpolation group of techniques, suited for spatial data, which estimates the unknown data in any space location by a proper weighted mean of nearby observed data. The trend clusters are stream patterns which compactly represent sensor data by means of spatial clusters having prominent data trends in time. Kriging is here applied to estimate unknown data taking into account a spatial correlation model of the sensor network. Trends are used as a guideline to transfer this model across the time horizon of the trend itself. Experiments are performed with a real sensor data network, in order to evaluate this interpolation technique and demonstrate that Kriging and trend clusters outperform, in terms of accuracy, interpolation competitors like Nearest Neighbor or Inverse Distance Weighting.

[1]  Jerry L. Prince,et al.  The kriging update model and recursive space-time function estimation , 1999, IEEE Trans. Signal Process..

[2]  Shonali Krishnaswamy,et al.  Mining data streams: a review , 2005, SGMD.

[3]  D. Shepard A two-dimensional interpolation function for irregularly-spaced data , 1968, ACM National Conference.

[4]  Z. Ignjatovic,et al.  An energy conservation method for wireless sensor networks employing a blue noise spatial sampling technique , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[5]  M. Tomczak,et al.  Spatial Interpolation and its Uncertainty Using Automated Anisotropic Inverse Distance Weighting (IDW) - Cross-Validation/Jackknife Approach , 1998 .

[6]  Donato Malerba,et al.  Trend cluster based interpolation everywhere in a sensor network , 2012, SAC '12.

[7]  N. Cressie The origins of kriging , 1990 .

[8]  Michael Edward Hohn,et al.  An Introduction to Applied Geostatistics: by Edward H. Isaaks and R. Mohan Srivastava, 1989, Oxford University Press, New York, 561 p., ISBN 0-19-505012-6, ISBN 0-19-505013-4 (paperback), $55.00 cloth, $35.00 paper (US) , 1991 .

[9]  Dinesh Verma,et al.  A survey of sensor selection schemes in wireless sensor networks , 2007, SPIE Defense + Commercial Sensing.

[10]  Donato Malerba,et al.  Summarization for Geographically Distributed Data Streams , 2010, KES.

[11]  R. Nowak,et al.  Backcasting: adaptive sampling for sensor networks , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[12]  Neeraj Suri,et al.  ASample: Adaptive Spatial Sampling in Wireless Sensor Networks , 2010, 2010 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing.

[13]  Donato Malerba,et al.  Online and Offline Trend Cluster Discovery in Spatially Distributed Data Streams , 2010, MSM/MUSE.