Deterministic Data Reduction in Sensor Networks

The processing capabilities of wireless sensor nodes enable to aggregate redundant data to limit total data flow over the network. The main property of a good aggregation algorithm is to extract the most representative data by using minimum resources. From this point of view, sampling is a promising aggregation method, that acts as surrogate for the whole data, and once extracted can be used to answer multiple kinds of queries (such as AVG, MEDIAN, SUM, COUNT, etc.), at no extra cost. Additionally, sampling also preserves the correlation info within multi-dimensional data, which is quite valuable for further data mining. In this paper, we propose a novel, distributed, weighted sampling algorithm to sample sensor network data and compare to an existing random sampling algorithm, which to the best of our knowledge is the only algorithm to work in this kind of setting

[1]  Graham Cormode,et al.  Holistic aggregates in a networked world: distributed tracking of approximate quantiles , 2005, SIGMOD '05.

[2]  Hüseyin Akcan,et al.  Deterministic Sampling beyond EASE : Reducing Multi-Dimensional Data ∗ , 2005 .

[3]  S. Muthukrishnan,et al.  AQUA: System and Techniques for Approximate Query Answering , 1998 .

[4]  Hannu Toivonen,et al.  Sampling Large Databases for Association Rules , 1996, VLDB.

[5]  Sanjeev Khanna,et al.  Power-conserving computation of order-statistics over sensor networks , 2004, PODS.

[6]  Venkatesh Saligrama,et al.  Adaptive statistical sampling methods for decentralized estimation and detection of localized phenomena , 2005 .

[7]  Peter J. Haas,et al.  The New Jersey Data Reduction Report , 1997 .

[8]  Sándor P. Fekete,et al.  Shawn: A new approach to simulating wireless sensor networks , 2005, ArXiv.

[9]  Bin Chen,et al.  Efficient data reduction with EASE , 2003, KDD '03.

[10]  L. Bukhman,et al.  Approximation of iceberg cubes using data reduction techniques , 2005 .

[11]  John Anderson,et al.  Wireless sensor networks for habitat monitoring , 2002, WSNA '02.

[12]  Haibo Hu,et al.  Energy-Efficient Monitoring of Spatial Predicates over Moving Objects , 2005, IEEE Data Eng. Bull..

[13]  Divyakant Agrawal,et al.  Medians and beyond: new aggregation techniques for sensor networks , 2004, SenSys '04.

[14]  Srinivasan Parthasarathy,et al.  Evaluation of sampling for data mining of association rules , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[15]  Wei Hong,et al.  Proceedings of the 5th Symposium on Operating Systems Design and Implementation Tag: a Tiny Aggregation Service for Ad-hoc Sensor Networks , 2022 .

[16]  Jeffrey Considine,et al.  Approximately uniform random sampling in sensor networks , 2004, DMSN '04.

[17]  Srinivasan Seshan,et al.  Synopsis diffusion for robust aggregation in sensor networks , 2004, SenSys '04.

[18]  Wei Hong,et al.  The design of an acquisitional query processor for sensor networks , 2003, SIGMOD '03.

[19]  Bin Chen,et al.  Efficient Data-Reduction Methods for On-line Association Rule Discovery , 2004 .

[20]  Venkatesh Saligrama,et al.  Adaptive statistical sampling methods for decentralized estimation and detection of localized phenomena , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[21]  Jeffrey Considine,et al.  Approximate aggregation techniques for sensor databases , 2004, Proceedings. 20th International Conference on Data Engineering.

[22]  Bin Chen,et al.  A new two-phase sampling based algorithm for discovering association rules , 2002, KDD.

[23]  Suman Nath,et al.  Tributaries and deltas: efficient and robust aggregation in sensor network streams , 2005, SIGMOD '05.

[24]  Robert Tappan Morris,et al.  Span: An Energy-Efficient Coordination Algorithm for Topology Maintenance in Ad Hoc Wireless Networks , 2001, MobiCom '01.

[25]  Edward Y. Chang,et al.  Adaptive sampling for sensor networks , 2004, DMSN '04.

[26]  Antonios Deligiannakis,et al.  Data Reduction Techniques in Sensor Networks , 2005, IEEE Data Eng. Bull..