Dynamic join optimization in multi-hop wireless sensor networks

To enable smart environments and self-tuning data centers, we are developing the Aspen system for integrating physical sensor data, stream data derived from the logical state of machines, and database or Web data from the Internet. A key component of this system is a query processor optimized for limited-bandwidth, possibly battery-powered devices that communicate over multi-hop wireless radio links. This query processor is given a portion of a data integration query to execute, possibly including joins among sensors. Several recent papers have developed techniques for computing joins in sensor networks, but these techniques are static and are appropriate only for specific join selectivity ratios. We consider the problem of dynamic join optimization for sensor networks, developing solutions that employ cost modeling, adaptive learning, and self-tuning heuristics to choose the best algorithm under real and variable selectivity values. We focus on in-network join computation, but our architecture extends to other approaches, which we compare against. We first develop basic techniques under the assumptions that selectivities are uniform and known in advance and that optimization can be done on a pairwise basis; we then extend the work to handle joins between multiple pairs when selectivities are not fully known. We experimentally validate our work at scale using standard datasets.
