In-Network Processing of Joins in Wireless Sensor Networks

The join, or correlated filtering, of sensor readings is one of the fundamental query operations in wireless sensor networks (WSNs). Although the join in centralized and distributed databases is a well-researched problem, join processing in WSNs has quite different characteristics and is much more difficult to perform, because statistics on sensor readings are lacking and sensor nodes are resource-constrained. Since data transmission is orders of magnitude more costly than processing at a sensor node, in-network processing of joins is essential. This paper surveys the state-of-the-art techniques for join implementation in WSNs, describes the requirements and challenges, join types, and components of join implementation, and identifies open issues for further research.
