Data reduction and gps-free node localization in wireless sensor networks

This thesis addresses the topics of data reduction via sampling in both central database environments, and wireless sensor networks, and GPS-free node localization in wireless sensor networks. The first contribution of this thesis is a deterministic sampling algorithm for sampling count data, which is common in data mining applications. We show that our algorithm creates more accurate and higher quality samples compared to previous work, and the samples it generates can be used as a surrogate for the original high volume data. Our second contribution is a deterministic weighted sampling algorithm that can be used as a new data aggregation method for wireless sensor network data. The aggregation algorithm shares similar ideas with our previous sampling algorithm. In order to adapt to the sensor network environment, however, we designed our algorithm to perform weighted sampling in a distributed manner. The weighted sampling design allows the algorithm to work with any arbitrary network topology, while the distributed design divides the sampling work equally on all the sensor nodes in the network and prevents any node from being a bottleneck (both with regards to CPU consumption, and communication). We show that our aggregation algorithm generates samples of better quality than previous algorithms, using far less energy. Our last contribution is two GPS-free node localization algorithms, termed GPS-free Directed Localization (GDL), and GPS & Compass-free Directed Localization (GCDL). The importance of localization is apparent in mobile wireless sensor networks, where the neighborhood changes frequently and knowledge about the neighbor positions is essential for performing additional tasks such as aggregation or coherent movement. Our algorithms perform localization without the need of Global Positioning System (GPS) or any other infrastructure (e.g., anchor points). These algorithms work with only local knowledge without using historical data, and exploit mobility to perform localization. The memoryless aspect of our algorithms avoids the accumulation error over time, which is essential in mobility scenarios where coherent movement of a swarm of nodes is required. We show that our algorithms do work even at high environmental noise levels, and keep a nice semi-rigid network formation in mobility scenarios.

[1]  James N. Rosenau,et al.  To learn more , 2004, IEEE Potentials.

[2]  Hervé Brönnimann,et al.  A new deterministic data aggregation method for wireless sensor networks , 2007, Signal Process..

[3]  Yossi Matias,et al.  New sampling-based summary statistics for improving approximate query answers , 1998, SIGMOD '98.

[4]  Hervé Brönnimann,et al.  Deterministic algorithms for sampling count data , 2008, Data Knowl. Eng..

[5]  Mohamed A. Sharaf,et al.  Balancing energy efficiency and quality of aggregate data in sensor networks , 2004, The VLDB Journal.

[6]  Gianluca Mazzini,et al.  Localization in sensor networks with fading and mobility , 2002, The 13th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications.

[7]  Carsten Lund,et al.  Learn more, sample less: control of volume and variance in network measurement , 2005, IEEE Transactions on Information Theory.

[8]  David E. Culler,et al.  System architecture directions for networked sensors , 2000, SIGP.

[9]  Haibo Hu,et al.  Energy-Efficient Monitoring of Spatial Predicates over Moving Objects , 2005, IEEE Data Eng. Bull..

[10]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[11]  Krishna M. Sivalingam,et al.  Data Gathering Algorithms in Sensor Networks Using Energy Metrics , 2002, IEEE Trans. Parallel Distributed Syst..

[12]  Hannu Toivonen,et al.  Sampling Large Databases for Association Rules , 1996, VLDB.

[13]  Erik D. Demaine,et al.  Anchor-Free Distributed Localization in Sensor Networks , 2003 .

[14]  John Anderson,et al.  Wireless sensor networks for habitat monitoring , 2002, WSNA '02.

[15]  Bernard Chazelle,et al.  The Discrepancy Method , 1998, ISAAC.

[16]  Bin Chen,et al.  Efficient Data-Reduction Methods for On-line Association Rule Discovery , 2004 .

[17]  Jeffrey Scott Vitter,et al.  Random sampling with a reservoir , 1985, TOMS.

[18]  Divyakant Agrawal,et al.  Medians and beyond: new aggregation techniques for sensor networks , 2004, SenSys '04.

[19]  Srinivasan Parthasarathy,et al.  Evaluation of sampling for data mining of association rules , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[20]  Doron Rotem,et al.  Random sampling from databases: a survey , 1995 .

[21]  Sándor P. Fekete,et al.  Shawn: A new approach to simulating wireless sensor networks , 2005, ArXiv.

[22]  Sharad Mehrotra,et al.  Capturing sensor-generated time series with quality guarantees , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[23]  Ying Xing,et al.  The Design of the Borealis Stream Processing Engine , 2005, CIDR.

[24]  Yechiam Yemini,et al.  Some theoretical aspects of position-location problems , 1979, 20th Annual Symposium on Foundations of Computer Science (sfcs 1979).

[25]  Pat Langley,et al.  Static Versus Dynamic Sampling for Data Mining , 1996, KDD.

[26]  Deborah Estrin,et al.  Robust range estimation using acoustic and multimodal sensing , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[27]  Alex Delis,et al.  GPS-Free node localization in mobile wireless sensor networks , 2006, MobiDE '06.

[28]  Tracy Camp,et al.  A survey of mobility models for ad hoc network research , 2002, Wirel. Commun. Mob. Comput..

[29]  Xiaoyan Hong,et al.  A group mobility model for ad hoc wireless networks , 1999, MSWiM '99.

[30]  Srdjan Capkun,et al.  GPS-free Positioning in Mobile Ad Hoc Networks , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[31]  Suman Nath,et al.  Tributaries and deltas: efficient and robust aggregation in sensor network streams , 2005, SIGMOD '05.

[32]  Peter J. Haas,et al.  The New Jersey Data Reduction Report , 1997 .

[33]  Gordon L. Stüber,et al.  Overview of radiolocation in CDMA cellular systems , 1998, IEEE Commun. Mag..

[34]  Nick Roussopoulos,et al.  Compressing historical information in sensor networks , 2004, SIGMOD '04.

[35]  Yong Yao,et al.  The cougar approach to in-network query processing in sensor networks , 2002, SGMD.

[36]  Anantha P. Chandrakasan,et al.  An application-specific protocol architecture for wireless microsensor networks , 2002, IEEE Trans. Wirel. Commun..

[37]  Nitin H. Vaidya,et al.  Location-aided routing (LAR) in mobile ad hoc networks , 1998, MobiCom '98.

[38]  Sudipto Guha,et al.  Approximation and streaming algorithms for histogram construction problems , 2006, TODS.

[39]  Edward Y. Chang,et al.  Adaptive sampling for sensor networks , 2004, DMSN '04.

[40]  Antonios Deligiannakis,et al.  Data Reduction Techniques in Sensor Networks , 2005, IEEE Data Eng. Bull..

[41]  Piotr Indyk,et al.  Low-Dimensional Embedding with Extra Information , 2006, Discret. Comput. Geom..

[42]  Hervé Brönnimann,et al.  Practical and Efficient Geometric Epsilon-Approximations , 2006, CCCG.

[43]  Wei Hong,et al.  Proceedings of the 5th Symposium on Operating Systems Design and Implementation Tag: a Tiny Aggregation Service for Ad-hoc Sensor Networks , 2022 .

[44]  S. Muthukrishnan,et al.  AQUA: System and Techniques for Approximate Query Answering , 1998 .

[45]  Wei Hong,et al.  The design of an acquisitional query processor for sensor networks , 2003, SIGMOD '03.

[46]  Gaurav S. Sukhatme,et al.  Constrained coverage for mobile sensor networks , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[47]  Jeffrey Considine,et al.  Approximately uniform random sampling in sensor networks , 2004, DMSN '04.

[48]  Srinivasan Seshan,et al.  Synopsis diffusion for robust aggregation in sensor networks , 2004, SenSys '04.

[49]  Sanjeev Khanna,et al.  Power-conserving computation of order-statistics over sensor networks , 2004, PODS.

[50]  Wendi Heinzelman,et al.  Energy-efficient communication protocol for wireless microsensor networks , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[51]  Jeffrey Considine,et al.  Approximate aggregation techniques for sensor databases , 2004, Proceedings. 20th International Conference on Data Engineering.

[52]  Gaurav S. Sukhatme,et al.  Ad-hoc localization using ranging and sectoring , 2004, IEEE INFOCOM 2004.

[53]  David Evans,et al.  Localization for mobile sensor networks , 2004, MobiCom '04.

[54]  S. Muthukrishnan,et al.  Estimating Rarity and Similarity over Data Stream Windows , 2002, ESA.

[55]  Bin Chen,et al.  A new two-phase sampling based algorithm for discovering association rules , 2002, KDD.

[56]  Robert Tappan Morris,et al.  Span: An Energy-Efficient Coordination Algorithm for Topology Maintenance in Ad Hoc Wireless Networks , 2002, Wirel. Networks.

[57]  Leonidas J. Guibas,et al.  Wireless sensor networks - an information processing approach , 2004, The Morgan Kaufmann series in networking.

[58]  Deborah Estrin,et al.  GPS-less low-cost outdoor localization for very small devices , 2000, IEEE Wirel. Commun..

[59]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[60]  Helen J. Wang,et al.  Online aggregation , 1997, SIGMOD '97.

[61]  Vijay Kumar,et al.  Performance of dead reckoning-based location service for mobile ad hoc networks , 2004, Wirel. Commun. Mob. Comput..

[62]  Hervé Brönnimann,et al.  Deterministic Data Reduction in Sensor Networks , 2006, 2006 IEEE International Conference on Mobile Ad Hoc and Sensor Systems.

[63]  Carla E. Brodley,et al.  KDD-Cup 2000 organizers' report: peeling the onion , 2000, SKDD.

[64]  Theodore Johnson,et al.  Sampling algorithms in a stream operator , 2005, SIGMOD '05.

[65]  Rajeev Motwani,et al.  Sampling from a moving window over streaming data , 2002, SODA '02.

[66]  Bin Chen,et al.  Efficient data reduction with EASE , 2003, KDD '03.

[67]  Yossi Matias,et al.  Fast incremental maintenance of approximate histograms , 1997, TODS.