Model-based approximate querying in sensor networks

Declarative queries are proving to be an attractive paradigm for interacting with networks of wireless sensors. The metaphor that “the sensornet is a database” is problematic, however, because sensors do not exhaustively represent the data in the real world. In order to map the raw sensor readings onto physical reality, a model of that reality is required to complement the readings. In this article, we enrich interactive sensor querying with statistical modeling techniques. We demonstrate that such models can help provide answers that are both more meaningful, and, by introducing approximations with probabilistic confidences, significantly more efficient to compute in both time and energy. Utilizing the combination of a model and live data acquisition raises the challenging optimization problem of selecting the best sensor readings to acquire, balancing the increase in the confidence of our answer against the communication and data acquisition costs in the network. We describe an exponential time algorithm for finding the optimal solution to this optimization problem, and a polynomial-time heuristic for identifying solutions that perform well in practice. We evaluate our approach on several real-world sensor-network datasets, taking into account the real measured data and communication quality, demonstrating that our model-based approach provides a high-fidelity representation of the real phenomena and leads to significant performance gains versus traditional data acquisition techniques.

[1]  Brian W. Kernighan,et al.  An Effective Heuristic Algorithm for the Traveling-Salesman Problem , 1973, Oper. Res..

[2]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[3]  Karen Ward,et al.  Dynamic query evaluation plans , 1989, SIGMOD '89.

[4]  Keiji Kanazawa,et al.  A model for reasoning about persistence and causation , 1989 .

[5]  R. I. Bahar,et al.  Algebraic decision diagrams and their applications , 1993, Proceedings of 1993 International Conference on Computer Aided Design (ICCAD).

[6]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[7]  Goetz Graefe,et al.  Optimization of dynamic query evaluation plans , 1994, SIGMOD '94.

[8]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[9]  David Heckerman,et al.  Learning With Bayesian Networks (Abstract) , 1995, ICML.

[10]  Craig Boutilier,et al.  Exploiting Structure in Policy Construction , 1995, IJCAI.

[11]  B. Sriraman Call for papers. , 2021, Journal of back and musculoskeletal rehabilitation.

[12]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[13]  Timos K. Sellis,et al.  Parametric query optimization , 1992, The VLDB Journal.

[14]  Helen J. Wang,et al.  Online aggregation , 1997, SIGMOD '97.

[15]  Nir Friedman,et al.  Learning Belief Networks in the Presence of Missing Values and Hidden Variables , 1997, ICML.

[16]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1998, Learning in Graphical Models.

[17]  Hector Garcia-Molina,et al.  Filtering with Approximate Predicates , 1998, VLDB.

[18]  Sumit Ganguly,et al.  Design and Analysis of Parametric Query Optimization Algorithms , 1998, VLDB.

[19]  Xavier Boyen,et al.  Tractable Inference for Complex Stochastic Processes , 1998, UAI.

[20]  Yossi Matias,et al.  New sampling-based summary statistics for improving approximate query answers , 1998, SIGMOD '98.

[21]  Peter J. Haas,et al.  Interactive data Analysis: The Control Project , 1999, Computer.

[22]  Sridhar Ramaswamy,et al.  Join synopses for approximate query answering , 1999, SIGMOD '99.

[23]  Peter J. Haas,et al.  In-teractive data analysis with CONTROL , 1999 .

[24]  Michael I. Jordan,et al.  Probabilistic Networks and Expert Systems , 1999 .

[25]  Mehul A. Shah,et al.  Adaptive Query Processing: Technology in Evolution , 2000, IEEE Data Eng. Bull..

[26]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[27]  Deborah Estrin,et al.  Directed diffusion: a scalable and robust communication paradigm for sensor networks , 2000, MobiCom '00.

[28]  Gregory J. Pottie,et al.  Wireless integrated network sensors , 2000, Commun. ACM.

[29]  Phillip B. Gibbons Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports , 2001, VLDB.

[30]  Ben Taskar,et al.  Selectivity estimation using probabilistic models , 2001, SIGMOD '01.

[31]  Rajeev Rastogi,et al.  Independence is good: dependency-based histogram synopses for high-dimensional data , 2001, SIGMOD '01.

[32]  Jennifer Widom,et al.  Adaptive precision setting for cached approximate values , 2001, SIGMOD '01.

[33]  Carlos Guestrin,et al.  Multiagent Planning with Factored MDPs , 2001, NIPS.

[34]  Jennifer Widom,et al.  Best-effort cache synchronization with source cooperation , 2002, SIGMOD '02.

[35]  Sheila A. McIlraith,et al.  Monitoring a Complez Physical System using a Hybrid Dynamic Bayes Net , 2002, UAI.

[36]  Feng Zhao,et al.  Scalable Information-Driven Sensor Querying and Routing for Ad Hoc Heterogeneous Sensor Networks , 2002, Int. J. High Perform. Comput. Appl..

[37]  Sunil Prabhakar,et al.  Evaluating probabilistic queries over imprecise data , 2003, SIGMOD '03.

[38]  Wei Hong,et al.  The design of an acquisitional query processor for sensor networks , 2003, SIGMOD '03.

[39]  Wei Hong,et al.  Beyond Average: Toward Sophisticated Sensing with Queries , 2003, IPSN.

[40]  Joseph Polastre,et al.  Design and implementation ofwireless sensor networks for habitat monitoring , 2003 .

[41]  Samuel Madden,et al.  The design and evaluation of a query processing architecture for sensor networks , 2003 .

[42]  Srinivasan Seshan,et al.  Synopsis diffusion for robust aggregation in sensor networks , 2004, SenSys '04.

[43]  Wei Hong,et al.  Model-Driven Data Acquisition in Sensor Networks , 2004, VLDB.

[44]  Jennifer Widom,et al.  Adaptive ordering of pipelined stream filters , 2004, SIGMOD '04.

[45]  Samuel Madden,et al.  Distributed regression: an efficient framework for modeling sensor network data , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[46]  Johannes Gehrke,et al.  Query Processing in Sensor Networks , 2003, CIDR.

[47]  Carlos Guestrin,et al.  Robust Probabilistic Inference in Distributed Systems , 2004, UAI.

[48]  Mohamed A. Sharaf,et al.  Balancing energy efficiency and quality of aggregate data in sensor networks , 2004, The VLDB Journal.

[49]  Jeffrey Considine,et al.  Approximate aggregation techniques for sensor databases , 2004, Proceedings. 20th International Conference on Data Engineering.

[50]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[51]  Wei Hong,et al.  Exploiting correlated attributes in acquisitional query processing , 2005, 21st International Conference on Data Engineering (ICDE'05).

[52]  Ian F. Akyildiz,et al.  Wireless sensor networks , 2007 .