The Clustered AGgregation (CAG) technique leveraging spatial and temporal correlations in wireless sensor networks

Sensed data in Wireless Sensor Networks (WSN) reflect the spatial and temporal correlations of physical attributes existing intrinsically in the environment. In this article, we present the Clustered AGgregation (CAG) algorithm that forms clusters of nodes sensing similar values within a given threshold (spatial correlation), and these clusters remain unchanged as long as the sensor values stay within a threshold over time (temporal correlation). With CAG, only one sensor reading per cluster is transmitted whereas with Tiny AGgregation (TAG) all the nodes in the network transmit the sensor readings. Thus, CAG provides energy efficient and approximate aggregation results with small and often negligible and bounded error. In this article we extend our initial work in CAG in five directions: First, we investigate the effectiveness of CAG that exploits the temporal as well as spatial correlations using both the measured and modeled data. Second, we design CAG for two modes of operation (interactive and streaming) to enable CAG to be used in different environments and for different purposes. Interactive mode provides mechanisms for one-shot queries, whereas the streaming mode provides those for continuous queries. Third, we propose a fixed range clustering method, which makes the performance of our system independent of the magnitude of the sensor readings and the network topology. Fourth, using mica2 motes, we perform a large-scale measurement of real environmental data (temperature and light, both indoor and outdoor) and the wireless radio reliability, which were used for both analytical modeling and simulation experiments. Fifth, we model the spatially correlated data using the properties of our real world measurements. Our experimental results show that when we compute the average of sensor readings in the network using the CAG interactive mode with the user-provided error threshold of, 20%, we can save 68.25% of transmissions over TAG with only 2.46% inaccuracy in the result. The streaming mode of CAG can save even more transmissions (up to 70.24% in our experiments) over TAG, when data shows high spatial and temporal correlations. We expect these results to hold in reality, because we used the mica2 radio profile and empirical datasets for our simulation study. CAG is the first system that leverages spatial and temporal correlations to improve energy efficiency of in-network aggregation. This study analytically and empirically validates CAG's effectiveness.

[1]  Carol A. Gotway,et al.  Statistical Methods for Spatial Data Analysis , 2004 .

[2]  Edward Y. Chang,et al.  Adaptive stream resource management using Kalman Filters , 2004, SIGMOD '04.

[3]  Wendi Heinzelman,et al.  Energy-efficient communication protocol for wireless microsensor networks , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[4]  Jeffrey Considine,et al.  Approximate aggregation techniques for sensor databases , 2004, Proceedings. 20th International Conference on Data Engineering.

[5]  Baltasar Beferull-Lozano,et al.  On network correlated data gathering , 2004, IEEE INFOCOM 2004.

[6]  Martin Vetterli,et al.  Network correlated data gathering with explicit communication: NP-completeness and algorithms , 2006 .

[7]  Cyrus Shahabi,et al.  Exploiting spatial correlation towards an energy efficient clustered aggregation technique (CAG) [wireless sensor network applications] , 2005, IEEE International Conference on Communications, 2005. ICC 2005. 2005.

[8]  Deborah Estrin,et al.  A wireless sensor network For structural monitoring , 2004, SenSys '04.

[9]  Deborah Estrin,et al.  Computing aggregates for monitoring wireless sensor networks , 2003, Proceedings of the First IEEE International Workshop on Sensor Network Protocols and Applications, 2003..

[10]  Tomasz Imielinski,et al.  Prediction-based monitoring in sensor networks: taking lessons from MPEG , 2001, CCRV.

[11]  John Anderson,et al.  An analysis of a large scale habitat monitoring application , 2004, SenSys '04.

[12]  Phillip B. Gibbons Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports , 2001, VLDB.

[13]  Johannes Gehrke,et al.  Query Processing in Sensor Networks , 2003, CIDR.

[14]  Monica D. Kohler,et al.  Seismic Amplitude Variations due to Site and Basin Edge Effects in the Los Angeles Basin , 2003 .

[15]  Deborah Estrin,et al.  An evaluation of multi-resolution storage for sensor networks , 2003, SenSys '03.

[16]  David E. Culler,et al.  Analysis of wireless sensor networks for habitat monitoring , 2004 .

[17]  Michael S. Rosenberg,et al.  Conceptual and Mathematical Relationships among Methods for Spatial Analysis , 2022 .

[18]  D.P. Agrawal,et al.  APTEEN: a hybrid protocol for efficient routing and comprehensive information retrieval in wireless , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[19]  Ramesh Govindan,et al.  Networking support for query processing in sensor networks , 2004, CACM.

[20]  Wei Hong,et al.  Proceedings of the 5th Symposium on Operating Systems Design and Implementation Tag: a Tiny Aggregation Service for Ad-hoc Sensor Networks , 2022 .

[21]  A. Manjeshwar,et al.  TEEN: a routing protocol for enhanced efficiency in wireless sensor networks , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[22]  Srinivasan Seshan,et al.  Synopsis diffusion for robust aggregation in sensor networks , 2004, SenSys '04.

[23]  Jack J. Lennon,et al.  Red-shifts and red herrings in geographical ecology , 2000 .

[24]  Samir R. Das,et al.  Efficient gathering of correlated data in sensor networks , 2005, MobiHoc '05.

[25]  S. Hartley,et al.  Uses and abuses of fractal methodology in ecology , 2004 .

[26]  Vishnu Navda,et al.  Efficient gathering of correlated data in sensor networks , 2008, TOSN.

[27]  Chris Chatfield,et al.  Statistical Methods for Spatial Data Analysis , 2004 .

[28]  Leonidas J. Guibas,et al.  Lightweight sensing and communication protocols for target enumeration and aggregation , 2003, MobiHoc '03.

[29]  David E. Culler,et al.  TOSSIM: accurate and scalable simulation of entire TinyOS applications , 2003, SenSys '03.

[30]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[31]  Konstantinos Psounis,et al.  Modeling spatially-correlated sensor network data , 2004, 2004 First Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks, 2004. IEEE SECON 2004..

[32]  Deborah Estrin,et al.  Simultaneous Optimization for Concave Costs: Single Sink Aggregation or Single Source Buy-at-Bulk , 2003, SODA '03.

[33]  Deborah Estrin,et al.  Directed diffusion: a scalable and robust communication paradigm for sensor networks , 2000, MobiCom '00.

[34]  C. Guestrin,et al.  Distributed regression: an efficient framework for modeling sensor network data , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[35]  Bhaskar Krishnamachari,et al.  Distributed parameter estimation for monitoring diffusion phenomena using physical models , 2004, 2004 First Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks, 2004. IEEE SECON 2004..

[36]  P. Green,et al.  Modelling spatially correlated data via mixtures: a Bayesian approach , 2002 .

[37]  Wei Hong,et al.  Model-Driven Data Acquisition in Sensor Networks , 2004, VLDB.

[38]  Jennifer Widom,et al.  Adaptive filters for continuous queries over distributed data streams , 2003, SIGMOD '03.

[39]  Mohamed A. Sharaf,et al.  TiNA: a scheme for temporal coherency-aware in-network aggregation , 2003, MobiDe '03.

[40]  C. Shahabi,et al.  An Experimental Study of the Effectiveness of Clustered AGgregation ( CAG ) Leveraging Spatial and Temporal Correlations in Wireless Sensor Networks , 2005 .

[41]  Ramesh Govindan,et al.  The impact of spatial correlation on routing with compression in wireless sensor networks , 2008, TOSN.

[42]  Robert Szewczyk,et al.  System architecture directions for networked sensors , 2000, ASPLOS IX.