In-situ prediction on sensor networks using distributed multiple linear regression models

Within sensor networks for environmental monitoring, a class of problems exists that requires in-situ control and modeling. In this thesis, we provide a solution to these problems, enabling model-driven computation where complex models are replaced by in-situ sensing and communication. The resulting prediction models use low-computation, low-communication, distributed algorithms suited to autonomous operation and multiple applications. We achieve this by developing new algorithms that compute the pseudoinverse of a matrix in a distributed fashion on a sensor network, thereby enabling a wide range of prediction methods. We apply these models to three application areas: (1) river flooding for early warning, (2) solar recharging current for power management, and (3) job congestion prediction on multi-function device networks for achieving quality of service. We also use these applications to explore other aspects of sensor networks: river flooding to design a predictive environmental monitoring sensor network, solar current to develop a dynamic version of the model for better fault tolerance, and job congestion to explore modeling multi-function device networks. We comprehensively tested the full solution for each application. We implemented the river flood prediction and solar current prediction solutions on two different sensor network platforms with full field deployments; the final test of each ran for over 5 weeks. Overall, we make the following contributions: (1) distributed algorithms for computing a matrix pseudoinverse and a multiple linear regression model on a sensor network, (2) three applications of these algorithms with associated field experiments demonstrating their versatility, (3) a sensor network architecture and implementation for river flood prediction as well as other applications requiring real-time data and a low ratio of node count to geographic area, and (4) an MFD (multi-function device) simulator for predicting and resolving congestion.
(Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)
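
The abstract does not describe the thesis's distributed pseudoinverse algorithm, so the following is only a minimal sketch of one well-known way a multiple linear regression fit can be split across nodes, not the author's method. It assumes the stacked design matrix X has full column rank, in which case the pseudoinverse solution is beta = (X^T X)^(-1) X^T y, and both X^T X and X^T y are sums of per-node terms. Each node therefore only needs to share a small p-by-p matrix and a length-p vector. All function names and the sample data are hypothetical.

```python
from typing import List, Tuple

Matrix = List[List[float]]
Vector = List[float]


def local_statistics(rows: Matrix, targets: Vector) -> Tuple[Matrix, Vector]:
    """Compute (X_i^T X_i, X_i^T y_i) from one node's own observations."""
    p = len(rows[0])
    gram = [[0.0] * p for _ in range(p)]
    moment = [0.0] * p
    for x, y in zip(rows, targets):
        for a in range(p):
            moment[a] += x[a] * y
            for b in range(p):
                gram[a][b] += x[a] * x[b]
    return gram, moment


def combine(stats: List[Tuple[Matrix, Vector]]) -> Tuple[Matrix, Vector]:
    """Sum the per-node statistics; this is the only data nodes exchange."""
    p = len(stats[0][1])
    gram = [[0.0] * p for _ in range(p)]
    moment = [0.0] * p
    for g, m in stats:
        for a in range(p):
            moment[a] += m[a]
            for b in range(p):
                gram[a][b] += g[a][b]
    return gram, moment


def solve(a: Matrix, b: Vector) -> Vector:
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("X^T X is singular; full column rank is assumed")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def fit(nodes: List[Tuple[Matrix, Vector]]) -> Vector:
    """Fit regression coefficients from per-node (rows, targets) pairs."""
    gram, moment = combine([local_statistics(r, t) for r, t in nodes])
    return solve(gram, moment)


def predict(beta: Vector, x: Vector) -> float:
    """Evaluate the fitted linear model on one feature vector."""
    return sum(b * v for b, v in zip(beta, x))


# Hypothetical data: two nodes observing y = 1 + 2*x1 + 3*x2.
# The first entry of each row is the constant 1 for the intercept.
node_a = ([[1.0, 0.0, 0.0], [1.0, 1.0, 0.0]], [1.0, 3.0])
node_b = ([[1.0, 0.0, 1.0], [1.0, 1.0, 1.0], [1.0, 2.0, 1.0]], [4.0, 6.0, 8.0])
beta = fit([node_a, node_b])
```

Because only the summed statistics are exchanged, the communication cost per node in this sketch depends on the number of predictors rather than on the number of readings each node has collected. It does not handle rank-deficient data, which is one case where a true pseudoinverse is needed instead of the normal equations.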
