Long-term information collection with energy harvesting wireless sensors: a multi-armed bandit based approach

This paper reports on the development of a multi-agent approach to long-term information collection in networks of energy harvesting wireless sensors. In particular, we focus on developing energy management and data routing policies that adapt their behaviour according to the energy that is harvested, in order to maximise the amount of information collected given the available energy budget. In so doing, we introduce a new energy management technique, based on multi-armed bandit learning, that allows each agent to adaptively allocate its energy budget across the tasks of data sampling, receiving and transmitting. By using this approach, each agent can learn the optimal energy budget settings that give it efficient information collection in the long run. Then, we propose two novel decentralised multi-hop algorithms for data routing. The first proveably maximises the information throughput in the network, but can sometimes involve high communication cost. The second algorithm provides near-optimal performance, but with reduced computational and communication costs. Finally, we demonstrate that, by using our approaches for energy management and routing, we can achieve a 120% improvement in long-term information collection against state-of-the-art benchmarks.

[1]  Cauligi S. Raghavendra,et al.  PEGASIS: Power-efficient gathering in sensor information systems , 2002, Proceedings, IEEE Aerospace Conference.

[2]  Deborah Estrin,et al.  Rumor routing algorthim for sensor networks , 2002, WSNA '02.

[3]  H. Robbins Some aspects of the sequential design of experiments , 1952 .

[4]  Geoff V. Merrett,et al.  Energy- and information-managed wireless sensor networks : modelling and simulation , 2008 .

[5]  Victor R. Lesser,et al.  The Soft Real-Time Agent Control Architecture , 2005, Autonomous Agents and Multi-Agent Systems.

[6]  Giuseppe Anastasi,et al.  Performance measurements of motes sensor networks , 2004, MSWiM '04.

[7]  S. Beeby,et al.  Energy harvesting vibration sources for microsystems applications , 2006 .

[8]  Andreas Krause,et al.  Near-optimal sensor placements in Gaussian processes , 2005, ICML.

[9]  Deepak Ganesan,et al.  Ultra-low power data storage for sensor networks , 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[10]  Azzedine Boukerche,et al.  Algorithms and Protocols for Wireless Sensor Networks , 2008, Wiley series on parallel and distributed computing.

[11]  Tao Wu,et al.  Minimizing inter-cluster interference by self-reorganizing MAC allocation in sensor networks , 2007, Wirel. Networks.

[12]  Gang Zhou,et al.  Achieving Real-Time Target Tracking UsingWireless Sensor Networks , 2006, 12th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS'06).

[13]  Shlomo Zilberstein,et al.  Formal models and algorithms for decentralized decision making under uncertainty , 2008, Autonomous Agents and Multi-Agent Systems.

[14]  Nicholas R. Jennings,et al.  Decentralized control of adaptive sampling in wireless sensor networks , 2009, TOSN.

[15]  Jing Zhou,et al.  FloodNet: Coupling Adaptive Sampling with Energy Aware Routing in a Flood Warning System , 2007, Journal of Computer Science and Technology.

[16]  Margaret Martonosi,et al.  Hardware design experiences in ZebraNet , 2004, SenSys '04.

[17]  Radhika Nagpal,et al.  Towards Desynchronization of Multi-hop Topologies , 2008, 2008 Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems.

[18]  Deborah Estrin,et al.  Time synchronization for wireless sensor networks , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[19]  Wendi Heinzelman,et al.  Energy-efficient communication protocol for wireless microsensor networks , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[20]  Deborah Estrin,et al.  Directed diffusion for wireless sensor networking , 2003, TNET.

[21]  Saibal Roy,et al.  Self-powered autonomous wireless sensor node using vibration energy harvesting , 2008 .

[22]  Mohamed F. Younis,et al.  On handling QoS traffic in wireless sensor networks , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[23]  Ying Zhang,et al.  A Learning-based Adaptive Routing Tree for Wireless Sensor Networks , 2006, J. Commun..

[24]  Nicholas R. Jennings,et al.  An Agent-Based Distributed Coordination Mechanism for Wireless Visual Sensor Nodes Using Dynamic Programming , 2010, Comput. J..

[25]  Yong Wang,et al.  Energy-efficient computing for wildlife tracking: design tradeoffs and early experiences with ZebraNet , 2002, ASPLOS X.

[26]  B. Frieden Science from Fisher Information , 2004 .

[27]  Kay Römer,et al.  The design space of wireless sensor networks , 2004, IEEE Wireless Communications.

[28]  Makoto Yokoo,et al.  DCOPs meet the realworld: exploring unknown reward matrices with applications to mobile sensor networks , 2009, IJCAI 2009.

[29]  Michèle Sebag,et al.  Adaptive operator selection with dynamic multi-armed bandits , 2008, GECCO '08.

[30]  Ian F. Akyildiz,et al.  Wireless sensor networks: a survey , 2002, Comput. Networks.

[31]  Chee-Yee Chong,et al.  Sensor networks: evolution, opportunities, and challenges , 2003, Proc. IEEE.

[32]  Nicholas R. Jennings,et al.  Decentralised Adaptive Sampling of Wireless Sensor Networks , 2007 .

[33]  NICHOLAS R. JENNINGS,et al.  An agent-based approach for building complex software systems , 2001, CACM.

[34]  Michèle Sebag,et al.  Multi-armed Bandit, Dynamic Environments and Meta-Bandits , 2006 .

[35]  Peter Auer,et al.  The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..

[36]  Leen-Kiat Soh,et al.  A Real-Time Negotiation Model and A Multi-Agent Sensor Network Implementation , 2005, Autonomous Agents and Multi-Agent Systems.

[37]  Deborah Estrin,et al.  Rumor Routing Algorithm For Sensor Networks , 2002 .

[38]  Erol Gelenbe,et al.  Power-aware ad hoc cognitive packet networks , 2004, Ad Hoc Networks.

[39]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[40]  A. Kansal,et al.  An environmental energy harvesting framework for sensor networks , 2003, Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03..

[41]  Marie desJardins,et al.  Local strategy learning in networked multi-agent team formation , 2006, Autonomous Agents and Multi-Agent Systems.

[42]  Heribert Baldus,et al.  Reliable Set-Up of Medical Body-Sensor Networks , 2004, EWSN.

[43]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[44]  Johnsen Kho,et al.  Decentralised control of wireless sensor networks , 2009 .

[45]  Dorothea Wagner,et al.  Algorithms for Sensor and Ad Hoc Networks, Advanced Lectures [result from a Dagstuhl seminar] , 2007, Algorithms for Sensor and Ad Hoc Networks.

[46]  John A. Stankovic,et al.  Research challenges for wireless sensor networks , 2004, SIGBED.

[47]  Ajay D. Kshemkalyani,et al.  Clock synchronization for wireless sensor networks: a survey , 2005, Ad Hoc Networks.

[48]  Sarvapali D. Ramchurn,et al.  2008 International Conference on Information Processing in Sensor Networks Towards Real-Time Information Processing of Sensor Network Data using Computationally Efficient Multi-output Gaussian Processes , 2022 .

[49]  Baohua Zhao,et al.  A Global-Energy-Balancing Real-Time Routing in Wireless Sensor Networks , 2007 .

[50]  Armin Dekorsy,et al.  Optimal Distributed Routing and Power Control Decomposition for Wireless Networks , 2007, IEEE GLOBECOM 2007 - IEEE Global Telecommunications Conference.

[51]  Peter Desnoyers,et al.  Ultra-low power data storage for sensor networks , 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[52]  S. Sitharama Iyengar,et al.  Energy equivalence routing in wireless sensor networks , 2004, Microprocess. Microsystems.

[53]  Ann Nowé,et al.  Decentralized Learning in Wireless Sensor Networks , 2009, ALA.

[54]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[55]  Giuseppe Anastasi,et al.  Performance Measurements of Mote Sensor Networks , 2004 .

[56]  Michal Pechoucek,et al.  Industrial deployment of multi-agent technologies: review and selected case studies , 2008, Autonomous Agents and Multi-Agent Systems.

[57]  Alexandros Labrinidis,et al.  Proceeedings of the 1st international workshop on Data management for sensor networks: in conjunction with VLDB 2004 , 2004 .

[58]  Gyula Simon,et al.  Sensor network-based countersniper system , 2004, SenSys '04.

[59]  Cem Ersoy,et al.  MAC protocols for wireless sensor networks: a survey , 2006, IEEE Communications Magazine.

[60]  Subhash Suri,et al.  Catching elephants with mice: Sparse sampling for monitoring sensor networks , 2009, TOSN.

[61]  A. Cassandra,et al.  Exact and approximate algorithms for partially observable markov decision processes , 1998 .

[62]  A. El Saddik,et al.  Ant Colony-Based Reinforcement Learning Algorithm for Routing in Wireless Sensor Networks , 2007, 2007 IEEE Instrumentation & Measurement Technology Conference IMTC 2007.

[63]  Edward Y. Chang,et al.  Adaptive sampling for sensor networks , 2004, DMSN '04.

[64]  Jan M. Rabaey,et al.  Power Sources for Wireless Sensor Networks , 2004, EWSN.

[65]  C. Guestrin,et al.  Near-optimal sensor placements: maximizing information while minimizing communication cost , 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[66]  Andrew G. Barto,et al.  Adaptive Control of Duty Cycling in Energy-Harvesting Wireless Sensor Networks , 2007, 2007 4th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks.

[67]  Robert D. Nowak,et al.  Backcasting: adaptive sampling for sensor networks , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[68]  Nicholas R. Jennings,et al.  A utility-based adaptive sensing and multihop communication protocol for wireless sensor networks , 2010, TOSN.

[69]  Nicholas R. Jennings,et al.  Agent Technologies for Sensor Networks , 2009, IEEE Intelligent Systems.

[70]  Richard R. Brooks,et al.  Distributed Sensor Networks: A Multiagent Perspective , 2008 .

[71]  Soundar R. T. Kumara,et al.  Distributed energy balanced routing for wireless sensor networks , 2009, Comput. Ind. Eng..