RLMan: An Energy Manager Based on Reinforcement Learning for Energy Harvesting Wireless Sensor Networks

A promising solution for achieving autonomous wireless sensor networks is to enable each node to harvest energy in its environment. To address the time-varying behavior of energy sources, each node embeds an energy manager responsible for dynamically adapting the power consumption of the node in order to maximize the quality of service while avoiding power failures. A novel energy management algorithm based on reinforcement learning (RLMan) is proposed in this paper. By continuously exploring the environment, RLMan adapts its energy management policy to time-varying environment, regarding both the harvested energy and the energy consumption of the node. Linear function approximations are used to achieve very low computational and memory footprint, making RLMan suitable for resource-constrained systems, such as wireless sensor nodes. Moreover, RLMan only requires the state of charge of the energy storage device to operate, which makes it practical to implement. Exhaustive simulations using real measurements of indoor light and outdoor wind show that RLMan outperforms current state-of-the-art approaches, by enabling almost 70% gain regarding the average packet rate. Moreover, RLMan is more robust to variability of the node energy consumption.

[1]  Olivier Berder,et al.  Energy-Efficient Power Manager and MAC Protocol for Multi-Hop Wireless Sensor Networks Powered by Periodic Energy Harvesting Sources , 2015, IEEE Sensors Journal.

[2]  S. Peng,et al.  Prediction free energy neutral power management for energy harvesting wireless sensor nodes , 2014, Ad Hoc Networks.

[3]  Luca P. Carloni,et al.  Energy-Harvesting Active Networked Tags (EnHANTs) , 2015, ACM Trans. Sens. Networks.

[4]  Olivier Berder,et al.  PowWow : Power Optimized Hardware/Software Framework for Wireless Motes , 2010, ARCS Workshops.

[5]  Mani B. Srivastava,et al.  Power management in energy harvesting sensor networks , 2007, TECS.

[6]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[7]  Andrew G. Barto,et al.  Adaptive Control of Duty Cycling in Energy-Harvesting Wireless Sensor Networks , 2007, 2007 4th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks.

[8]  Cécile Belleudy,et al.  A framework for modeling and simulating energy harvesting WSN nodes with efficient power management policies , 2012, EURASIP J. Embed. Syst..

[9]  Gil Zussman,et al.  Networking Low-Power Energy Harvesting Devices: Measurements and Algorithms , 2011, IEEE Transactions on Mobile Computing.

[10]  Dewen Hu,et al.  Multiobjective Reinforcement Learning: A Comprehensive Overview , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[11]  Tobias Weber,et al.  Reinforcement Learning for Energy Harvesting Decode-and-Forward Two-Hop Communications , 2017, IEEE Transactions on Green Communications and Networking.

[12]  Olivier Berder,et al.  Fuzzy power management for energy harvesting Wireless Sensor Nodes , 2016, 2016 IEEE International Conference on Communications (ICC).

[13]  Hao-Li Wang,et al.  A Reinforcement Learning-Based ToD Provisioning Dynamic Power Management for Sustainable Operation of Energy Harvesting Wireless Sensor Node , 2014, IEEE Transactions on Emerging Topics in Computing.

[14]  Olivier Berder,et al.  GRAPMAN: Gradual power manager for consistent throughput of energy harvesting wireless sensor nodes , 2015, 2015 IEEE 26th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC).

[15]  Robert Babuska,et al.  A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[16]  Christian R. Shelton,et al.  Balancing Multiple Sources of Reward in Reinforcement Learning , 2000, NIPS.

[17]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[18]  Julien Penders,et al.  Energy Harvesting for Autonomous Wireless Sensor Networks , 2010, IEEE Solid-State Circuits Magazine.