Learning and Fairness in Energy Harvesting: A Maximin Multi-Armed Bandits Approach

Recent advances in wireless radio frequency (RF) energy harvesting allows sensor nodes to increase their lifespan by remotely charging their batteries. The amount of energy harvested by the nodes varies depending on their ambient environment, and proximity to the energy source, and lifespan of the sensor network depends on the minimum amount of energy a node can harvest in the network. It is thus important to learn the least amount of energy harvested by nodes so that the source can transmit on a frequency band that maximizes this amount. We model this learning problem as a novel stochastic Maximin Multi-Armed Bandits (Maximin MAB) problem and propose an Upper Confidence Bound (UCB) based algorithm named Maximin UCB. Maximin MAB is a generalization of standard MAB, and Maximin UCB enjoys the same performance guarantee as to the UCBI algorithm. Our experimental results validate the performance guarantees of the proposed algorithm.

[1]  Nicholas R. Jennings,et al.  Long-term information collection with energy harvesting wireless sensors: a multi-armed bandit based approach , 2012, Autonomous Agents and Multi-Agent Systems.

[2]  Setareh Maghsudi,et al.  Distributed User Association in Energy Harvesting Dense Small Cell Networks: A Mean-Field Multi-Armed Bandit Approach , 2016, IEEE Access.

[3]  Quan Wang,et al.  Frequency- and Pattern-Reconfigurable Antenna for Multistandard Wireless Applications , 2015, IEEE Antennas and Wireless Propagation Letters.

[4]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[5]  Hyungsik Ju,et al.  Throughput Maximization in Wireless Powered Communication Networks , 2013, IEEE Trans. Wirel. Commun..

[6]  Bhaskar Krishnamachari,et al.  Online Learning Schemes for Power Allocation in Energy Harvesting Communications , 2018, IEEE Transactions on Information Theory.

[7]  Jacques Palicot,et al.  Distributed decision making policy for frequency band selection boosting RF energy harvesting rate in wireless sensor nodes , 2018, Wirel. Networks.

[8]  Zhu Han,et al.  Wireless Networks With RF Energy Harvesting: A Contemporary Survey , 2014, IEEE Communications Surveys & Tutorials.

[9]  Rahul Vaze,et al.  Distributed Algorithms for Efficient Learning and Coordination in Ad Hoc Networks , 2019, 2019 International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOPT).

[10]  Yong Liang Guan,et al.  Throughput Optimization for Massive MIMO Systems Powered by Wireless Energy Transfer , 2014, IEEE Journal on Selected Areas in Communications.

[11]  Gregory D. Durgin,et al.  Harvesting Wireless Power: Survey of Energy-Harvester Conversion Efficiency in Far-Field, Wireless Power Transfer Systems , 2014, IEEE Microwave Magazine.

[12]  Hyungsik Ju,et al.  User cooperation in wireless powered communication networks , 2014, 2014 IEEE Global Communications Conference.

[13]  Kaibin Huang,et al.  Energy Harvesting Wireless Communications: A Review of Recent Advances , 2015, IEEE Journal on Selected Areas in Communications.

[14]  Harshvardhan Tibrewal,et al.  Distributed Learning and Optimal Assignment in Multiplayer Heterogeneous Networks , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[15]  Swades De,et al.  Smart RF energy harvesting communications: challenges and opportunities , 2015, IEEE Communications Magazine.