On-Board Deep Q-Network for UAV-Assisted Online Power Transfer and Data Collection

Unmanned Aerial Vehicles (UAVs) with Microwave Power Transfer (MPT) capability provide a practical means to deploy a large number of wireless powered sensing devices into areas with no access to persistent power supplies. The UAV can charge the sensing devices remotely and harvest their data. A key challenge is online MPT and data collection in the presence of on-board control of a UAV (e.g., patrolling velocity) for preventing battery drainage and data queue overflow of the devices, while up-to-date knowledge on battery level and data queue of the devices is not available at the UAV. In this paper, an on-board deep Q-network is developed to minimize the overall data packet loss of the sensing devices, by optimally deciding the device to be charged and interrogated for data collection, and the instantaneous patrolling velocity of the UAV. Specifically, we formulate a Markov Decision Process (MDP) with the states of battery level and data queue length of devices, channel conditions, and waypoints given the trajectory of the UAV; and solve it optimally with Q-learning. Furthermore, we propose the on-board deep Q-network that enlarges the state space of the MDP, and a deep reinforcement learning based scheduling algorithm that asymptotically derives the optimal solution online, even when the UAV has only outdated knowledge on the MDP states. Numerical results demonstrate that our deep reinforcement learning algorithm reduces the packet loss by at least 69.2%, as compared to existing non-learning greedy algorithms.

[1]  Ismail Güvenç,et al.  Dynamic Mobility-Aware Interference Avoidance for Aerial Base Stations in Cognitive Radio Networks , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[2]  Eduardo Tovar,et al.  Reinforcement Learning for Scheduling Wireless Powered Sensor Communications , 2019, IEEE Transactions on Green Communications and Networking.

[3]  Jun Li,et al.  Simultaneous Wireless Information and Power Transfer (SWIPT): Recent Advances and Future Challenges , 2018, IEEE Communications Surveys & Tutorials.

[4]  Joseph Lipka,et al.  A Table of Integrals , 2010 .

[5]  Ryu Miura,et al.  AC-POCA: Anticoordination Game Based Partially Overlapping Channels Assignment in Combined UAV and D2D-Based Networks , 2017, IEEE Transactions on Vehicular Technology.

[6]  Angelos Antonopoulos,et al.  Breaking the Boundaries of Aerial Networks with Charging Stations , 2019, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[7]  Xin Wang,et al.  Energy-Efficient Cooperative Relaying for Unmanned Aerial Vehicles , 2016, IEEE Transactions on Mobile Computing.

[8]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[9]  Liang Liu,et al.  On Coverage of Wireless Sensor Networks for Rolling Terrains , 2012, IEEE Transactions on Parallel and Distributed Systems.

[10]  Chau Yuen,et al.  Poster: Fair Scheduling for Energy Harvesting WSN in Smart City , 2015, SenSys.

[11]  Eduardo Tovar,et al.  PELE: Power efficient legitimate eavesdropping via jamming in UAV communications , 2017, 2017 13th International Wireless Communications and Mobile Computing Conference (IWCMC).

[12]  Wei Ni,et al.  Wireless Power Transfer and Data Collection in Wireless Sensor Networks , 2017, IEEE Transactions on Vehicular Technology.

[13]  Lihua Li,et al.  UAV-assisted Cooperative Communications with Wireless Information and Power Transfer , 2017, ArXiv.

[14]  Dan Keun Sung,et al.  Energy-efficient maneuvering and communication of a single UAV-based relay , 2014, IEEE Transactions on Aerospace and Electronic Systems.

[15]  Feng Jiang,et al.  Optimization of UAV Heading for the Ground-to-Air Uplink , 2011, IEEE Journal on Selected Areas in Communications.

[16]  Agathoniki Trigoni,et al.  Supporting Search and Rescue Operations with UAVs , 2010, 2010 International Conference on Emerging Security Technologies.

[17]  Jiming Chen,et al.  Demo: Mobile Wireless Charging and Sensing by Drones , 2016, MobiSys '16 Companion.

[18]  Darius Burschka,et al.  Toward a Fully Autonomous UAV: Research Platform for Indoor and Outdoor Urban Search and Rescue , 2012, IEEE Robotics & Automation Magazine.

[19]  Tarik Taleb,et al.  A green strategic activity scheduling for UAV networks: A sub-modular game perspective , 2016, IEEE Communications Magazine.

[20]  Marco Wiering,et al.  Reinforcement Learning and Markov Decision Processes , 2012, Reinforcement Learning.

[21]  Sanjay Jha,et al.  Reliable transmissions in AWSNs by using O-BESPAR hybrid antenna , 2016, Pervasive Mob. Comput..

[22]  Manos M. Tentzeris,et al.  A drone-based wireless power transfer anc communications platform , 2017, 2017 IEEE Wireless Power Transfer Conference (WPTC).

[23]  Zheng Ma,et al.  Design of wireless power transfer device for UAV , 2016, 2016 IEEE International Conference on Mechatronics and Automation.

[24]  Abbas Jamalipour,et al.  Toward the Evolution of Wireless Powered Communication Networks for the Future Internet of Things , 2017, IEEE Network.

[25]  Carrick Detweiler,et al.  Charge selection algorithms for maximizing sensor network life with UAV-based limited wireless recharging , 2013, 2013 IEEE Eighth International Conference on Intelligent Sensors, Sensor Networks and Information Processing.

[26]  Rose Qingyang Hu,et al.  Computation Rate Maximization in UAV-Enabled Wireless-Powered Mobile-Edge Computing Systems , 2018, IEEE Journal on Selected Areas in Communications.

[27]  Ryu Miura,et al.  On A Novel Adaptive UAV-Mounted Cloudlet-Aided Recommendation System for LBSNs , 2019, IEEE Transactions on Emerging Topics in Computing.

[28]  Miao Pan,et al.  Efficient data collection for wireless rechargeable sensor clusters in Harsh terrains using UAVs , 2014, 2014 IEEE Global Communications Conference.

[29]  Luis Alonso,et al.  Communication recovery with emergency aerial networks , 2017, IEEE Transactions on Consumer Electronics.

[30]  Carrick Detweiler,et al.  Resonant wireless power transfer to ground sensors from a UAV , 2012, 2012 IEEE International Conference on Robotics and Automation.

[31]  Long-Ji Lin,et al.  Reinforcement learning for robots using neural networks , 1992 .

[32]  Jiming Chen,et al.  Distributed Sampling Rate Control for Rechargeable Sensor Nodes with Limited Battery Capacity , 2013, IEEE Transactions on Wireless Communications.

[33]  Ryu Miura,et al.  A dynamic trajectory control algorithm for improving the communication throughput and delay in UAV-aided networks , 2016, IEEE Network.

[34]  Lingyang Song,et al.  Joint Trajectory and Power Optimization for UAV Relay Networks , 2018, IEEE Communications Letters.

[35]  Kuang-Hao Liu,et al.  Selection cooperation using RF energy harvesting relays with finite energy buffer , 2014, 2014 IEEE Wireless Communications and Networking Conference (WCNC).

[36]  Carrick Detweiler,et al.  Experimental Analysis of a UAV-Based Wireless Power Transfer Localization System , 2014, ISER.

[37]  A. Lee Swindlehurst,et al.  Wireless Relay Communications with Unmanned Aerial Vehicles: Performance and Optimization , 2011, IEEE Transactions on Aerospace and Electronic Systems.

[38]  Mohamed-Slim Alouini,et al.  Adaptive Modulation over Nakagami Fading Channels , 2000, Wirel. Pers. Commun..

[39]  Renjie Huang,et al.  Design and Deployment of Sensor Network for Real-Time High-Fidelity Volcano Monitoring , 2010, IEEE Transactions on Parallel and Distributed Systems.

[40]  Dong In Kim,et al.  Optimal Data Scheduling and Admission Control for Backscatter Sensor Networks , 2017, IEEE Transactions on Communications.

[41]  Wei Ni,et al.  SWPT: A Joint-Scheduling Model for Wireless Powered Sensor Networks , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[42]  Lihua Li,et al.  UAV-Assisted Cooperative Communications with Time-Sharing SWIPT , 2018, 2018 IEEE International Conference on Communications (ICC).

[43]  Xin Wang,et al.  EPLA: Energy-balancing packets scheduling for airborne relaying networks , 2015, 2015 IEEE International Conference on Communications (ICC).

[44]  Jie Xu,et al.  UAV-Enabled Wireless Power Transfer: Trajectory Design and Energy Optimization , 2017, IEEE Transactions on Wireless Communications.

[45]  John N. Tsitsiklis,et al.  Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.

[46]  Stephen Cameron,et al.  Communication provision for a team of remotely searching UAVs: A mobile relay approach , 2012, 2012 IEEE Globecom Workshops.

[47]  Youngnam Han,et al.  Optimal Resource Allocation for Non-Orthogonal Transmission in UAV Relay Systems , 2018, IEEE Wireless Communications Letters.

[48]  Rui Zhang,et al.  Throughput Maximization for UAV-Enabled Mobile Relaying Systems , 2016, IEEE Transactions on Communications.

[49]  I. S. Gradshteyn,et al.  Table of Integrals, Series, and Products , 1976 .

[50]  Pieter Abbeel,et al.  Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.

[51]  Rui Zhang,et al.  Wireless communications with unmanned aerial vehicles: opportunities and challenges , 2016, IEEE Communications Magazine.

[52]  Nei Kato,et al.  Efficient Resource Allocation Utilizing Q-Learning in Multiple UA Communications , 2019, IEEE Transactions on Network Science and Engineering.

[53]  Abbas Jamalipour,et al.  Throughput Maximization in Dual-Hop Wireless Powered Communication Networks , 2017, IEEE Transactions on Vehicular Technology.

[54]  Qingqing Wu,et al.  Joint Trajectory and Communication Design for Multi-UAV Enabled Wireless Networks , 2017, IEEE Transactions on Wireless Communications.

[55]  Manos M. Tentzeris,et al.  Design of a novel wireless power system using machine learning techniques for drone applications , 2017, 2017 IEEE Wireless Power Transfer Conference (WPTC).

[56]  Yimin D. Zhang,et al.  Multi-source cooperative communications using multiple small relay UAVs , 2010, 2010 IEEE Globecom Workshops.

[57]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[58]  Jin Chen,et al.  Power Control in UAV-Supported Ultra Dense Networks: Communications, Caching, and Energy Transfer , 2017, IEEE Communications Magazine.

[59]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[60]  Rui Zhang,et al.  Energy-Efficient UAV Communication With Trajectory Optimization , 2016, IEEE Transactions on Wireless Communications.

[61]  Chadi Abou-Rjeily,et al.  UAV-Aided Cooperation for FSO Communication Systems , 2018, IEEE Communications Magazine.