Optimal Wireless Information and Power Transfer Using Deep Q-Network

In this paper, a multiantenna wireless transmitter communicates with an information receiver while radiating RF energy to surrounding energy harvesters. The channel between the transceivers is known to the transmitter, but the channels between the transmitter and the energy harvesters are unknown to the transmitter. By designing its transmit covariance matrix, the transmitter fully charges the energy buffers of all energy harvesters in the shortest amount of time while maintaining the target information rate toward the receiver. At the beginning of each time slot, the transmitter determines the particular beam pattern to transmit with. Throughout the whole charging process, the transmitter does not estimate the energy harvesting channel vectors. Due to the high complexity of the system, we propose a novel deep Q-network algorithm to determine the optimal transmission strategy for complex systems. Simulation results show that deep Q-network is superior to the existing algorithms in terms of the time consumption to fulfill the wireless charging process.

[1]  Saman Atapattu,et al.  Beamformer design for wireless energy transfer with fairness , 2016, 2016 IEEE International Conference on Communications (ICC).

[2]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[3]  Dong In Kim,et al.  Received Power-Based Channel Estimation for Energy Beamforming in Multiple-Antenna RF Energy Transfer System , 2017, IEEE Transactions on Signal Processing.

[4]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[5]  Jie Xu,et al.  A General Design Framework for MIMO Wireless Energy Transfer With Limited Feedback , 2015, IEEE Transactions on Signal Processing.

[6]  Jing Wang,et al.  A deep reinforcement learning based framework for power-efficient resource allocation in cloud RANs , 2017, 2017 IEEE International Conference on Communications (ICC).

[7]  Liang Dong,et al.  Deep Learning for Optimized Wireless Transmission to Multiple RF Energy Harvesters , 2018, 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall).

[8]  Wei Wu,et al.  Max-min fair wireless energy transfer for multiple-input multiple-output wiretap channels , 2016, IET Commun..

[9]  Deepak Mishra,et al.  Jointly Optimal Spatial Channel Assignment and Power Allocation for MIMO SWIPT Systems , 2018, IEEE Wireless Communications Letters.

[10]  Victor C. M. Leung,et al.  Deep-Reinforcement-Learning-Based Optimization for Cache-Enabled Opportunistic Interference Alignment Wireless Networks , 2017, IEEE Transactions on Vehicular Technology.

[11]  Yuan Xing,et al.  A Multi-Armed Bandit Approach to Wireless Information and Power Transfer , 2020, IEEE Communications Letters.

[12]  Liang Dong,et al.  Parallel Sub-Channel Transmission for Cognitive Radios with Multiple Antennas , 2014, Wirel. Pers. Commun..

[13]  Rui Zhang,et al.  MIMO Broadcasting for Simultaneous Wireless Information and Power Transfer , 2013 .

[14]  Hong-Chuan Yang,et al.  On the Performance of Overlaid Wireless Sensor Transmission With RF Energy Harvesting , 2015, IEEE Journal on Selected Areas in Communications.

[15]  Kostas Berberidis,et al.  Spatial Domain Simultaneous Information and Power Transfer for MIMO Channels , 2015, IEEE Transactions on Wireless Communications.

[16]  Bhaskar Krishnamachari,et al.  Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks , 2018, IEEE Transactions on Cognitive Communications and Networking.

[17]  James K. Cavers,et al.  Mobile Channel Characteristics , 2000 .

[18]  Bo Hu,et al.  Optimal Transceiver Design for SWIPT in $K$-User MIMO Interference Channels , 2016, IEEE Transactions on Wireless Communications.

[19]  Aleksandrs Slivkins,et al.  Introduction to Multi-Armed Bandits , 2019, Found. Trends Mach. Learn..

[20]  Luis Alonso,et al.  Information Exchange in Randomly Deployed Dense WSNs With Wireless Energy Harvesting Capabilities , 2016, IEEE Transactions on Wireless Communications.

[21]  Multiuser Data Dissemination in OFDMA System Based on Deep Q-Network , 2021, 2021 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS).

[22]  Shimon Whiteson,et al.  Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.

[23]  A. Robert Calderbank,et al.  MIMO Wireless Communications , 2007 .

[24]  Andrew G. Barto,et al.  Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[25]  Liang Liu,et al.  Collaborative Wireless Energy and Information Transfer in Interference Channel , 2014, IEEE Transactions on Wireless Communications.

[26]  Liang Dong,et al.  Passive Radio-Frequency Energy Harvesting through Wireless Information Transmission , 2017, 2017 13th International Conference on Distributed Computing in Sensor Systems (DCOSS).

[27]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.