Simultaneous Energy Harvesting and Information Transmission in a MIMO Full-Duplex System: A Machine Learning-Based Design

We propose a multiple-input multiple-output (MIMO)-based full-duplex (FD) scheme that enables wireless devices to simultaneously transmit information and harvest energy using the same time-frequency resources. In this scheme, for a MIMO point-to-point set up, the energy transmitting device simultaneously receives information from the energy harvesting device. Furthermore, the self-interference (SI) at the energy harvesting device caused by the FD mode of operation is utilized as a desired power signal to be harvested by the device. For implementation-friendly antenna selection and MIMO precoding at both the devices, we propose two methods: (i) a sub-optimal method based on relaxation, and (ii) a hybrid deep reinforcement learning (DRL)-based method, specifically, a deep deterministic policy gradient (DDPG)-deep double Q-network (DDQN) method. Finally, we study the performance of the proposed system under the two implementation methods and compare it with that of the conventional time switching-based simultaneous wireless information and power transfer (SWIPT) method. Findings show that the proposed system gives a significant improvement in spectral efficiency compared to the time switching-based SWIPT. In particular, the DRL-based method provides the highest spectral efficiency. Furthermore, numerical results show that, for the considered system set up, the number of antennas in each device should exceed three to mitigate self-interference to an acceptable level.

[1]  Sneha A. Dalvi,et al.  Internet of Things for Smart Cities , 2017 .

[2]  Qing Wang,et al.  A Survey on Device-to-Device Communication in Cellular Networks , 2013, IEEE Communications Surveys & Tutorials.

[3]  Jie Xu,et al.  A General Design Framework for MIMO Wireless Energy Transfer With Limited Feedback , 2015, IEEE Transactions on Signal Processing.

[4]  Tho Le-Ngoc,et al.  MIMO Full-Duplex Precoding: A Joint Beamforming and Self-Interference Cancellation Structure , 2015, IEEE Transactions on Wireless Communications.

[5]  Taneli Riihonen,et al.  Mitigation of Loopback Self-Interference in Full-Duplex MIMO Relays , 2011, IEEE Transactions on Signal Processing.

[6]  Sachin Katti,et al.  Full duplex radios , 2013, SIGCOMM.

[7]  Joseph A. Paradiso,et al.  Energy scavenging for mobile and wireless electronics , 2005, IEEE Pervasive Computing.

[8]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[9]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[10]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[11]  Rui Zhang,et al.  MIMO Broadcasting for Simultaneous Wireless Information and Power Transfer , 2011, IEEE Transactions on Wireless Communications.

[12]  Chan-Byoung Chae,et al.  Compact Full Duplex MIMO Radios in D2D Underlaid Cellular Networks: From System Design to Prototype Results , 2016, IEEE Access.

[13]  Risto Wichman,et al.  In-Band Full-Duplex Wireless: Challenges and Opportunities , 2013, IEEE Journal on Selected Areas in Communications.

[14]  Kee Chaing Chua,et al.  Wireless Information and Power Transfer: A Dynamic Power Splitting Approach , 2013, IEEE Transactions on Communications.

[15]  Pingzhi Fan,et al.  Full-Duplex Device-to-Device-Aided Cooperative Nonorthogonal Multiple Access , 2017, IEEE Transactions on Vehicular Technology.

[16]  Muhammad Ali Imran,et al.  Energy-Efficiency Analysis and Optimization for Virtual-MIMO Systems , 2014, IEEE Transactions on Vehicular Technology.

[17]  A. Robert Calderbank,et al.  MIMO Wireless Communications , 2007 .

[18]  C.-H. Yu,et al.  Power Optimization of Device-to-Device Communication Underlaying Cellular Communication , 2009, 2009 IEEE International Conference on Communications.

[19]  Matti Latva-aho,et al.  Precoding for Full Duplex Multiuser MIMO Systems: Spectral and Energy Efficiency Maximization , 2013, IEEE Transactions on Signal Processing.

[20]  Yue Chen,et al.  Spectral and Energy Efficiency of Uplink D2D Underlaid Massive MIMO Cellular Networks , 2017, IEEE Transactions on Communications.

[21]  Ekram Hossain,et al.  Multiple Access in Dynamic Cell-Free Networks: Outage Performance and Deep Reinforcement Learning-Based Design , 2020, ArXiv.

[22]  Aimin Tang,et al.  Cooperative Full Duplex Device to Device Communication Underlaying Cellular Networks , 2017, IEEE Transactions on Wireless Communications.

[23]  Victor C. M. Leung,et al.  Green Full-Duplex Self-Backhaul and Energy Harvesting Small Cell Networks With Massive MIMO , 2016, IEEE Journal on Selected Areas in Communications.

[24]  Ekram Hossain,et al.  The D-OMA Method for Massive Multiple Access in 6G: Performance, Security, and Challenges , 2019, IEEE Vehicular Technology Magazine.

[25]  Sivarama Venkatesan,et al.  MIMO Communication for Cellular Networks , 2011 .

[26]  Gorazd Kandus,et al.  MIMO: Wireless Communications , 2008, Encyclopedia of Wireless and Mobile Communications.

[27]  Gábor Fodor,et al.  A Distributed Power Control Scheme for Cellular Network Assisted D2D Communications , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[28]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[29]  Klaus David,et al.  6G Vision and Requirements: Is There Any Need for Beyond 5G? , 2018, IEEE Vehicular Technology Magazine.

[30]  Athanasios V. Vasilakos,et al.  Full-Duplex Wireless Communications: Challenges, Solutions, and Future Research Directions , 2016, Proceedings of the IEEE.

[31]  Yasser F. Al-Eryani Exact Performance of Wireless-Powered Communications with Maximum Ratio Combining , 2019, Wireless Personal Communications.

[32]  Dong In Kim,et al.  Generalized Coordinated Multipoint (GCoMP)-Enabled NOMA: Outage, Capacity, and Power Allocation , 2019, IEEE Transactions on Communications.

[33]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.