Energy-Efficient Power Allocation and Q-Learning-Based Relay Selection for Relay-Aided D2D Communication

Device-to-device (D2D) communication is a promising paradigm to meet the requirement of ultra-dense, low-latency and high-rate in the fifth-generation networks. However, energy consumption is a critical issue for the D2D communication, especially for D2D relay networks. To make the best use of D2D communication, the problem of optimizing energy efficiency (EE) must be addressed. In this paper, we propose a joint power allocation and relay selection (JPARS) scheme for the improvement of energy efficiency in relay-aided D2D communications underlaying cellular network. A mixed integer nonlinear fractional programming (MINLP) problem of the total EE for D2D pairs is formulated. While ensuring the quality of service (QoS) of cellular users and D2D links, we solve the power allocation problem by Dinkelbach method and Lagrange dual decomposition. After that, Q-learning, one of the reinforcement learning algorithms, is employed to solve the relay selection problem. Finally, we provide in-depth theoretical analysis of the proposed scheme in terms of complexity and signaling overhead. Simulation results verify that the proposed scheme not only overcomes the bottleneck effect, but also nearly reaches the theoretical maximum in terms of the total EE of D2D pairs.

[1]  Yingzhuang Liu,et al.  Optimal Resource and Power Allocation With Relay Selection for RF/RE Energy Harvesting Relay-Aided D2D Communication , 2019, IEEE Access.

[2]  Xiang Cheng,et al.  Efficiency Resource Allocation for Device-to-Device Underlay Communication Systems: A Reverse Iterative Combinatorial Auction Based Approach , 2012, IEEE Journal on Selected Areas in Communications.

[3]  Minghui Chen,et al.  Relay Selection for Radio Frequency Energy-Harvesting Wireless Body Area Network With Buffer , 2018, IEEE Internet of Things Journal.

[4]  Nei Kato,et al.  Networking and Communications in Autonomous Driving: A Survey , 2019, IEEE Communications Surveys & Tutorials.

[5]  Lajos Hanzo,et al.  Mobile-Traffic-Aware Offloading for Energy- and Spectral-Efficient Large-Scale D2D-Enabled Cellular Networks , 2019, IEEE Transactions on Wireless Communications.

[6]  Cheng-Xiang Wang,et al.  Towards Energy-Efficient Underlaid Device-to-Device Communications: A Joint Resource Management Approach , 2019, IEEE Access.

[7]  Ryu Miura,et al.  AC-POCA: Anticoordination Game Based Partially Overlapping Channels Assignment in Combined UAV and D2D-Based Networks , 2017, IEEE Transactions on Vehicular Technology.

[8]  Wenson Chang,et al.  Energy Efficient Relay Matching With Bottleneck Effect Elimination Power Adjusting for Full-Duplex Relay Assisted D2D Networks Using mmWave Technology , 2018, IEEE Access.

[9]  Ekram Hossain,et al.  Distributed Resource Allocation for Relay-Aided Device-to-Device Communication: A Message Passing Approach , 2014, IEEE Transactions on Wireless Communications.

[10]  Mohsen Guizani,et al.  5G D2D Networks: Techniques, Challenges, and Future Prospects , 2018, IEEE Systems Journal.

[11]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[12]  Nei Kato,et al.  On the Outage Probability of Device-to-Device-Communication-Enabled Multichannel Cellular Networks: An RSS-Threshold-Based Perspective , 2016, IEEE Journal on Selected Areas in Communications.

[13]  Zhihong Qian,et al.  Analysis of Joint Relay Selection and Resource Allocation Scheme for Relay-Aided D2D Communication Networks , 2019, IEEE Access.

[14]  Fengye Hu,et al.  Sum-Throughput Maximization by Power Allocation in WBAN With Relay Cooperation , 2019, IEEE Access.

[15]  Jia Liu,et al.  Energy efficient power allocation for relay-aided D2D communications in 5G networks , 2017, China Communications.

[16]  Xiaohu You,et al.  Energy-Efficient Joint Resource Allocation and Power Control for D2D Communications , 2016, IEEE Transactions on Vehicular Technology.

[17]  Justin P. Coon,et al.  Resource Allocation for Full-Duplex Relay-Assisted Device-to-Device Multicarrier Systems , 2017, IEEE Wireless Communications Letters.

[18]  Zhu Han,et al.  Game-theoretic resource allocation methods for device-to-device communication , 2014, IEEE Wireless Communications.

[19]  Nei Kato,et al.  Device-to-Device Communication in LTE-Advanced Networks: A Survey , 2015, IEEE Communications Surveys & Tutorials.

[20]  Kai Ma,et al.  Chance-Constrained Optimization in D2D-Based Vehicular Communication Network , 2019, IEEE Transactions on Vehicular Technology.

[21]  Alia Asheralieva,et al.  Bayesian Reinforcement Learning-Based Coalition Formation for Distributed Resource Sharing by Device-to-Device Users in Heterogeneous Cellular Networks , 2017, IEEE Transactions on Wireless Communications.

[22]  Zhu Han,et al.  Two-Stage Matching for Energy-Efficient Resource Management in D2D Cooperative Relay Communications , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[23]  Zhu Han,et al.  Energy-Efficient Resource Allocation for Device-to-Device Underlay Communication , 2022 .

[24]  Mianxiong Dong,et al.  Energy-Efficient Matching for Resource Allocation in D2D Enabled Cellular Networks , 2017, IEEE Transactions on Vehicular Technology.

[25]  Lingyang Song,et al.  D2D-U: Device-to-Device Communications in Unlicensed Bands for 5G System , 2016, IEEE Transactions on Wireless Communications.

[26]  I. Stancu-Minasian Nonlinear Fractional Programming , 1997 .

[27]  Huaping Liu,et al.  DFT-Based Channel Estimation Scheme for Sidelink in D2D Communication , 2017, Wirel. Pers. Commun..

[28]  Chengwen Xing,et al.  Uplink Resource Allocation for Relay-Aided Device-to-Device Communication , 2018, IEEE Transactions on Intelligent Transportation Systems.

[29]  Sohail Ahmed,et al.  A Distributed Multi-Agent RL-Based Autonomous Spectrum Allocation Scheme in D2D Enabled Multi-Tier HetNets , 2019, IEEE Access.