Dynamic User Pairing and Power Allocation for NOMA with Deep Reinforcement Learning

In this paper, we investigate the user pairing and power allocation scheme for multiple cellular users (CUs) under the downlink non-orthogonal multiple access (NOMA) system. To maximize the sum-rate of all CUs, we formulate the resource allocation issue through optimizing the user pairing relationship and power allocation method. However, due to the nonconvex property of the formulated problem, the original problem is decoupled into two subproblems. First, the optimal power allocation scheme with a given subchannel assignment is obtained via a closed-form solution. Furthermore, based on the obtained optimal allocation scheme, a classical deep reinforcement learning (DRL) method called Deep Q-Network (DQN) algorithm is adopted to find the optimal user pairing scheme, where the DQN algorithm is characterized by higher learning efficiency and better performance of the features extraction ability compared with traditional reinforcement learning (RL) schemes. Simulation results validate the effectiveness of our proposed resource allocation, as compared against the RL based scheme and conventional orthogonal multiple access (OMA) method.

[1]  Wei Liang,et al.  User Pairing for Downlink Non-Orthogonal Multiple Access Networks Using Matching Algorithm , 2017, IEEE Transactions on Communications.

[2]  Takehiro Nakamura,et al.  5G Evolution and 6G , 2020, 2020 IEEE Symposium on VLSI Technology.

[3]  H. Vincent Poor,et al.  Reinforcement Learning-Based NOMA Power Allocation in the Presence of Smart Jamming , 2018, IEEE Transactions on Vehicular Technology.

[4]  Keping Long,et al.  Deep Neural Network for Resource Management in NOMA Networks , 2020, IEEE Transactions on Vehicular Technology.

[5]  Qi Zhu,et al.  Joint User-Channel Assignment and Power Allocation for Non-Orthogonal Multiple Access Relaying Networks , 2019, IEEE Access.

[6]  Wei Li,et al.  Spectrum Resource and Power Allocation With Adaptive Proportional Fair User Pairing for NOMA Systems , 2019, IEEE Access.

[7]  Yik-Chung Wu,et al.  Sum Rate Maximization of Secure NOMA Transmission with Imperfect CSI , 2020, ICC 2020 - 2020 IEEE International Conference on Communications (ICC).

[8]  H. Vincent Poor,et al.  Joint Power and Time Allocation for NOMA–MEC Offloading , 2018, IEEE Transactions on Vehicular Technology.