Joint Power Allocation and Channel Assignment for NOMA With Deep Reinforcement Learning

Non-orthogonal multiple access (NOMA) has been considered as a significant candidate technique for the next generation wireless communication to support high throughput and massive connectivity. It allows different users to be multiplexed on one channel through applying superposition coding at the transmitter and successive interference cancellation (SIC) at the receiver. To fully utilize the benefit of the NOMA technique, the key problem is how to optimally allocate resources, such as power and channels, to users to maximize the system performance. There have been some existing works on the power allocation for the single-carrier NOMA system. However, how to optimally assign channels in the multi-carrier NOMA system is still unclear. In this paper, we propose a deep reinforcement learning framework to allocate resources to users in a near optimal way. Specifically, we exploit an attention-based neural network (ANN) to perform the channel assignment. Simulation results show that the proposed framework can achieve better system performance, compared with the state-of-the-art approaches.

[1]  Anass Benjebbour,et al.  Concept and practical considerations of non-orthogonal multiple access (NOMA) for future radio access , 2013, 2013 International Symposium on Intelligent Signal Processing and Communication Systems.

[2]  Tharmalingam Ratnarajah,et al.  A Minorization-Maximization Method for Optimizing Sum Rate in the Downlink of Non-Orthogonal Multiple Access Systems , 2015, IEEE Transactions on Signal Processing.

[3]  Thomas Hofmann,et al.  Kernel Methods for Missing Variables , 2005, AISTATS.

[4]  Derrick Wing Kwan Ng,et al.  Optimal Joint Power and Subcarrier Allocation for MC-NOMA Systems , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[5]  Moritz Diehl,et al.  Local Convergence of Sequential Convex Programming for Nonconvex Optimization , 2010 .

[6]  D. Hunter,et al.  A Tutorial on MM Algorithms , 2004 .

[7]  H. Vincent Poor,et al.  Cooperative Non-Orthogonal Multiple Access in 5G Systems , 2015, IEEE Communications Letters.

[8]  Li Ping,et al.  Comparison of orthogonal and non-orthogonal approaches to future wireless cellular systems , 2006 .

[9]  Ioannis Krikidis,et al.  Fairness for Non-Orthogonal Multiple Access in 5G Systems , 2015, IEEE Signal Processing Letters.

[10]  Ya-Feng Liu,et al.  On the Complexity of Joint Subcarrier and Power Allocation for Multi-User OFDMA Systems , 2012, IEEE Transactions on Signal Processing.

[11]  Muhammad Imran,et al.  Non-Orthogonal Multiple Access (NOMA) for cellular future radio access , 2017 .

[12]  Di Yuan,et al.  Joint Optimization of Power and Channel Allocation with Non-Orthogonal Multiple Access for 5G Cellular Systems , 2014, 2015 IEEE Global Communications Conference (GLOBECOM).

[13]  Guan Gui,et al.  Deep Learning for an Effective Nonorthogonal Multiple Access Scheme , 2018, IEEE Transactions on Vehicular Technology.

[14]  Jinho Choi,et al.  Power Allocation for Max-Sum Rate and Max-Min Rate Proportional Fairness in NOMA , 2016, IEEE Communications Letters.

[15]  Li Ping,et al.  Comparison of orthogonal and non-orthogonal approaches to future wireless cellular systems , 2006, IEEE Vehicular Technology Magazine.

[16]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[17]  P. Stoica,et al.  Cyclic minimizers, majorization techniques, and the expectation-maximization algorithm: a refresher , 2004, IEEE Signal Process. Mag..

[18]  Yan Chen,et al.  Lyapunov Optimized Resource Management for Multiuser Mobile Video Streaming , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Zhu Han,et al.  User Scheduling and Resource Allocation in HetNets With Hybrid Energy Supply: An Actor-Critic Reinforcement Learning Approach , 2018, IEEE Transactions on Wireless Communications.

[20]  Rudy Lauwereins,et al.  Impact of CSI Feedback Strategies on LTE Downlink and Reinforcement Learning Solutions for Optimal Allocation , 2017, IEEE Transactions on Vehicular Technology.

[21]  Pingzhi Fan,et al.  Impact of User Pairing on 5G Nonorthogonal Multiple-Access Downlink Transmissions , 2016, IEEE Transactions on Vehicular Technology.

[22]  D. Hunter,et al.  Variable Selection using MM Algorithms. , 2005, Annals of statistics.

[23]  Pingzhi Fan,et al.  A Novel Power Allocation Scheme Under Outage Constraints in NOMA Systems , 2016, IEEE Signal Processing Letters.

[24]  K. J. Ray Liu,et al.  Why Time Reversal for Future 5G Wireless? [Perspectives] , 2016, IEEE Signal Processing Magazine.

[25]  Jiaheng Wang,et al.  On Optimal Power Allocation for Downlink Non-Orthogonal Multiple Access Systems , 2017, IEEE Journal on Selected Areas in Communications.

[26]  Geng Wu,et al.  5G Network Capacity: Key Elements and Technologies , 2014, IEEE Vehicular Technology Magazine.

[27]  Yan Chen,et al.  Cluster-Oriented Device-to-Device Multimedia Communications: Joint Power, Bandwidth, and Link Selection Optimization , 2018, IEEE Transactions on Vehicular Technology.

[28]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[29]  H. Vincent Poor,et al.  Reinforcement Learning-Based NOMA Power Allocation in the Presence of Smart Jamming , 2018, IEEE Transactions on Vehicular Technology.

[30]  Jeffrey G. Andrews,et al.  What Will 5G Be? , 2014, IEEE Journal on Selected Areas in Communications.

[31]  Houqiang Li,et al.  MCast: High-Quality Linear Video Transmission With Time and Frequency Diversities , 2018, IEEE Transactions on Image Processing.