Resource Allocation in Multi-cell NOMA Systems with Multi-Agent Deep Reinforcement Learning

Non-orthogonal multiple access (NOMA) technology can meet user access requirements and improve system capacity. In this paper, we investigate the joint subcarrier assignment and power allocation problem in an uplink multi-cell NOMA system to maximize the energy efficiency (EE) while ensuring the minimum data rate of all users. We propose a multi-agent deep reinforcement learning (MADRL) method with centralized training and distributed execution to solve this dynamic optimization problem. In our method, we design a deep q-network (DQN) with parameter sharing to generate the subcarrier assignment policy, and use multi-agent deep deterministic policy gradient (MADDPG) network for power allocation of NOMA user. Finally, we adjust the entire resource allocation policy by updating the parameters of neural networks according to the reward. The simulation shows that our method has better and more stable sum EE than centralized and distributed methods.

[1]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[2]  Lajos Hanzo,et al.  Nonorthogonal Multiple Access for 5G and Beyond , 2017, Proceedings of the IEEE.

[3]  Noe Casas,et al.  Deep Deterministic Policy Gradient for Urban Traffic Light Control , 2017, ArXiv.

[4]  Lenan Wu,et al.  Power Allocation in Multi-User Cellular Networks with Deep Q Learning Approach , 2018, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[5]  Dorian Kodelja,et al.  Multiagent cooperation and competition with deep reinforcement learning , 2015, PloS one.

[6]  Mykel J. Kochenderfer,et al.  Cooperative Multi-agent Control Using Deep Reinforcement Learning , 2017, AAMAS Workshops.

[7]  Yung-Fang Chen,et al.  Low-Complexity Resource Allocation for Downlink Multicarrier NOMA Systems , 2018, 2018 IEEE 29th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC).

[8]  Tamer Basar,et al.  Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents , 2018, ICML.

[9]  Dongning Guo,et al.  Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks , 2018, IEEE Journal on Selected Areas in Communications.

[10]  Huiming Wang,et al.  Energy-Efficient Transmission Design in Non-orthogonal Multiple Access , 2016, IEEE Transactions on Vehicular Technology.

[11]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[12]  Youyun Xu,et al.  Energy-Efficient Resource Allocation in Uplink NOMA Systems with Deep Reinforcement Learning , 2019, 2019 11th International Conference on Wireless Communications and Signal Processing (WCSP).

[13]  H. Vincent Poor,et al.  Reinforcement Learning-Based NOMA Power Allocation in the Presence of Smart Jamming , 2018, IEEE Transactions on Vehicular Technology.

[14]  Yawen Fan,et al.  Distributed Approximating Global Optimality with Local Reinforcement Learning in HetNets , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.