Reinforcement Learning-Based Joint User Pairing and Power Allocation in MIMO-NOMA Systems

In this paper, we consider a multiple-input multiple-output (MIMO)—non-orthogonal multiple access (NOMA) system with reinforcement learning (RL). NOMA, which is a technique for increasing the spectrum efficiency, has been extensively studied in fifth-generation (5G) wireless communication systems. The application of MIMO to NOMA can result in an even higher spectral efficiency. Moreover, user pairing and power allocation problem are important techniques in NOMA. However, NOMA has a fundamental limitation of the high computational complexity due to rapidly changing radio channels. This limitation makes it difficult to utilize the characteristics of the channel and allocate radio resources efficiently. To reduce the computational complexity, we propose an RL-based joint user pairing and power allocation scheme. By applying Q-learning, we are able to perform user pairing and power allocation simultaneously, which reduces the computational complexity. The simulation results show that the proposed scheme achieves a sum rate similar to that achieved with the exhaustive search (ES).

[1]  Guo Li,et al.  User Pairing and Pair Scheduling in Massive MIMO-NOMA Systems , 2018, IEEE Communications Letters.

[2]  Yang Hu,et al.  Joint Power Allocation and Channel Assignment for NOMA With Deep Reinforcement Learning , 2019, IEEE Journal on Selected Areas in Communications.

[3]  Nam-I Kim,et al.  Deep Learning-Aided SCMA , 2018, IEEE Communications Letters.

[4]  Ioannis Krikidis,et al.  Fairness for Non-Orthogonal Multiple Access in 5G Systems , 2015, IEEE Signal Processing Letters.

[5]  Felipe A. P. de Figueiredo,et al.  Multi-Agent Deep Learning for Multi-Channel Access in Slotted Wireless Networks , 2020, IEEE Access.

[6]  Ying-Chang Liang,et al.  Applications of Deep Reinforcement Learning in Communications and Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[7]  Kun Lu,et al.  A Survey of Non-Orthogonal Multiple Access for 5G , 2017, 2017 IEEE 86th Vehicular Technology Conference (VTC-Fall).

[8]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[9]  Gianluigi Liva,et al.  Improving the Decoding Threshold of Tailbiting Spatially Coupled LDPC Codes by Energy Shaping , 2018, IEEE Communications Letters.

[10]  Liang Xiao,et al.  Fast Reinforcement Learning for Anti-jamming Communications , 2020, ArXiv.

[11]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[12]  Xiangyang Wang,et al.  User Pairing and Power Allocation for Downlink Non-Orthogonal Multiple Access , 2016, 2016 IEEE Globecom Workshops (GC Wkshps).

[13]  H. Vincent Poor,et al.  Reinforcement Learning-Based NOMA Power Allocation in the Presence of Smart Jamming , 2018, IEEE Transactions on Vehicular Technology.

[14]  Hsiao-Hwa Chen,et al.  Joint User Pairing and Power Allocation in Virtual MIMO Systems , 2018, IEEE Transactions on Wireless Communications.

[15]  Ekram Hossain,et al.  A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks , 2019, ArXiv.

[16]  Anass Benjebbour,et al.  Non-Orthogonal Multiple Access (NOMA) for Cellular Future Radio Access , 2013, 2013 IEEE 77th Vehicular Technology Conference (VTC Spring).

[17]  Haijian Sun,et al.  A NOMA and MU-MIMO Supported Cellular Network with Underlaid D2D Communications , 2016, 2016 IEEE 83rd Vehicular Technology Conference (VTC Spring).

[18]  Hamed Haddadi,et al.  Deep Learning in Mobile and Wireless Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[19]  Bhaskar Krishnamachari,et al.  Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks , 2018, IEEE Transactions on Cognitive Communications and Networking.

[20]  Marina Petrova,et al.  Proportional fairness-based user pairing and power allocation for non-orthogonal multiple access , 2015, 2015 IEEE 26th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC).

[21]  Jae-Mo Kang,et al.  Deep Learning-Based MIMO-NOMA With Imperfect SIC Decoding , 2020, IEEE Systems Journal.

[22]  Zhengang Pan,et al.  On the Ergodic Capacity of MIMO NOMA Systems , 2015, IEEE Wireless Communications Letters.

[23]  Fumiyuki Adachi,et al.  The Application of MIMO to Non-Orthogonal Multiple Access , 2015, IEEE Transactions on Wireless Communications.

[24]  Hyundong Shin,et al.  Power Allocation in Cache-Aided NOMA Systems: Optimization and Deep Reinforcement Learning Approaches , 2019, IEEE Transactions on Communications.

[25]  Soung Chang Liew,et al.  Outage-Limit-Approaching Channel Coding for Future Wireless Communications: Root-Protograph Low-Density Parity-Check Codes , 2019, IEEE Vehicular Technology Magazine.

[26]  Tiejun Lv,et al.  Multi-Agent Reinforcement Learning-Based User Pairing in Multi-Carrier NOMA Systems , 2019, 2019 IEEE International Conference on Communications Workshops (ICC Workshops).

[27]  Guan Gui,et al.  Deep Learning for an Effective Nonorthogonal Multiple Access Scheme , 2018, IEEE Transactions on Vehicular Technology.

[28]  George K. Karagiannidis,et al.  A Survey on Non-Orthogonal Multiple Access for 5G Networks: Research Challenges and Future Trends , 2017, IEEE Journal on Selected Areas in Communications.