A NOMA-Based Q-Learning Random Access Method for Machine Type Communications

Machine Type Communications (MTC) is a main use case of 5G and beyond wireless networks. Moreover, due to the ultra-dense nature of massive MTC networks, Random Access (RA) optimization is very challenging. A promising solution is to use machine learning methods, such as reinforcement learning, to efficiently accommodate the MTC devices in RA slots. In this sense, we propose a distributed method based on Non-Orthogonal Multiple Access (NOMA) and ${Q}$ -Learning to dynamically allocate RA slots to MTC devices. Numerical results show that the proposed method can significantly improve the network throughput when compared to recent work.

[1]  Paul Daniel Mitchell,et al.  Intelligent RACH Access Techniques to Support M2M Traffic in Cellular Networks , 2018, IEEE Transactions on Vehicular Technology.

[2]  Abbas Jamalipour,et al.  Wireless communications , 2005, GLOBECOM '05. IEEE Global Telecommunications Conference, 2005..

[3]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4]  Emil Björnson,et al.  A Random Access Protocol for Pilot Allocation in Crowded Massive MIMO Systems , 2016, IEEE Transactions on Wireless Communications.

[5]  Matti Latva-aho,et al.  Key drivers and research challenges for 6G ubiquitous wireless intelligence , 2019 .

[6]  Anass Benjebbour,et al.  Non-Orthogonal Multiple Access (NOMA) for Cellular Future Radio Access , 2013, 2013 IEEE 77th Vehicular Technology Conference (VTC Spring).

[7]  Xianbin Wang,et al.  Toward Massive Machine Type Communications in Ultra-Dense Cellular IoT Networks: Current Issues and Machine Learning-Assisted Solutions , 2018, IEEE Communications Surveys & Tutorials.

[8]  H. Vincent Poor,et al.  Distributed reinforcement learning in multi-agent networks , 2013, 2013 5th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[9]  Petar Popovski,et al.  From 5G to 6G: Has the Time for Modern Random Access Come? , 2019, 1903.03063.

[10]  Alagan Anpalagan,et al.  Base Station Selection in M2M Communication Using Q-Learning Algorithm in LTE-A Networks , 2015, 2015 IEEE 29th International Conference on Advanced Information Networking and Applications.

[11]  Petar Popovski,et al.  The METIS 5G System Concept: Meeting the 5G Requirements , 2016, IEEE Communications Magazine.

[12]  Jihun Moon,et al.  Access control of MTC devices using reinforcement learning approach , 2017, 2017 International Conference on Information Networking (ICOIN).

[13]  H. Vincent Poor,et al.  Finite Blocklength Information Theory: What Is the Practical Impact on Wireless Communications? , 2016, 2016 IEEE Globecom Workshops (GC Wkshps).

[14]  Xiaofeng Tao,et al.  QoS-based Dynamic Allocation and Adaptive ACB Mechanism for RAN Overload Avoidance in MTC , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[15]  王家志 Technical Specification Group Services and System Aspects ; 3 G Security ; Specification of the MILENAGE Algorithm Set : An example algorithm set for the 3 GPP authentication and key generation functions , 2001 .

[16]  Klaus Moessner,et al.  Energy-Efficient Short Packet Communications for Uplink NOMA-Based Massive MTC Networks , 2019, IEEE Transactions on Vehicular Technology.

[17]  Qi Bi,et al.  Ten Trends in the Cellular Industry and an Outlook on 6G , 2019, IEEE Communications Magazine.

[18]  Xianbin Wang,et al.  Collaborative Distributed Q-Learning for RACH Congestion Minimization in Cellular IoT Networks , 2019, IEEE Communications Letters.