Distributed Q-Learning Aided Uplink Grant-Free NOMA for Massive Machine-Type Communications

The explosive growth of machine-type communications (MTC) devices poses critical challenges to the existing cellular networks. Therefore, how to support massive MTC devices with limited resources is an urgent problem to be solved. Bursty traffic is an important characteristic of MTC devices, which makes it difficult for agents to learn useful experience and has a negative impact on model convergence. However, most existing reinforcement learning-based literatures assume that devices have saturate data. Towards this end, we propose two distributed Q-learning aided uplink grant-free non-orthogonal multiple access (NOMA) schemes (including all-devices distributed Q-learning (ADDQ) scheme and portion-devices distributed Q-learning (PDDQ) scheme) to maximize the number of accessible devices, where the bursty traffic of massive MTC devices is carefully considered. In order to reduce the dimension of scheduling space and mitigate the impact of bursty traffic, the idea of grouping devices as well as transmission resources and the intermittent learning mode are adopted in our schemes. Extensive numerical results demonstrate the advantages of proposed schemes from multiple perspectives.

[1]  Amr M. Youssef,et al.  NOMA-Assisted Machine-Type Communications in UDN: State-of-the-Art and Challenges , 2020, IEEE Communications Surveys & Tutorials.

[2]  Geoffrey Ye Li,et al.  Deep Reinforcement Learning Based Resource Allocation for V2V Communications , 2018, IEEE Transactions on Vehicular Technology.

[3]  Dong In Kim,et al.  LTE/LTE-A Random Access for Massive Machine-Type Communications in Smart Cities , 2016, IEEE Communications Magazine.

[4]  Huici Wu,et al.  Deep Reinforcement Learning for Throughput Improvement of the Uplink Grant-Free NOMA System , 2020, IEEE Internet of Things Journal.

[5]  Jang-Won Lee,et al.  SARA: Sparse Code Multiple Access-Applied Random Access for IoT Devices , 2018, IEEE Internet of Things Journal.

[6]  Jinho Choi,et al.  NOMA-Based Random Access With Multichannel ALOHA , 2017, IEEE Journal on Selected Areas in Communications.

[7]  Sarah J. Johnson,et al.  Grant-Free Non-Orthogonal Multiple Access for IoT: A Survey , 2019, IEEE Communications Surveys & Tutorials.

[8]  Geoffrey Ye Li,et al.  Spectrum Sharing in Vehicular Networks Based on Multi-Agent Reinforcement Learning , 2019, IEEE Journal on Selected Areas in Communications.

[9]  Branka Vucetic,et al.  A Novel Analytical Framework for Massive Grant-Free NOMA , 2019, IEEE Transactions on Communications.

[10]  Wuyi Yue,et al.  The effect of capture on performance of multichannel slotted ALOHA systems , 1991, IEEE Trans. Commun..

[11]  Alireza Bayesteh,et al.  Uplink contention based SCMA for 5G radio access , 2014, 2014 IEEE Globecom Workshops (GC Wkshps).

[12]  Ying-Chang Liang,et al.  Applications of Deep Reinforcement Learning in Communications and Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[13]  Alireza Bayesteh,et al.  Blind detection of SCMA for uplink grant-free multiple-access , 2014, 2014 11th International Symposium on Wireless Communications Systems (ISWCS).

[14]  Hirley Alves,et al.  A NOMA-Based Q-Learning Random Access Method for Machine Type Communications , 2020, IEEE Wireless Communications Letters.

[15]  Zhi Chen,et al.  Efficient Multi-User Detection for Uplink Grant-Free NOMA: Prior-Information Aided Adaptive Compressive Sensing Perspective , 2017, IEEE Journal on Selected Areas in Communications.

[16]  Xu Li,et al.  Non-Orthogonal Random Access for 5G Networks , 2017, IEEE Transactions on Wireless Communications.

[17]  Hsiao-Hwa Chen,et al.  M2M Communications in 3GPP LTE/LTE-A Networks: Architectures, Service Requirements, Challenges, and Applications , 2015, IEEE Communications Surveys & Tutorials.

[18]  Hilbert J. Kappen,et al.  Speedy Q-Learning , 2011, NIPS.

[19]  Bo Ai,et al.  Coded Tandem Spreading Multiple Access for Massive Machine-Type Communications , 2018, IEEE Wireless Communications.

[20]  Sarah J. Johnson,et al.  On the Fundamental Limits of Random Non-Orthogonal Multiple Access in Cellular Massive IoT , 2017, IEEE Journal on Selected Areas in Communications.

[21]  Xianbin Wang,et al.  Toward Massive Machine Type Communications in Ultra-Dense Cellular IoT Networks: Current Issues and Machine Learning-Assisted Solutions , 2018, IEEE Communications Surveys & Tutorials.

[22]  Arumugam Nallanathan,et al.  Reinforcement Learning for Real-Time Optimization in NB-IoT Networks , 2019, IEEE Journal on Selected Areas in Communications.

[23]  Xianbin Wang,et al.  Collaborative Distributed Q-Learning for RACH Congestion Minimization in Cellular IoT Networks , 2019, IEEE Communications Letters.

[24]  Kobi Cohen,et al.  Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access , 2017, IEEE Transactions on Wireless Communications.

[25]  Wei Peng,et al.  A Novel Perspective on Multiple Access in 5G Network: Framework and Solutions , 2019, IEEE Wireless Communications.