A Q-Learning-Based Approach for Distributed Beam Scheduling in mmWave Networks

We consider the problem of distributed downlink beam scheduling and power allocation for millimeter-Wave (mmWave) cellular networks where multiple base stations (BSs) belonging to different service operators share the same unlicensed spectrum with no centralized control or explicit coordination among them. Our goal is to design efficient distributed beam scheduling and power allocation schemes such that the network-level payoff, defined as the weighted sum of the total throughput and a power penalization term, can be maximized. To this end, we propose a distributed scheduling approach to power allocation and adaptation for efficient interference management over the shared spectrum by modeling each BS as an independent Q-learning agent. As a baseline, we compare the proposed approach to the state-of-the-art non-cooperative game-based approach which was previously developed for the same problem. We conduct extensive experiments under various scenarios to verify the effect of multiple factors on the performance of both approaches. Experiment results show that the proposed approach adapts well to different interference conditions by learning from experience and thus can achieve higher payoff than the game-based approach. The proposed approach can also be integrated into our previously developed Lyapunov stochastic optimization framework for the purpose of network utility maximization with optimality guarantee. As a result, the weights in the payoff function can be automatically and optimally determined by the virtual queue values from the sub-problems derived from the Lyapunov optimization framework.

[1]  M. Yacoub,et al.  Nakagami-m approximation to the sum of M non-identical independent Nakagami-m variates , 2004 .

[2]  Francisco Facchinei,et al.  Design of Cognitive Radio Systems Under Temperature-Interference Constraints: A Variational Inequality Approach , 2010, IEEE Transactions on Signal Processing.

[3]  Eitan Altman,et al.  CDMA Uplink Power Control as a Noncooperative Game , 2002, Wirel. Networks.

[4]  Christian Hartmann,et al.  On Connectivity Limits in Ad Hoc Networks with Beamforming Antennas , 2009, EURASIP J. Wirel. Commun. Netw..

[5]  Euhanna Ghadimi,et al.  A reinforcement learning approach to power control and rate adaptation in cellular networks , 2016, 2017 IEEE International Conference on Communications (ICC).

[6]  H. Vincent Poor,et al.  Distributed reinforcement learning in multi-agent networks , 2013, 2013 5th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[7]  Enabling Uncoordinated Spectrum Sharing in Millimeter Wave Networks Using Carrier Sensing , 2020, 2020 54th Asilomar Conference on Signals, Systems, and Computers.

[8]  Dongning Guo,et al.  Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks , 2018, IEEE Journal on Selected Areas in Communications.

[9]  Leonardo Badia,et al.  Spectrum sharing improves the network efficiency for cellular operators , 2014, IEEE Communications Magazine.

[10]  Asuman E. Ozdaglar,et al.  Near-Optimal Power Control in Wireless Networks: A Potential Game Approach , 2010, 2010 Proceedings IEEE INFOCOM.

[11]  Meryem Simsek,et al.  Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning , 2020, 2020 IEEE 21st International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[12]  Ram Ramanathan,et al.  On the performance of ad hoc networks with beamforming antennas , 2001, MobiHoc '01.

[13]  Lenan Wu,et al.  Power Allocation in Multi-User Cellular Networks with Deep Q Learning Approach , 2018, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[14]  Ana Galindo-Serrano,et al.  Distributed Q-Learning for Aggregated Interference Control in Cognitive Radio Networks , 2010, IEEE Transactions on Vehicular Technology.

[15]  Jeffrey G. Andrews,et al.  On the Feasibility of Sharing Spectrum Licenses in mmWave Cellular Systems , 2015, IEEE Transactions on Communications.

[16]  H.-M. Gross,et al.  A neural field approach to topological reinforcement learning in continuous action spaces , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[17]  Ashwin Ram,et al.  Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces , 1997, Adapt. Behav..

[18]  Xuesong Qiu,et al.  Interference Control Based on Stackelberg Game for D2D Underlaying 5G mmWave Small Cell Networks , 2019, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[19]  Sneha Kumar Kasera,et al.  A Non-cooperative Game-based Distributed Beam Scheduling for 5G mm-Wave Networks , 2020, ArXiv.

[20]  Carlo Fischione,et al.  Spectrum Pooling in MmWave Networks: Opportunities, Challenges, and Enablers , 2016, IEEE Communications Magazine.

[21]  François Horlin,et al.  Power allocation in cognitive radio networks using distributed machine learning , 2012, 2012 IEEE 23rd International Symposium on Personal, Indoor and Mobile Radio Communications - (PIMRC).