Deep Contextual Bandits for Fast Neighbor-Aided Initial Access in mmWave Cell-Free Networks

Access points (APs) in millimeter-wave (mmWave) user-centric (UC) networks will have sleep mode functionality. Initial access (IA) is a challenging problem in UC networks due to the coherent serving of the users. In this letter, a novel deep contextual bandit (DCB) learning-based instantaneous beam selection method is proposed as a complementary tool to current IA schemes. In the proposed approach, the DCB model at an AP uses beam selection information from the neighboring active APs as the input to solve the beam search problem of the host AP. The proposed fast beam selection scheme enables APs to be in energy-saving modes while maintaining the ability to serve users without any delay when restored. Simulations are carried out with realistic channel models generated using a ray-tracing tool. The results show that the proposed system with the 5G IA scheme can respond to dynamic throughput demands with negligible latency compared to the 5G IA scheme without the proposed scheme.

[1]  Hien Quoc Ngo,et al.  Energy Efficiency in Cell-Free Massive MIMO with Zero-Forcing Precoding Design , 2017, IEEE Communications Letters.

[2]  Theodore S. Rappaport,et al.  Millimeter Wave Channel Modeling and Cellular Capacity Evaluation , 2013, IEEE Journal on Selected Areas in Communications.

[3]  D. Woolley,et al.  The white paper , 1943, Public Health.

[4]  Muhammad Alrabeiah,et al.  Deep Learning for mmWave Beam and Blockage Prediction Using Sub-6 GHz Channels , 2019, IEEE Transactions on Communications.

[5]  W. R. Thompson ON THE LIKELIHOOD THAT ONE UNKNOWN PROBABILITY EXCEEDS ANOTHER IN VIEW OF THE EVIDENCE OF TWO SAMPLES , 1933 .

[6]  R. M. A. P. Rajatheva,et al.  Deep Contextual Bandits for Fast Initial Access in mmWave Based User-Centric Ultra-Dense Networks , 2021, 2021 IEEE 93rd Vehicular Technology Conference (VTC2021-Spring).

[7]  Stefano Buzzi,et al.  Cell-Free Massive MIMO: User-Centric Approach , 2017, IEEE Wireless Communications Letters.

[8]  Abbas Jamalipour,et al.  Wireless communications , 2005, GLOBECOM '05. IEEE Global Telecommunications Conference, 2005..

[9]  Walid Saad,et al.  Contextual Bandit Learning for Machine Type Communications in the Null Space of Multi-Antenna Systems , 2020, IEEE Transactions on Communications.

[10]  Shivendra S. Panwar,et al.  The Impact of Mobile Blockers on Millimeter Wave Cellular Systems , 2018, IEEE Journal on Selected Areas in Communications.

[11]  Joint Power Allocation and Load Balancing Optimization for Energy-Efficient Cell-Free Massive MIMO Networks , 2020, IEEE Transactions on Wireless Communications.

[12]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[13]  R. M. A. P. Rajatheva,et al.  6G White Paper on Machine Learning in Wireless Communication Networks , 2020, ArXiv.

[14]  Tao Jiang,et al.  BOOST: Base Station on-off Switching Strategy for Green Massive MIMO HetNets , 2017, IEEE Transactions on Wireless Communications.