Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach

In intelligent transportation systems (ITS), vehicles are expected to feature with advanced applications and services which demand ultra-high data rates and low-latency communications. For that, the millimeter wave (mmWave) communication has been emerging as a very promising solution. However, incorporating the mmWave into ITS is particularly challenging due to the high mobility of vehicles and the inherent sensitivity of mmWave beams to dynamic blockages. This article addresses these problems by developing an optimal beam association framework for mmWave vehicular networks under high mobility. Specifically, we use the semi-Markov decision process to capture the dynamics and uncertainty of the environment. The Q-learning algorithm is then often used to find the optimal policy. However, Q-learning is notorious for its slow-convergence. Instead of adopting deep reinforcement learning structures (like most works in the literature), we leverage the fact that there are usually multiple vehicles on the road to speed up the learning process. To that end, we develop a lightweight yet very effective parallel Q-learning algorithm to quickly obtain the optimal policy by simultaneously learning from various vehicles. Extensive simulations demonstrate that our proposed solution can increase the data rate by 47% and reduce the disconnection probability by 29% compared to other solutions.

[1]  Wei Yu,et al.  Hybrid Analog and Digital Beamforming for mmWave OFDM Large-Scale Antenna Arrays , 2017, IEEE Journal on Selected Areas in Communications.

[2]  Hani Mehrpouyan,et al.  Self-Organizing mm Wave Networks: A Power Allocation Scheme Based on Machine Learning , 2018, 2018 11th Global Symposium on Millimeter Waves (GSMM).

[3]  Sundeep Rangan,et al.  Improved Handover Through Dual Connectivity in 5G mmWave Mobile Networks , 2016, IEEE Journal on Selected Areas in Communications.

[4]  Bo Ai,et al.  Channel Characterization for Satellite Link and Terrestrial Link of Vehicular Communication in the mmWave Band , 2019, IEEE Access.

[5]  Robert W. Heath,et al.  Beam design for beam switching based millimeter wave vehicle-to-infrastructure communications , 2016, 2016 IEEE International Conference on Communications (ICC).

[6]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .

[7]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[8]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[9]  Xuemin Shen,et al.  Video Quality Provisioning for Millimeter Wave 5G Cellular Networks With Link Outage , 2015, IEEE Transactions on Wireless Communications.

[10]  Jeffrey G. Andrews,et al.  Downlink and Uplink Cell Association With Traditional Macrocells and Millimeter Wave Small Cells , 2016, IEEE Transactions on Wireless Communications.

[11]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[12]  Sundeep Rangan,et al.  An MDP model for optimal handover decisions in mmWave cellular networks , 2015, 2016 European Conference on Networks and Communications (EuCNC).

[13]  Xiaohu Tang,et al.  SMDP-Based Coordinated Virtual Machine Allocations in Cloud-Fog Computing Systems , 2018, IEEE Internet of Things Journal.

[14]  Robert W. Heath,et al.  MmWave Vehicle-to-Infrastructure Communication: Analysis of Urban Microcellular Networks , 2017, IEEE Transactions on Vehicular Technology.

[15]  Furqan Jameel,et al.  Propagation Channels for mmWave Vehicular Communications: State-of-the-art and Future Research Directions , 2018, IEEE Wireless Communications.

[16]  Choong Seon Hong,et al.  Reinforcement Learning-Based Vehicle-Cell Association Algorithm for Highly Mobile Millimeter Wave Communication , 2019, IEEE Transactions on Cognitive Communications and Networking.

[17]  Sundeep Rangan,et al.  Multi-connectivity in 5G mmWave cellular networks , 2016, 2016 Mediterranean Ad Hoc Networking Workshop (Med-Hoc-Net).

[18]  Erik G. Ström,et al.  Location-aided mm-wave channel estimation for vehicular communication , 2016, 2016 IEEE 17th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[19]  Ahmed Alkhateeb,et al.  Deep Reinforcement Learning for 5G Networks: Joint Beamforming, Power Control, and Interference Coordination , 2019, IEEE Transactions on Communications.

[20]  Optimal Beam Association in mmWave Vehicular Networks with Parallel Reinforcement Learning , 2020, GLOBECOM 2020 - 2020 IEEE Global Communications Conference.

[21]  Li-Chun Wang,et al.  Learning-assisted beam search for indoor mmWave networks , 2018, 2018 IEEE Wireless Communications and Networking Conference Workshops (WCNCW).

[22]  Guihai Chen,et al.  Millimeter Wave Communication: A Comprehensive Survey , 2018, IEEE Communications Surveys & Tutorials.

[23]  Vincent W. S. Wong,et al.  An MDP-Based Vertical Handoff Decision Algorithm for Heterogeneous Wireless Networks , 2008, IEEE Transactions on Vehicular Technology.

[24]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[25]  Michele Rossi,et al.  Beam Training and Data Transmission Optimization in Millimeter-Wave Vehicular Networks , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[26]  Maria Scalabrin,et al.  Adaptive Millimeter-Wave Communications Exploiting Mobility and Blockage Dynamics , 2020, ICC 2020 - 2020 IEEE International Conference on Communications (ICC).

[27]  Walid Saad,et al.  Joint Millimeter Wave and Microwave Resources Allocation in Cellular Networks With Dual-Mode Base Stations , 2016, IEEE Transactions on Wireless Communications.

[28]  Lei Zhang,et al.  Reinforcement Learning Method for Beam Management in Millimeter-Wave Networks , 2019, 2019 UK/ China Emerging Technologies (UCET).

[29]  Carlo Fischione,et al.  Spectrum Sharing in mmWave Cellular Networks via Cell Association, Coordination, and Beamforming , 2016, IEEE Journal on Selected Areas in Communications.

[30]  Branka Vucetic,et al.  Managing Vertical Handovers in Millimeter Wave Heterogeneous Networks , 2019, IEEE Transactions on Communications.

[31]  Masahiro Morikura,et al.  Reinforcement learning based predictive handover for pedestrian-aware mmWave networks , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[32]  Bin Li,et al.  Learning-Based Spectrum Sharing and Spatial Reuse in mm-Wave Ultradense Networks , 2018, IEEE Transactions on Vehicular Technology.

[33]  Jeroen Famaey,et al.  A Scalable Parallel Q-Learning Algorithm for Resource Constrained Decentralized Computing Environments , 2016, 2016 2nd Workshop on Machine Learning in HPC Environments (MLHPC).

[34]  Eryk Dutkiewicz,et al.  “Jam Me If You Can:” Defeating Jammer With Deep Dueling Neural Network Architecture and Ambient Backscattering Augmented Communications , 2019, IEEE Journal on Selected Areas in Communications.

[35]  Carlo Fischione,et al.  The impact of beamforming and coordination on spectrum pooling in mmWave cellular networks , 2016, 2016 50th Asilomar Conference on Signals, Systems and Computers.

[36]  Shiwen Mao,et al.  Dealing with Limited Backhaul Capacity in Millimeter-Wave Systems: A Deep Reinforcement Learning Approach , 2018, IEEE Communications Magazine.

[37]  Anja Klein,et al.  An Online Context-Aware Machine Learning Algorithm for 5G mmWave Vehicular Communications , 2018, IEEE/ACM Transactions on Networking.

[38]  Sung-En Chiu,et al.  Active Learning and CSI Acquisition for mmWave Initial Alignment , 2018, IEEE Journal on Selected Areas in Communications.

[39]  Robert W. Heath,et al.  Millimeter Wave Vehicular Communications: A Survey , 2016, Found. Trends Netw..

[40]  Muhammad Alrabeiah,et al.  Deep Learning for mmWave Beam and Blockage Prediction Using Sub-6 GHz Channels , 2019, IEEE Transactions on Communications.

[41]  Byung-Chul Kim,et al.  Handover Mechanism in NR for Ultra-Reliable Low-Latency Communications , 2018, IEEE Network.

[42]  Ying-Chang Liang,et al.  The SMART Handoff Policy for Millimeter Wave Heterogeneous Cellular Networks , 2018, IEEE Transactions on Mobile Computing.

[43]  Eryk Dutkiewicz,et al.  Optimal and Fast Real-Time Resource Slicing With Deep Dueling Neural Networks , 2019, IEEE Journal on Selected Areas in Communications.