Reinforcement Learning-Based Vehicle-Cell Association Algorithm for Highly Mobile Millimeter Wave Communication

Vehicle-to-everything (V2X) communication is a growing area of communication with a variety of use cases. This paper investigates the problem of vehicle-cell association in millimeter wave (mmWave) communication networks. The aim is to maximize the time average rate per vehicular user (VUE) while ensuring a target minimum rate for all VUEs with low signaling overhead. We first formulate the user (vehicle) association problem as a discrete non-convex optimization problem. Then, by leveraging tools from machine learning, specifically distributed deep reinforcement learning (DDRL) and the asynchronous actor critic algorithm (A3C), we propose a low complexity algorithm that approximates the solution of the proposed optimization problem. The proposed DDRL-based algorithm endows every road side unit (RSU) with a local RL agent that selects a local action based on the observed input state. Actions of different RSUs are forwarded to a central entity, that computes a global reward which is then fed back to RSUs. It is shown that each independently trained RL performs the vehicle-RSU association action with low control overhead and less computational complexity compared to running an online complex algorithm to solve the non-convex optimization problem. Finally, simulation results show that the proposed solution achieves up to 15% gains in terms of sum rate and 20% reduction in VUE outages compared to several baseline designs.

[1]  Daniela Tuninetti,et al.  Coverage in mmWave Cellular Networks With Base Station Co-Operation , 2015, IEEE Transactions on Wireless Communications.

[2]  Theodore S. Rappaport,et al.  Channel Model with Improved Accuracy and Efficiency in mmWave Bands , 2017 .

[3]  Tianqing Zhou,et al.  User Association With Maximizing Weighted Sum Energy Efficiency for Massive MIMO-Enabled Heterogeneous Cellular Networks , 2016, IEEE Communications Letters.

[4]  Angela Sara Cacciapuoti,et al.  Mobility-Aware User Association for 5G mmWave Networks , 2017, IEEE Access.

[5]  Wessam Ajib,et al.  On the user association and resource allocation in hetnets with mmWave base stations , 2017, 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC).

[6]  Wei Yu,et al.  Distributed Pricing-Based User Association for Downlink Heterogeneous Cellular Networks , 2014, IEEE Journal on Selected Areas in Communications.

[7]  Giuseppe Caire,et al.  Optimal User-Cell Association for Massive MIMO Wireless Networks , 2014, IEEE Transactions on Wireless Communications.

[8]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[9]  Mehdi Bennis,et al.  Reinforcement Learning Based Scheduling Algorithm for Optimizing Age of Information in Ultra Reliable Low Latency Networks , 2018, 2019 IEEE Symposium on Computers and Communications (ISCC).

[10]  John B. Kenney,et al.  Dedicated Short-Range Communications (DSRC) Standards in the United States , 2011, Proceedings of the IEEE.

[11]  Theodore S. Rappaport,et al.  3-D Millimeter-Wave Statistical Channel Model for 5G Wireless System Design , 2016, IEEE Transactions on Microwave Theory and Techniques.

[12]  Mai H. Vu,et al.  User Association in Millimeter Wave MIMO Networks , 2018, ArXiv.

[13]  Exploring 5 G New Radio : Use Cases , Capabilities & Timeline , 2016 .

[14]  Ying Li,et al.  Deep Learning Coordinated Beamforming for Highly-Mobile Millimeter Wave Systems , 2018, IEEE Access.

[15]  Michael L. Honig,et al.  Energy-Efficient Cell Activation, User Association, and Spectrum Allocation in Heterogeneous Networks , 2015, IEEE Journal on Selected Areas in Communications.

[16]  H. Vincent Poor,et al.  Ultrareliable and Low-Latency Wireless Communication: Tail, Risk, and Scale , 2018, Proceedings of the IEEE.

[17]  Weng Chon Ao,et al.  Approximation Algorithms for Online User Association in Multi-Tier Multi-Cell Mobile Networks , 2017, IEEE/ACM Transactions on Networking.

[18]  Richard D. Gitlin,et al.  Base station prediction and proactive mobility management in virtual cells using recurrent neural networks , 2017, 2017 IEEE 18th Wireless and Microwave Technology Conference (WAMICON).

[19]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[20]  Yigal Bejerano,et al.  Cell Breathing Techniques for Load Balancing in Wireless LANs , 2009, IEEE Trans. Mob. Comput..

[21]  Akihito Taya,et al.  Deep-Reinforcement-Learning-Based Distributed Vehicle Position Controls for Coverage Expansion in mmWave V2X , 2018, IEICE Trans. Commun..

[22]  Theodore S. Rappaport,et al.  Millimeter Wave Channel Modeling and Cellular Capacity Evaluation , 2013, IEEE Journal on Selected Areas in Communications.

[23]  Yongming Huang,et al.  Joint Design of User Association and Power Allocation With Proportional Fairness in Massive MIMO HetNets , 2017, IEEE Access.

[24]  Walid Saad,et al.  Federated Learning for Ultra-Reliable Low-Latency V2V Communications , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[25]  Mate Boban,et al.  Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications Outside Coverage , 2018, 2018 IEEE Vehicular Networking Conference (VNC).

[26]  Mai Vu,et al.  Load Balancing User Association in Millimeter Wave MIMO Networks , 2018, IEEE Transactions on Wireless Communications.

[27]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[28]  N. Sidiropoulos,et al.  Learning to Optimize: Training Deep Neural Networks for Interference Management , 2017, IEEE Transactions on Signal Processing.

[29]  Theodore S. Rappaport,et al.  Millimeter-Wave Cellular Wireless Networks: Potentials and Challenges , 2014, Proceedings of the IEEE.

[30]  Adam Wolisz,et al.  QoE-Based Low-Delay Live Streaming Using Throughput Predictions , 2016, ACM Trans. Multim. Comput. Commun. Appl..

[31]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[32]  Mérouane Debbah,et al.  User Association and Load Balancing for Massive MIMO through Deep Learning , 2018, 2018 52nd Asilomar Conference on Signals, Systems, and Computers.

[33]  Tiankui Zhang,et al.  Distributed Energy Efficient Fair User Association in Massive MIMO Enabled HetNets , 2015, IEEE Communications Letters.

[34]  Theodore S. Rappaport,et al.  Millimeter Wave Mobile Communications for 5G Cellular: It Will Work! , 2013, IEEE Access.

[35]  Ness B. Shroff,et al.  Efficient Beam Alignment in Millimeter Wave Systems Using Contextual Bandits , 2017, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[36]  Sundeep Rangan,et al.  Improved Handover Through Dual Connectivity in 5G mmWave Mobile Networks , 2016, IEEE Journal on Selected Areas in Communications.

[37]  Xianfu Chen,et al.  Deep Reinforcement Learning for Network Slicing , 2018, ArXiv.

[38]  Xianfu Chen,et al.  Wireless Edge Computing With Latency and Reliability Guarantees , 2019, Proceedings of the IEEE.

[39]  Alex M. Andrew,et al.  ROBOT LEARNING, edited by Jonathan H. Connell and Sridhar Mahadevan, Kluwer, Boston, 1993/1997, xii+240 pp., ISBN 0-7923-9365-1 (Hardback, 218.00 Guilders, $120.00, £89.95). , 1999, Robotica (Cambridge. Print).

[40]  Katharina Morik,et al.  Predicting next network cell IDs for moving users with Discriminative and Generative Models , 2012 .

[41]  Umair Sajid Hashmi,et al.  What user-cell association algorithms will perform best in mmWave massive MIMO ultra-dense HetNets? , 2017, 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC).

[42]  Hongzi Mao,et al.  Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[43]  Xuemin Shen,et al.  Connected Vehicles: Solutions and Challenges , 2014, IEEE Internet of Things Journal.

[44]  Zhanyu Ma,et al.  Vehicular Edge Computing via Deep Reinforcement Learning , 2019, ArXiv.

[45]  Matthias Hein,et al.  Variants of RMSProp and Adagrad with Logarithmic Regret Bounds , 2017, ICML.

[46]  Ekram Hossain,et al.  Downlink Performance of Cellular Systems With Base Station Sleeping, User Association, and Scheduling , 2014, IEEE Transactions on Wireless Communications.

[47]  Kishor S. Trivedi,et al.  Performance Evaluation for DSRC Vehicular Safety Communication , 2011 .

[48]  Robert Babuska,et al.  A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[49]  Andreas F. Molisch,et al.  Hybrid Beamforming for Massive MIMO: A Survey , 2017, IEEE Communications Magazine.

[50]  Ian D. Reid,et al.  Data-Driven Approximations to NP-Hard Problems , 2017, AAAI.

[51]  Lothar Kreft,et al.  A Novel Handover Prediction Scheme in Content Centric Networking Using Nonlinear Autoregressive Exogenous Model , 2013, 2013 IEEE 77th Vehicular Technology Conference (VTC Spring).