Beam Management Solution Using Q-Learning Framework

The beam management is a procedure that properly selects the beams to overcome strong path loss attenuation. This procedure is specially important in 5G-NR (new radio) deployments that operate in millimeter waves frequencies. We propose a novel beam tracking solution that is based on a reinforcement learning framework. More specifically, using the Q-learning algorithm, the user equipments learn over time the best set of beams to maximize their own signal-to-noise to interference ratio. Our framework takes into account reference signals in the 5G technical specification and propose a measurement protocol to implement the Q-learning. The proposed method shows better spectral efficiency than the beam sweeping technique for the multi-user MIMO case.

[1]  Ahmed Alkhateeb,et al.  Deep Reinforcement Learning for 5G Networks: Joint Beamforming, Power Control, and Interference Coordination , 2019, IEEE Transactions on Communications.

[2]  Csaba Szepesvári,et al.  Algorithms for Reinforcement Learning , 2010, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[3]  Ying Li,et al.  Deep Learning Coordinated Beamforming for Highly-Mobile Millimeter Wave Systems , 2018, IEEE Access.

[4]  Robert W. Heath,et al.  Millimeter-Wave Vehicular Communication to Support Massive Automotive Sensing , 2016, IEEE Communications Magazine.

[5]  Robert W. Heath,et al.  Channel Estimation and Hybrid Precoding for Millimeter Wave Cellular Systems , 2014, IEEE Journal of Selected Topics in Signal Processing.

[6]  Der-Jiunn Deng,et al.  Latency-Optimal mmWave Radio Access for V2X Supporting Next Generation Driving Use Cases , 2019, IEEE Access.

[7]  Joerg Widmer,et al.  Millimeter-Wave Beam Training Acceleration Through Low-Complexity Hybrid Transceivers , 2017, IEEE Transactions on Wireless Communications.

[8]  Erik G. Ström,et al.  Location-aided mm-wave channel estimation for vehicular communication , 2016, 2016 IEEE 17th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[9]  Theodore S. Rappaport,et al.  3-D Millimeter-Wave Statistical Channel Model for 5G Wireless System Design , 2016, IEEE Transactions on Microwave Theory and Techniques.

[10]  Michael D. Zoltowski,et al.  Multi-Resolution Codebook and Adaptive Beamforming Sequence Design for Millimeter Wave Beam Alignment , 2017, IEEE Transactions on Wireless Communications.

[11]  Arnab Roy,et al.  A Tutorial on Beam Management for 3GPP NR at mmWave Frequencies , 2018, IEEE Communications Surveys & Tutorials.

[12]  Upamanyu Madhow,et al.  Compressive adaptation of large steerable arrays , 2012, 2012 Information Theory and Applications Workshop.

[13]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[14]  Robert W. Heath,et al.  Inverse Multipath Fingerprinting for Millimeter Wave V2I Beam Alignment , 2017, IEEE Transactions on Vehicular Technology.

[15]  Stefan Parkvall,et al.  5G New Radio: Unveiling the Essentials of the Next Generation Wireless Access Technology , 2018, IEEE Communications Standards Magazine.