Efficient Beam Alignment in Millimeter Wave Systems Using Contextual Bandits

In this paper, we investigate the problem of beam alignment in millimeter wave (mmWave) systems, and design an optimal algorithm to reduce the overhead. Specifically, due to directional communications, the transmitter and receiver beams need to be aligned, which incurs high delay overhead since without a priori knowledge of the transmitter/receiver location, the search space spans the entire angular domain. This is further exacerbated under dynamic conditions (e.g., moving vehicles) where the access to the base station (access point) is highly dynamic with intermittent on-off periods, requiring more frequent beam alignment and signal training. To mitigate this issue, we consider an online stochastic optimization formulation where the goal is to maximize the directivity gain (i.e., received energy) of the beam alignment policy within a time period. We exploit the inherent correlation and unimodality properties of the model, and demonstrate that contextual information improves the performance. To this end, we propose an equivalent structured Multi-Armed Bandit model to optimally exploit the exploration-exploitation tradeoff. In contrast to the classical MAB models, the contextual information makes the lower bound on regret (i.e., performance loss compared with an oracle policy) independent of the number of beams. This is a crucial property since the number of all combinations of beam patterns can be large in transceiver antenna arrays, especially in massive MIMO systems. We further provide an asymptotically optimal beam alignment algorithm, and investigate its performance via simulations.

[1]  Ness B. Shroff,et al.  Energy-Efficient Power and Bandwidth Allocation in an Integrated Sub-6 GHz - Millimeter Wave System , 2017, ArXiv.

[2]  Ness B. Shroff,et al.  Out-of-Band Millimeter Wave Beamforming and Communications to Achieve Low Latency and High Energy Efficiency in 5G Systems , 2018, IEEE Transactions on Communications.

[3]  Piotr Indyk,et al.  Agile Millimeter Wave Networks with Provable Guarantees , 2017, ArXiv.

[4]  Philippe J. Sartori,et al.  Initial beamforming for mmWave communications , 2014, 2014 48th Asilomar Conference on Signals, Systems and Computers.

[5]  Ben Y. Zhao,et al.  Demystifying 60GHz outdoor picocells , 2014, MobiCom.

[6]  H. Robbins Some aspects of the sequential design of experiments , 1952 .

[7]  Sébastien Bubeck,et al.  Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..

[8]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 2022 .

[9]  Shie Mannor,et al.  Unimodal Bandits , 2011, ICML.

[10]  Alexandre Proutière,et al.  Unimodal Bandits: Regret Lower Bounds and Optimal Algorithms , 2014, ICML.

[11]  Pei Liu,et al.  Directional Cell Discovery in Millimeter Wave Cellular Networks , 2014, IEEE Transactions on Wireless Communications.

[12]  James V. Krogmeier,et al.  Millimeter Wave Beamforming for Wireless Backhaul and Access in Small Cell Networks , 2013, IEEE Transactions on Communications.

[13]  Liang Zhou,et al.  Efficient codebook-based MIMO beamforming for millimeter-wave WLANs , 2012, 2012 IEEE 23rd International Symposium on Personal, Indoor and Mobile Radio Communications - (PIMRC).

[14]  Robert W. Heath,et al.  Millimeter Wave Beam-Selection Using Out-of-Band Spatial Information , 2017, IEEE Transactions on Wireless Communications.

[15]  Alexandre Proutière,et al.  Dynamic Rate and Channel Selection in Cognitive Radio Systems , 2014, IEEE Journal on Selected Areas in Communications.

[16]  Jaspreet Singh,et al.  On the Feasibility of Codebook-Based Beamforming in Millimeter Wave Systems With Multiple Antenna Arrays , 2015, IEEE Transactions on Wireless Communications.

[17]  Eric W. Cope,et al.  Regret and Convergence Bounds for a Class of Continuum-Armed Bandit Problems , 2009, IEEE Transactions on Automatic Control.

[18]  Marcello Restelli,et al.  Unimodal Thompson Sampling for Graph-Structured Arms , 2017, AAAI.

[19]  Yinan Qi,et al.  Coordinated initial access in millimetre wave standalone networks , 2016, 2016 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[20]  R. Munos,et al.  Kullback–Leibler upper confidence bounds for optimal sequential allocation , 2012, 1210.1136.

[21]  Jörg Widmer,et al.  Steering with eyes closed: Mm-Wave beam steering without in-band measurement , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[22]  Alexandre Proutière,et al.  Optimal Rate Sampling in 802.11 systems , 2013, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.