Second-Best Beam-Alignment via Bayesian Multi-Armed Bandits

Millimeter-wave (mm-wave) systems rely on narrow-beams to cope with the severe signal attenuation in the mm-wave frequency band. However, susceptibility to beam mis- alignment due to mobility or blockage requires the use of beam-alignment schemes, with huge cost in terms of overhead and use of system resources. In this paper, a beam-alignment scheme is proposed based on Bayesian multi-armed bandits, with the goal to maximize the alignment probability and the data-communication throughput. A Bayesian approach is proposed, by considering the state as a posterior distribution over angles of arrival (AoA) and of departure (AoD), given the history of feedback signaling and of beam pairs scanned by the base- station (BS) and the user-end (UE). A simplified sufficient statistic for optimal control is identified, in the form of preference of BS-UE beam pairs. By bounding a value function, the second-best preference policy is formulated, which strikes an optimal balance between exploration and exploitation by selecting the beam pair with the current second-best preference. Through Monte-Carlo simulation with analog beamforming, the superior performance of the second-best preference policy is demonstrated in comparison to existing schemes based on first-best preference, linear Thompson sampling, and upper confidence bounds, with up to 7%, 10% and 30% improvements in alignment probability, respectively.

[1]  Nicolò Michelusi,et al.  Throughput optimal beam alignment in millimeter wave networks , 2017, 2017 Information Theory and Applications Workshop (ITA).

[2]  Philippe J. Sartori,et al.  Initial beamforming for mmWave communications , 2014, 2014 48th Asilomar Conference on Signals, Systems and Computers.

[3]  Nicolò Michelusi,et al.  Multi-User Beam-Alignment for Millimeter-Wave Networks , 2018, 2018 Information Theory and Applications Workshop (ITA).

[4]  Sundeep Rangan,et al.  Initial Access in Millimeter Wave Cellular Systems , 2015, IEEE Transactions on Wireless Communications.

[5]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[6]  Nicolò Michelusi,et al.  Optimal Beam-Sweeping and Communication in Mobile Millimeter-Wave Networks , 2018, 2018 IEEE International Conference on Communications (ICC).

[7]  Jeffrey G. Andrews,et al.  Design and Analysis of Initial Access in Millimeter Wave Cellular Networks , 2016, IEEE Transactions on Wireless Communications.

[8]  Theodore S. Rappaport,et al.  Millimeter Wave Channel Modeling and Cellular Capacity Evaluation , 2013, IEEE Journal on Selected Areas in Communications.

[9]  Jeongho Park,et al.  Random access in millimeter-wave beamforming cellular networks: issues and approaches , 2015, IEEE Communications Magazine.

[10]  Michael D. Zoltowski,et al.  Multi-Resolution Codebook and Adaptive Beamforming Sequence Design for Millimeter Wave Beam Alignment , 2017, IEEE Transactions on Wireless Communications.

[11]  Ness B. Shroff,et al.  Efficient Beam Alignment in Millimeter Wave Systems Using Contextual Bandits , 2017, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[12]  Theodore S. Rappaport,et al.  Millimeter Wave Wireless Communications , 2014 .

[13]  Nicolo Michelusi,et al.  Energy-Efficient Interactive Beam Alignment for Millimeter-Wave Networks , 2018, IEEE Transactions on Wireless Communications.

[14]  Sung-En Chiu,et al.  Active Learning and CSI Acquisition for mmWave Initial Alignment , 2018, IEEE Journal on Selected Areas in Communications.

[15]  Nicolò Michelusi,et al.  Coded Energy-Efficient Beam-Alignment for Millimeter-Wave Networks , 2018, 2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[16]  Desmond P. Taylor,et al.  A Statistical Model for Indoor Multipath Propagation , 2007 .

[17]  Robert W. Heath,et al.  The Impact of Beamwidth on Temporal Channel Variation in Vehicular Channels and Its Implications , 2015, IEEE Transactions on Vehicular Technology.