A Non-Stationary Online Learning Approach to Mobility Management

Efficient mobility management is an important problem in modern wireless networks with heterogeneous cell sizes and increased nodes densities. We show that optimization- based mobility protocols cannot achieve long-term optimal performance, particularly in a time-varying environment for ultra-dense networks. To address the complex system dynamics, especially the possible change of statistics due to user movement and environment changes, we propose piece-wise stationary online-learning algorithms to track the activities of small base stations and solve frequent handover (FHO) problems. The BASD/BASSW algorithms are proved to achieve sublinear regret performance in finite time horizon and a linear, non-trivial rigorous bound for infinite time horizon. We study the robustness of the BASD/BASSW algorithms under missing feedback. Simulations show that proposed algorithms can outperform 3GPP protocols with the best threshold, and tend to be more robust than 3GPP to various dynamics which are common in practical ultra- dense wireless networks.

[1]  Ismail Güvenç,et al.  Context-aware mobility management in HetNets: A reinforcement learning approach , 2015, 2015 IEEE Wireless Communications and Networking Conference (WCNC).

[2]  Qing Zhao,et al.  Time-varying stochastic multi-armed bandit problems , 2014, 2014 48th Asilomar Conference on Signals, Systems and Computers.

[3]  Cong Shen,et al.  A Learning Approach to Frequent Handover Mitigations in 3GPP Mobility Protocols , 2017, 2017 IEEE Wireless Communications and Networking Conference (WCNC).

[4]  D. Teneketzis,et al.  Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost , 1988 .

[5]  Aurélien Garivier,et al.  On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems , 2008, 0805.3415.

[6]  Jin Yang,et al.  Evolved Universal Terrestrial Radio Access Network (EUTRAN) , 2017 .

[7]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 1985 .

[8]  Tony Q. S. Quek,et al.  Small Cell Networks: Deployment, PHY Techniques, and Resource Management , 2013 .

[9]  Cong Shen,et al.  A Non-Stochastic Learning Approach to Energy Efficient Mobility Management , 2016, IEEE Journal on Selected Areas in Communications.

[10]  Jeffrey G. Andrews,et al.  User Association for Load Balancing in Heterogeneous Cellular Networks , 2012, IEEE Transactions on Wireless Communications.

[11]  Luis Alonso,et al.  Energy-efficient context-aware user association for outdoor small cell heterogeneous networks , 2014, 2014 IEEE International Conference on Communications (ICC).

[12]  Sébastien Bubeck,et al.  Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..

[13]  Cong Shen,et al.  User association with lighting constraints in visible light communication systems , 2016, 2016 Annual Conference on Information Science and Systems (CISS).