A Learning Approach to Frequent Handover Mitigations in 3GPP Mobility Protocols

The industry standard 3GPP mobility solutions are analyzed through the lens of bandit learning theory. In particular, it is shown that the original 3GPP handover protocol, developed primarily from a radio frequency and load balancing perspective, can be viewed as a special case of the $\epsilon$-greedy bandit algorithm, and thus its sub- optimality can be characterized via the regret analysis. Inspired by the equivalence between 3GPP handover protocols and bandit algorithms, we rigorously analyze the performance of cell range expansion in 3GPP handover enhancement, and further propose a learning-based approach to address the frequent handover (FHO) challenges in ultra-dense networks. The key component is to explicitly consider the handover cost to discourage FHOs. Rather surprisingly, we prove that the bandit-inspired scheme with handover cost can be viewed as an enhancement to the simple sticky biasing solution in 3GPP that has been developed to partially address the FHO problem, and hence lay a theoretic foundation to this industrial intuition.

[1]  Ismail Güvenç,et al.  Context-aware mobility management in HetNets: A reinforcement learning approach , 2015, 2015 IEEE Wireless Communications and Networking Conference (WCNC).

[2]  Cong Shen,et al.  A Non-Stochastic Learning Approach to Energy Efficient Mobility Management , 2016, IEEE Journal on Selected Areas in Communications.

[3]  Cong Shen,et al.  User association with lighting constraints in visible light communication systems , 2016, 2016 Annual Conference on Information Science and Systems (CISS).

[4]  Holger Claussen,et al.  Towards 1 Gbps/UE in Cellular Systems: Understanding Ultra-Dense Small Cell Deployments , 2015, IEEE Communications Surveys & Tutorials.

[5]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[6]  Tomoaki Ohtsuki,et al.  Cell range expansion using distributed Q-learning in heterogeneous networks , 2013, 2013 IEEE 78th Vehicular Technology Conference (VTC Fall).

[7]  Jeffrey G. Andrews,et al.  An overview of load balancing in hetnets: old myths and open problems , 2013, IEEE Wireless Communications.

[8]  Jeffrey G. Andrews,et al.  User Association for Load Balancing in Heterogeneous Cellular Networks , 2012, IEEE Transactions on Wireless Communications.

[9]  Sébastien Bubeck,et al.  Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11]  Stefania Sesia,et al.  LTE - The UMTS Long Term Evolution, Second Edition , 2011 .