Reinforcement learning based predictive handover for pedestrian-aware mmWave networks

This paper discusses the optimal decision-making for predictive handover in millimeter-wave (mmWave) communication networks using information of pedestrian movement. In mmWave communication networks, human blockage causes significant performance degradation. Hence, to maximize the throughput, it might be important to perform a handover predictively using information such as location and velocity of a pedestrian. To optimize the timing to perform the predictive handover, this paper presents a reinforcement learning framework. The important point in this framework is learning the optimal handover policy maximizing the future throughput expected under the locations and velocities of a pedestrian. To learn the optimal policy, this paper applies Q-learning. The numerical results demonstrate that the learned handover decisions outperform the heuristic handover decisions in terms of throughput performance.

[1]  Sundeep Rangan,et al.  An MDP model for optimal handover decisions in mmWave cellular networks , 2015, 2016 European Conference on Networks and Communications (EuCNC).

[2]  Teruyuki Miyajima,et al.  Feasibility of RSSI based access network detection for multi-band WLAN using 2.4/5GHz and 60GHz , 2014, 2014 International Symposium on Wireless Personal Multimedia Communications (WPMC).

[3]  Zhisheng Niu,et al.  Improving network throughput in 60GHz WLANs via multi-AP diversity , 2012, 2012 IEEE International Conference on Communications (ICC).

[4]  Vincent W. S. Wong,et al.  An MDP-Based Vertical Handoff Decision Algorithm for Heterogeneous Wireless Networks , 2008, IEEE Transactions on Vehicular Technology.

[5]  Kei Sakaguchi,et al.  Millimeter-wave Wireless LAN and its Extension toward 5G Heterogeneous Networks , 2015, IEICE Trans. Commun..

[6]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7]  Masahiro Morikura,et al.  Proactive Base Station Selection Based on Human Blockage Prediction Using RGB-D Cameras for mmWave Communications , 2014, GLOBECOM 2014.

[8]  Katsuyuki Haneda,et al.  Channel Models and Beamforming at Millimeter-Wave Frequency Bands , 2015, IEICE Trans. Commun..

[9]  Masahiro Morikura,et al.  Proactive Handover Based on Human Blockage Prediction Using RGB-D Cameras for mmWave Communications , 2016, IEICE Trans. Commun..

[10]  Masahiro Morikura,et al.  Implementation and evaluation of reactive base station selection for human blockage in mmWave communications , 2015, 2015 21st Asia-Pacific Conference on Communications (APCC).

[11]  Injong Rhee,et al.  On the levy-walk nature of human mobility , 2011, TNET.

[12]  John M. Cioffi,et al.  Dynamic handoff decision in heterogeneous wireless systems: Q-learning approach , 2012, 2012 IEEE International Conference on Communications (ICC).

[13]  Masahiro Morikura,et al.  Proactive traffic control based on human blockage prediction using RGBD cameras for millimeter-wave communications , 2015, 2015 12th Annual IEEE Consumer Communications and Networking Conference (CCNC).

[14]  Ting Wang,et al.  A twin cylinder model for moving human body shadowing in 60GHz WLAN , 2015, 2015 21st Asia-Pacific Conference on Communications (APCC).

[15]  Athanasios V. Vasilakos,et al.  A survey of millimeter wave communications (mmWave) for 5G: opportunities and challenges , 2015, Wireless Networks.