End-to-End Learning of Proactive Handover Policy for Camera-Assisted mmWave Networks Using Deep Reinforcement Learning

For mmWave networks, this paper proposes an image-to-decision proactive handover framework, which directly maps camera images to a handover decision. With the help of camera images, the proposed framework enables a proactive handover, i.e., a handover is triggered before a temporal variation in the received power induced by obstacles even if the variation is extremely rapid such that it cannot be predicted from a time series of received power. Furthermore, direct mapping allows scalability for the number of obstacles. This paper shows that optimal mapping is learned via deep reinforcement learning (RL) by proving that the decision process in our proposed framework is a Markov decision process. While performing deep RL, this paper designs a neural network (NN) architecture for a network controller to successfully learn the use of lower-dimensional observations in state information and higher-dimensional image observations. The evaluations based on experimentally obtained camera images and received powers indicate that the learned handover policy in the proposed framework outperforms the learned policy in a received power-based handover framework.

[1]  John M. Cioffi,et al.  Dynamic handoff decision in heterogeneous wireless systems: Q-learning approach , 2012, 2012 IEEE International Conference on Communications (ICC).

[2]  Tomoaki Ohtsuki,et al.  Q-learning cell selection for femtocell networks: Single- and multi-user case , 2012, 2012 IEEE Global Communications Conference (GLOBECOM).

[3]  Wenhua Jiao,et al.  Fast Handover Scheme for Real-Time Applications in Mobile WiMAX , 2007, 2007 IEEE International Conference on Communications.

[4]  Kei Sakaguchi,et al.  Millimeter-wave Wireless LAN and its Extension toward 5G Heterogeneous Networks , 2015, IEICE Trans. Commun..

[5]  Masahiro Morikura,et al.  Proactive Base Station Selection Based on Human Blockage Prediction Using RGB-D Cameras for mmWave Communications , 2014, GLOBECOM 2014.

[6]  Ying-Chang Liang,et al.  The SMART Handoff Policy for Millimeter Wave Heterogeneous Cellular Networks , 2018, IEEE Transactions on Mobile Computing.

[7]  Masahiro Morikura,et al.  Proactive Received Power Prediction Using Machine Learning and Depth Images for mmWave Networks , 2018, IEEE Journal on Selected Areas in Communications.

[8]  Athanasios V. Vasilakos,et al.  A survey of millimeter wave communications (mmWave) for 5G: opportunities and challenges , 2015, Wireless Networks.

[9]  Theodore S. Rappaport,et al.  A Flexible Millimeter-Wave Channel Sounder With Absolute Timing , 2017, IEEE Journal on Selected Areas in Communications.

[10]  Lingyang Song,et al.  Multi-gigabit millimeter wave wireless communications for 5G: from fixed access to cellular networks , 2014, IEEE Communications Magazine.

[11]  Tomoaki Ohtsuki,et al.  Adaptive Q-Learning Cell Selection Method for Open-Access Femtocell Networks: Multi-User Case , 2014, IEICE Trans. Commun..

[12]  Katsuyuki Haneda,et al.  Channel Models and Beamforming at Millimeter-Wave Frequency Bands , 2015, IEICE Trans. Commun..

[13]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[14]  Jörg Widmer,et al.  Boon and bane of 60 GHz networks: practical insights into beamforming, interference, and frame level operation , 2015, CoNEXT.

[15]  Branka Vucetic,et al.  Mobility handover optimization in millimeter wave heterogeneous networks , 2017, 2017 17th International Symposium on Communications and Information Technologies (ISCIT).

[16]  Rob Fergus,et al.  Depth Map Prediction from a Single Image using a Multi-Scale Deep Network , 2014, NIPS.

[17]  Masahiro Morikura,et al.  Proactive Handover Based on Human Blockage Prediction Using RGB-D Cameras for mmWave Communications , 2016, IEICE Trans. Commun..

[18]  Masahiro Morikura,et al.  Implementation and evaluation of reactive base station selection for human blockage in mmWave communications , 2015, 2015 21st Asia-Pacific Conference on Communications (APCC).

[19]  Sundeep Rangan,et al.  Improved Handover Through Dual Connectivity in 5G mmWave Mobile Networks , 2016, IEEE Journal on Selected Areas in Communications.

[20]  Masahiro Morikura,et al.  Reinforcement learning based predictive handover for pedestrian-aware mmWave networks , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[21]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[22]  Sundeep Rangan,et al.  An MDP model for optimal handover decisions in mmWave cellular networks , 2015, 2016 European Conference on Networks and Communications (EuCNC).

[23]  Laurent Dussopt,et al.  Millimeter-wave access and backhauling: the solution to the exponential data traffic increase in 5G mobile communications systems? , 2014, IEEE Communications Magazine.

[24]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[25]  T. W. Anderson,et al.  Statistical Inference about Markov Chains , 1957 .

[26]  Masahiro Morikura,et al.  Proactive traffic control based on human blockage prediction using RGBD cameras for millimeter-wave communications , 2015, 2015 12th Annual IEEE Consumer Communications and Networking Conference (CCNC).

[27]  Zhisheng Niu,et al.  Improving network throughput in 60GHz WLANs via multi-AP diversity , 2012, 2012 IEEE International Conference on Communications (ICC).

[28]  Ben-Jye Chang,et al.  Cross-Layer-Based Adaptive Vertical Handoff With Predictive RSS in Heterogeneous Wireless Networks , 2008, IEEE Transactions on Vehicular Technology.

[29]  Masahiro Morikura,et al.  Machine-Learning-Based Throughput Estimation Using Images for mmWave Communications , 2017, 2017 IEEE 85th Vehicular Technology Conference (VTC Spring).

[30]  Masahiro Morikura,et al.  Measurement Method of Temporal Attenuation by Human Body in Off-the-Shelf 60 GHz WLAN with HMM-Based Transmission State Estimation , 2018, Wirel. Commun. Mob. Comput..

[31]  Vincent W. S. Wong,et al.  An MDP-Based Vertical Handoff Decision Algorithm for Heterogeneous Wireless Networks , 2008, IEEE Transactions on Vehicular Technology.

[32]  Jianjun Wu,et al.  Cell selection in two-tier femtocell networks using Q-learning algorithm , 2014, 16th International Conference on Advanced Communication Technology.

[33]  Teruyuki Miyajima,et al.  Feasibility of RSSI based access network detection for multi-band WLAN using 2.4/5GHz and 60GHz , 2014, 2014 International Symposium on Wireless Personal Multimedia Communications (WPMC).