Artificial Intelligence-Based Handoff Management for Dense WLANs: A Deep Reinforcement Learning Approach

So far, the handoff management involved in the wireless local area network (WLAN) has mainly fallen into the handoff mechanism and the decision algorithm. The traditional handoff mechanism generates noticeable delays during the handoff process, resulting in discontinuity of service, which is more evident in dense WLANs. Inspired by software-defined networking (SDN), prior works put forward many seamless handoff mechanisms to ensure service continuity. With respect to the handoff decision algorithm, when to trigger handoff and which access point to reconnect to, however, are still tricky problems. In this paper, we first design a self-learning architecture applicable to the SDN-based WLAN frameworks. Along with it, we propose DCRQN, a novel handoff management scheme based on deep reinforcement learning, specifically deep $Q$ -network. The proposed scheme enables the network to learn from actual users’ behaviors and network status from scratch, adapting its learning in time-varying dense WLANs. Due to the temporal correlation property, the handoff decision is modeled as the Markov decision process (MDP). In the modeled MDP, the proposed scheme depends on the real-time network statistics at the time of decisions. Moreover, the convolutional neural network and the recurrent neural network are leveraged to extract fine-grained discriminative features. The numerical results through simulation demonstrate that DCRQN can effectively improve the data rate during the handoff process, outperforming the traditional handoff scheme.

[1]  William A. Arbaugh,et al.  An empirical analysis of the IEEE 802.11 MAC layer handoff process , 2003, CCRV.

[2]  Kobi Cohen,et al.  Deep Multi-User Reinforcement Learning for Dynamic Spectrum Access in Multichannel Wireless Networks , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[3]  Andrzej Duda,et al.  Multichannel Virtual Access Points for Seamless Handoffs in IEEE 802.11 Wireless Networks , 2011, 2011 IEEE 73rd Vehicular Technology Conference (VTC Spring).

[4]  M. Moghaddam,et al.  Inverse Scattering Using a Joint $L1-L2$ Norm-Based Regularization , 2016, IEEE Transactions on Antennas and Propagation.

[5]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[6]  Xiaoli Chu,et al.  Coexistence of Wi-Fi and heterogeneous small cell networks sharing unlicensed spectrum , 2015, IEEE Communications Magazine.

[7]  Jianfeng Gao,et al.  Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.

[8]  Michal Ries,et al.  Common SDN control channel for seamless handover in 802.11 , 2018, 2018 Wireless Days (WD).

[9]  K. Tsukamoto,et al.  Impact of layer 2 behavior on TCP performance in WLAN , 2005, VTC-2005-Fall. 2005 IEEE 62nd Vehicular Technology Conference, 2005..

[10]  Yi Zhang,et al.  Spectrum Monitoring for Radar Bands Using Deep Convolutional Neural Networks , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[11]  Chung-Ju Chang,et al.  Fuzzy Q-Learning Admission Control for WCDMA/WLAN Heterogeneous Networks with Multimedia Traffic , 2009, IEEE Transactions on Mobile Computing.

[12]  Xiangming Wen,et al.  QoE-Based Reduction of Handover Delay for Multimedia Application in IEEE 802.11 Networks , 2015, IEEE Communications Letters.

[13]  Luis Sequeira,et al.  Building an SDN Enterprise WLAN Based on Virtual APs , 2017, IEEE Communications Letters.

[14]  Adam Wolisz,et al.  Interference-aware decentralized access point selection policy for multi-rate IEEE 802.11 Wireless LANs , 2008, 2008 IEEE 19th International Symposium on Personal, Indoor and Mobile Radio Communications.

[15]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[16]  F.L. Lewis,et al.  Reinforcement learning and adaptive dynamic programming for feedback control , 2009, IEEE Circuits and Systems Magazine.

[17]  Ana Galindo-Serrano,et al.  Distributed Q-Learning for Aggregated Interference Control in Cognitive Radio Networks , 2010, IEEE Transactions on Vehicular Technology.

[18]  John M. Cioffi,et al.  Dynamic handoff decision in heterogeneous wireless systems: Q-learning approach , 2012, 2012 IEEE International Conference on Communications (ICC).

[19]  Yuji Oie,et al.  Experimental Evaluation of Decision Criteria for WLAN Handover: Signal Strength and Frame Retransmission , 2006, UCS.

[20]  Shyam Lal,et al.  Coverage Analysis of Handoff Algorithm with Adaptive Hysteresis Margin , 2007 .

[21]  Xiaoguang Zhao,et al.  A fast handoff scheme for ieee 802.11 networks using software defined networking , 2016, 2016 19th International Symposium on Wireless Personal Multimedia Communications (WPMC).

[22]  Wonyong Yoon,et al.  Design of scalable SDN based eMBMS/WLAN network architecture assisted by Fog Computing , 2018, 2018 International Conference on Information and Communication Technology Convergence (ICTC).

[23]  Gang Cao,et al.  Handoff management scheme based on frame loss rate and RSSI prediction for IEEE 802.11 networks , 2016, 2016 International Symposium on Wireless Communication Systems (ISWCS).

[24]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[25]  William A. Arbaugh,et al.  Improving the latency of 802.11 hand-offs using neighbor graphs , 2004, MobiSys '04.

[26]  Xing Zhao,et al.  SWAN: An SDN based campus WLAN framework , 2014, 2014 4th International Conference on Wireless Communications, Vehicular Technology, Information Theory and Aerospace & Electronic Systems (VITAE).

[27]  Iqbal Gondal,et al.  Combined SINR Based Vertical Handoff Algorithm for Next Generation Heterogeneous Wireless Networks , 2007, IEEE GLOBECOM 2007 - IEEE Global Telecommunications Conference.

[28]  Yoshua Bengio,et al.  On the Expressive Power of Deep Architectures , 2011, ALT.

[29]  John N. Tsitsiklis,et al.  Asynchronous Stochastic Approximation and Q-Learning , 1994, Machine Learning.

[30]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 2005, IEEE Transactions on Neural Networks.

[31]  Xi Chen,et al.  HaND: Fast Handoff with Null Dwell Time for IEEE 802.11 Networks , 2010, 2010 Proceedings IEEE INFOCOM.

[32]  Weidong Wang,et al.  Deep Reinforcement Learning Based Dynamic Channel Allocation Algorithm in Multibeam Satellite Systems , 2018, IEEE Access.