Deep Reinforcement Learning-Based Channel Allocation for Wireless LANs With Graph Convolutional Networks

For densely deployed wireless local area networks (WLANs), this paper proposes a deep reinforcement learning-based channel allocation scheme that enables the efficient use of experience. The central idea is that an objective function is modeled relative to communication quality as a parametric function of a pair of observed topologies and channels. This is because communication quality in WLANs is significantly influenced by the carrier sensing relationship between access points. The features of the proposed scheme can be summarized by two points. First, we adopt graph convolutional layers in the model to extract the features of the channel vectors with topology information, which is the adjacency matrix of the graph dependent on the carrier sensing relationships. Second, we filter experiences to reduce the duplication of data for learning, which can often adversely influence the generalization performance. Because fixed experiences tend to be repeatedly observed in WLAN channel allocation problems, the duplication of experiences must be avoided. The simulation results demonstrate that the proposed method enables the allocation of channels in densely deployed WLANs such that the system throughput increases. Moreover, improved channel allocation, compared to other existing methods, is achieved in terms of the system throughput. Furthermore, compared to the immediate reward maximization method, the proposed method successfully achieves greater reward channel allocation or realizes the optimal channel allocation while reducing the number of changes.

[1]  Samy Bengio,et al.  A Study on Overfitting in Deep Reinforcement Learning , 2018, ArXiv.

[2]  A. Raschellà,et al.  Evaluation of Channel Assignment Algorithms in a Dense Real World WLAN , 2019, 2019 4th International Conference on Computing, Communications and Security (ICCCS).

[3]  Jure Leskovec,et al.  Graph Convolutional Neural Networks for Web-Scale Recommender Systems , 2018, KDD.

[4]  George Karypis,et al.  Frequent subgraph discovery , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[5]  Ying-Chang Liang,et al.  Applications of Deep Reinforcement Learning in Communications and Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[6]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[7]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[8]  Dahua Lin,et al.  Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition , 2018, AAAI.

[9]  Kumpati S. Narendra,et al.  Learning automata - an introduction , 1989 .

[10]  Koji Yamamoto,et al.  A Comprehensive Survey of Potential Game Approaches to Wireless Networks , 2015, IEICE Trans. Commun..

[11]  Petteri Kaski,et al.  Engineering an Efficient Canonical Labeling Tool for Large and Sparse Graphs , 2007, ALENEX.

[12]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[13]  M. Haidar,et al.  Channel assignment in an IEEE 802.11 WLAN based on Signal-To-Interference Ratio , 2008, 2008 Canadian Conference on Electrical and Computer Engineering.

[14]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[15]  Joan Bruna,et al.  Spectral Networks and Locally Connected Networks on Graphs , 2013, ICLR.

[16]  Ah Chung Tsoi,et al.  The Graph Neural Network Model , 2009, IEEE Transactions on Neural Networks.

[17]  Behrouz Shahgholi Ghahfarokhi,et al.  Distributed QoE-aware channel assignment algorithms for IEEE 802.11 WLANs , 2015, Wirel. Networks.

[18]  Ekram Hossain,et al.  Channel assignment schemes for infrastructure-based 802.11 WLANs: A survey , 2010, IEEE Communications Surveys & Tutorials.

[19]  Patrick Thiran,et al.  Self-Organization Properties of CSMA/CA Systems and Their Consequences on Fairness , 2009, IEEE Transactions on Information Theory.

[20]  L. Shapley,et al.  Potential Games , 1994 .

[21]  Robert Babuska,et al.  Experience Replay for Real-Time Reinforcement Learning Control , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[22]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[23]  Graham Kendall,et al.  Artificial immune algorithm in solving the channel assignment task , 2014, 2014 IEEE International Conference on Control System, Computing and Engineering (ICCSCE 2014).

[24]  Gustavo de Veciana,et al.  Site-Specific Knowledge and Interference Measurement for Improving Frequency Allocations in Wireless Networks , 2009, IEEE Transactions on Vehicular Technology.

[25]  Patrick Bosch,et al.  A Machine Learning Approach for IEEE 802.11 Channel Allocation , 2018, 2018 14th International Conference on Network and Service Management (CNSM).

[26]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[27]  Bhaskar Krishnamachari,et al.  Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks , 2018, IEEE Transactions on Cognitive Communications and Networking.

[28]  Xu Chen,et al.  To Bond or Not to Bond: An Optimal Channel Allocation Algorithm for Flexible Dynamic Channel Bonding in WLANs , 2017, 2017 IEEE 86th Vehicular Technology Conference (VTC-Fall).

[29]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[30]  Petri Mähönen,et al.  Performance Evaluation of Automatic Channel Assignment Mechanism for IEEE 802.11 Based on Graph Colouring , 2006, 2006 IEEE 17th International Symposium on Personal, Indoor and Mobile Radio Communications.

[31]  Joan Bruna,et al.  Deep Convolutional Networks on Graph-Structured Data , 2015, ArXiv.

[32]  Seetha Hari,et al.  Learning From Imbalanced Data , 2019, Advances in Computer and Electrical Engineering.

[33]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[34]  Tom Schaul,et al.  Prioritized Experience Replay , 2015, ICLR.

[35]  Alagan Anpalagan,et al.  Opportunistic Spectrum Access in Cognitive Radio Networks: Global Optimization Using Local Interaction Games , 2012, IEEE Journal of Selected Topics in Signal Processing.

[36]  John E. R. Staddon,et al.  The dynamics of behavior: Review of Sutton and Barto: Reinforcement Learning : An Introduction (2 nd ed.) , 2020 .