Fuzzy Reinforcement Learning for Robust Spectrum Access in Dynamic Shared Networks

The persistent increases of wireless terminals have brought about diverse shared networks, where robust and efficient spectrum reuse among heterogeneous users is of critical importance while still remains as a challenging task for practical application. In this paper, we study the problem of robust spectrum access (RSA) in a canonical wireless shared network (WSN) with fully considering the inherent dynamics of the wireless environment. The non-static features of WSNs result in uncertain channel state information (CSI) and complicated coupling interference, which can’t be directly formulated as the well-accepted crisp game model, rendering most existing perfect CSI relied approaches inefficient or even unfeasible. To address this, by interpreting the estimated CSI with uncertainty as fuzzy number, a novel framework referred to as a non-cooperative fuzzy game (NC-FG) is adopted, whereby the user utility is mapped as a fuzzy value via the user-defined fuzzy utility function. Based on the derived property of the NC-FG that fuzzy Nash equilibrium (FNE) exists, a fuzzy-logic inspired reinforcement learning (FLRL) algorithm is proposed to achieve the FNE solutions of the constructed NC-FG to obtain the RSA in dynamic WSN, with which both the iterative learning and decision making procedures are implemented in a fuzzy-space, thus the sensitiveness of our scheme to environmental variations is alleviated. Finally, numerical simulations are provided to demonstrate the convergence, effectiveness, and superiority of our proposed FLRL algorithm in dynamic WSNs.

[1]  Arumugam Nallanathan,et al.  Spectrum Detection and Link Quality Assessment for Heterogeneous Shared Access Networks , 2019, IEEE Transactions on Vehicular Technology.

[2]  Arumugam Nallanathan,et al.  Deep Sensing for Future Spectrum and Location Awareness 5G Communications , 2015, IEEE Journal on Selected Areas in Communications.

[3]  Hans-Jürgen Zimmermann,et al.  Fuzzy Set Theory - and Its Applications , 1985 .

[4]  Bin Li,et al.  Learning-Based Spectrum Sharing and Spatial Reuse in mm-Wave Ultradense Networks , 2018, IEEE Transactions on Vehicular Technology.

[5]  Alagan Anpalagan,et al.  Advanced spectrum sharing in 5G cognitive heterogeneous networks , 2016, IEEE Wireless Communications.

[6]  Mugen Peng,et al.  Deep Reinforcement Learning-Based Mode Selection and Resource Management for Green Fog Radio Access Networks , 2018, IEEE Internet of Things Journal.

[7]  Alireza Chakeri,et al.  Fuzzy Nash Equilibriums in Crisp and Fuzzy Games , 2013, IEEE Transactions on Fuzzy Systems.

[8]  Wei Zhang,et al.  Spectrum Sharing for Drone Networks , 2017, IEEE Journal on Selected Areas in Communications.

[9]  Moussa Larbani,et al.  Existence of equilibrium solution for a non-cooperative game with fuzzy goals and parameters , 2008, Fuzzy Sets Syst..

[10]  Alagan Anpalagan,et al.  Dynamic Spectrum Access in Time-Varying Environment: Distributed Learning Beyond Expectation Optimization , 2015, IEEE Transactions on Communications.

[11]  H. Kushner,et al.  Stochastic Approximation and Recursive Algorithms and Applications , 2003 .

[12]  Xiaohu Ge,et al.  User Mobility Evaluation for 5G Small Cell Networks Based on Individual Mobility Model , 2015, IEEE Journal on Selected Areas in Communications.

[13]  Moussa Larbani,et al.  Non cooperative fuzzy games in normal form: A survey , 2009, Fuzzy Sets Syst..

[14]  Zeshui Xu,et al.  A least deviation method to obtain a priority vector of a fuzzy preference relation , 2005, Eur. J. Oper. Res..

[15]  Jee-Hyong Lee,et al.  A method for ranking fuzzy numbers and its application to decision-making , 1999, IEEE Trans. Fuzzy Syst..

[16]  Xiaofan Li,et al.  Deep Sensing for Space-Time Doubly Selective Channels: When a Primary User Is Mobile and the Channel Is Flat Rayleigh Fading , 2016, IEEE Transactions on Signal Processing.

[17]  Beatriz Lorenzo,et al.  User-Centric Distributed Spectrum Sharing in Dynamic Network Architectures , 2019, IEEE/ACM Transactions on Networking.

[18]  Zhang Qiang,et al.  Nash equilibrium strategy for fuzzy non-cooperative games , 2011, Fuzzy Sets Syst..

[19]  Deng-Feng Li,et al.  An Effective Methodology for Solving Matrix Games With Fuzzy Payoffs , 2013, IEEE Transactions on Cybernetics.

[20]  Geoffrey Ye Li,et al.  A Framework for Co-Channel Interference and Collision Probability Tradeoff in LTE Licensed-Assisted Access Networks , 2016, IEEE Transactions on Wireless Communications.

[21]  Yaoqing Yang,et al.  Spectrum Reuse Ratio in 5G Cellular Networks: A Matrix Graph Approach , 2017, IEEE Transactions on Mobile Computing.

[22]  Jianchao Zheng,et al.  QoE Driven Decentralized Spectrum Sharing in 5G Networks: Potential Game Approach , 2017, IEEE Transactions on Vehicular Technology.

[23]  Klaus Moessner,et al.  Licensed Spectrum Sharing Schemes for Mobile Operators: A Survey and Outlook , 2016, IEEE Communications Surveys & Tutorials.

[24]  Lajos Hanzo,et al.  Synergistic spectrum sharing in 5G HetNets: A harmonized SDN-enabled approach , 2016, IEEE Communications Magazine.

[25]  Shaowei Wang,et al.  Clustering-Based Spectrum Sharing Strategy for Cognitive Radio Networks , 2017, IEEE Journal on Selected Areas in Communications.

[26]  A. Lozano,et al.  What Will 5 G Be ? , 2014 .

[27]  Tim Clarke,et al.  Distributed Heuristically Accelerated Q-Learning for Robust Cognitive Spectrum Management in LTE Cellular Systems , 2016, IEEE Transactions on Mobile Computing.

[28]  Navrati Saxena,et al.  Next Generation 5G Wireless Networks: A Comprehensive Survey , 2016, IEEE Communications Surveys & Tutorials.

[29]  K. J. Ray Liu,et al.  Joint Spectrum Sensing and Access Evolutionary Game in Cognitive Radio Networks , 2013, IEEE Transactions on Wireless Communications.

[30]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[31]  Chun-Hung Liu,et al.  Optimal Cell Load and Throughput in Green Small Cell Networks With Generalized Cell Association , 2015, IEEE Journal on Selected Areas in Communications.

[32]  Ming Xiao,et al.  A Survey of Advanced Techniques for Spectrum Sharing in 5G Networks , 2017, IEEE Wireless Communications.

[33]  Bin Li,et al.  Robust Dynamic Spectrum Access in Uncertain Channels: A Fuzzy Payoffs Game Approach , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[34]  C. Beckman,et al.  Shared networks: making wireless communication affordable , 2005, IEEE Wireless Communications.

[35]  Ming Xiao,et al.  Spectrum Sharing for Internet of Things: A Survey , 2018, IEEE Wireless Communications.

[36]  H. Vincent Poor,et al.  A Distributed Approach to Improving Spectral Efficiency in Uplink Device-to-Device-Enabled Cloud Radio Access Networks , 2018, IEEE Transactions on Communications.

[37]  K. J. Ray Liu,et al.  Multi-Channel Sensing and Access Game: Bayesian Social Learning with Negative Network Externality , 2014, IEEE Transactions on Wireless Communications.