Learning-Based Multi-Channel Access in 5G and Beyond Networks With Fast Time-Varying Channels

We propose a learning-based scheme to investigate the dynamic multi-channel access (DMCA) problem in the fifth generation (5G) and beyond networks with fast time-varying channels wherein the channel parameters are unknown. The proposed learning-based scheme can maintain near-optimal performance for a long time, even in the sharp changing channels. This scheme greatly reduces processing delay, and effectively alleviates the error due to decision lag, which is cased by the non-immediacy of the information acquisition and processing. We first propose a psychology-based personalized quality of service model after introducing the network model with unknown channel parameters and the streaming model. Then, two access criteria are presented for the living streaming model and the buffered streaming model. Their corresponding optimization problems are also formulated. The optimization problems are solved by learning-based DMCA scheme, which combines the recurrent neural network with deep reinforcement learning. In the learning-based DMCA scheme, the agent mainly invokes the proposed prediction-based deep deterministic policy gradient algorithm as the learning algorithm. As a novel technical paradigm, our scheme has strong universality, since it can be easily extended to solve other problems in wireless communications. The real channel data-based simulation results validate that the performance of the learning-based scheme approaches that derived from the exhaustive search when making a decision at each time-slot, and is superior to the exhaustive search method when making a decision at every few time-slots.

[1]  Jinho Choi,et al.  NOMA-Based Random Access With Multichannel ALOHA , 2017, IEEE Journal on Selected Areas in Communications.

[2]  Guan Gui,et al.  Fast Beamforming Design via Deep Learning , 2020, IEEE Transactions on Vehicular Technology.

[3]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[4]  Bhaskar Krishnamachari,et al.  On myopic sensing for multi-channel opportunistic access: structure, optimality, and performance , 2007, IEEE Transactions on Wireless Communications.

[5]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[6]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[7]  Gang Feng,et al.  Multi-RAT Access Based on Multi-Agent Reinforcement Learning , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[8]  Jae Wook Shin,et al.  Deterministic rendezvous scheme in multichannel access networks , 2010 .

[9]  Jie Yang,et al.  Data-Driven Deep Learning for Automatic Modulation Recognition in Cognitive Radios , 2019, IEEE Transactions on Vehicular Technology.

[10]  Tao Jiang,et al.  Channel Prediction in Time-Varying Massive MIMO Environments , 2017, IEEE Access.

[11]  Tuan A. Nguyen,et al.  Deep Q-Learning with Multiband Sensing for Dynamic Spectrum Access , 2018, 2018 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN).

[12]  Geoffrey Ye Li,et al.  Compression and Acceleration of Neural Networks for Communications , 2019, IEEE Wireless Communications.

[13]  Jianyu Wang,et al.  Heterogeneous quality of experience guarantees over wireless networks , 2018, China Communications.

[14]  Pan Li,et al.  Channel State Information Prediction for 5G Wireless Communications: A Deep Learning Approach , 2020, IEEE Transactions on Network Science and Engineering.

[15]  Vasant Honavar,et al.  Learn++: an incremental learning algorithm for supervised neural networks , 2001, IEEE Trans. Syst. Man Cybern. Part C.

[16]  Shengwei Hou,et al.  High mobility orthogonal frequency division multiple access channel estimation using basis expansion model , 2010, IET Commun..

[17]  Lin Chen,et al.  On optimality of myopic policy in multi-channel opportunistic access , 2016, 2016 IEEE International Conference on Communications (ICC).

[18]  Fumiyuki Adachi,et al.  Deep-Learning-Based Millimeter-Wave Massive MIMO for Hybrid Precoding , 2019, IEEE Transactions on Vehicular Technology.

[19]  Khaled Ben Letaief,et al.  Massive MIMO Beamforming With Transmit Diversity for High Mobility Wireless Communications , 2017, IEEE Access.

[20]  Fumiyuki Adachi,et al.  Deep Learning for Physical-Layer 5G Wireless Techniques: Opportunities, Challenges and Solutions , 2019, IEEE Wireless Communications.

[21]  Xianbin Wang,et al.  Mobility Management through Scalable C/U-Plane Decoupling in IoV Networks , 2019, IEEE Communications Magazine.

[22]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[23]  Wei Cui,et al.  Spatial Deep Learning for Wireless Scheduling , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[24]  Vasanthan Raghavan,et al.  Evolution of Physical-Layer Communications Research in the Post-5G Era , 2019, IEEE Access.

[25]  Victor C. M. Leung,et al.  Network Slicing Based 5G and Future Mobile Networks: Mobility, Resource Management, and Challenges , 2017, IEEE Communications Magazine.

[26]  Ana Galindo-Serrano,et al.  Distributed Q-Learning for Aggregated Interference Control in Cognitive Radio Networks , 2010, IEEE Transactions on Vehicular Technology.

[27]  Mingyan Liu,et al.  Optimality of Myopic Sensing in Multi-Channel Opportunistic Access , 2008, 2008 IEEE International Conference on Communications.

[28]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[29]  Ismail Güvenç,et al.  Learning Based Frequency- and Time-Domain Inter-Cell Interference Coordination in HetNets , 2014, IEEE Transactions on Vehicular Technology.

[30]  Bhaskar Krishnamachari,et al.  Dynamic Multichannel Access With Imperfect Channel State Detection , 2010, IEEE Transactions on Signal Processing.

[31]  Victor C. M. Leung,et al.  A Multichannel Medium Access Control Protocol for Vehicular Power Line Communication Systems , 2016, IEEE Transactions on Vehicular Technology.

[32]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[33]  Kobi Cohen,et al.  Deep Multi-User Reinforcement Learning for Dynamic Spectrum Access in Multichannel Wireless Networks , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[34]  Yuchou Chang,et al.  Unsupervised feature selection using clustering ensembles and population based incremental learning algorithm , 2008, Pattern Recognit..

[35]  Mingyan Liu,et al.  Sufficient Conditions on the Optimality of Myopic Sensing in Opportunistic Channel Access: A Unifying Framework , 2014, IEEE Transactions on Information Theory.

[36]  Bhaskar Krishnamachari,et al.  Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks , 2018, IEEE Transactions on Cognitive Communications and Networking.

[37]  Soung Chang Liew,et al.  Deep-Reinforcement Learning Multiple Access for Heterogeneous Wireless Networks , 2017, 2018 IEEE International Conference on Communications (ICC).

[38]  Luiz A. DaSilva,et al.  Complexity of Spectrum Activity and Benefits of Reinforcement Learning for Dynamic Channel Selection , 2013, IEEE Journal on Selected Areas in Communications.

[39]  Nei Kato,et al.  Efficient Resource Allocation Utilizing Q-Learning in Multiple UA Communications , 2019, IEEE Transactions on Network Science and Engineering.

[40]  Alexandra Duel-Hallen,et al.  Deterministic channel modeling and long range prediction of fast fading mobile radio channels , 1998, IEEE Communications Letters.

[41]  Pingzhi Fan,et al.  5G high mobility wireless communications: Challenges and solutions , 2016, China Communications.

[42]  Lin Chen,et al.  On Optimality of Myopic Policy in Multi-Channel Opportunistic Access , 2017, IEEE Transactions on Communications.

[43]  Guan Gui,et al.  Deep Learning for an Effective Nonorthogonal Multiple Access Scheme , 2018, IEEE Transactions on Vehicular Technology.

[44]  Koushik Kar,et al.  Throughput-Optimal Scheduling in Multichannel Access Point Networks Under Infrequent Channel Measurements , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[45]  Guan Gui,et al.  Deep Learning for Super-Resolution Channel Estimation and DOA Estimation Based Massive MIMO System , 2018, IEEE Transactions on Vehicular Technology.

[46]  Qing Zhao,et al.  Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access , 2008, IEEE Transactions on Information Theory.

[47]  Xiang Cheng,et al.  Channel Prediction Based Scheduling for Data Dissemination in VANETs , 2017, IEEE Communications Letters.

[48]  Naofal Al-Dhahir,et al.  Unsupervised Machine Learning-Based User Clustering in Millimeter-Wave-NOMA Systems , 2018, IEEE Transactions on Wireless Communications.

[49]  Quan Liu,et al.  On Optimality of Myopic Policy in Opportunistic Spectrum Access: The Case of Sensing Multiple Channels and Accessing One Channel , 2012, IEEE Wireless Communications Letters.

[50]  Shalabh Bhatnagar,et al.  Natural actor-critic algorithms , 2009, Autom..

[51]  Tao Jiang,et al.  Deep learning for wireless physical layer: Opportunities and challenges , 2017, China Communications.

[52]  Tao Jiang,et al.  Downlink Channel Prediction for Time-Varying FDD Massive MIMO Systems , 2019, IEEE Journal of Selected Topics in Signal Processing.

[53]  Kobi Cohen,et al.  Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access , 2017, IEEE Transactions on Wireless Communications.