Transfer Learning-Based Accelerated Deep Reinforcement Learning for 5G RAN Slicing

Deep Reinforcement Learning (DRL) algorithms have been recently proposed to solve dynamic Radio Resource Management (RRM) problems in 5G networks. However, the slow convergence experienced by traditional DRL agents puts many doubts on their practical adoption in cellular networks. In this paper, we first discuss the need to have accelerated DRL algorithms. We then analyze the exploration behavior of various state-of-the-art DRL algorithms for slice resource allocation, and compare it with the traditional 5G Radio Access Network (RAN) slicing baselines. Finally, we propose a transfer learning-accelerated DRL-based solution for slice resource allocation. In particular, we tackle the challenge of slow convergence by transferring the policy learned by a DRL agent at an expert base station (BS) to newly deployed agents at target learner BSs. Our approach shows a remarkable reduction in convergence time and a significant performance improvement compared with its non-accelerated counterparts when tested against multiple traffic load variations.

[1]  Xianfu Chen,et al.  Deep Reinforcement Learning for Resource Management in Network Slicing , 2018, IEEE Access.

[2]  Qiao Tian,et al.  Transfer Learning Promotes 6G Wireless Communications: Recent Advances and Future Challenges , 2021, IEEE Transactions on Reliability.

[3]  Xin Liu,et al.  A Constrained Reinforcement Learning Based Approach for Network Slicing , 2020, 2020 IEEE 28th International Conference on Network Protocols (ICNP).

[4]  Shuguang Cui,et al.  Meta-Reinforcement Learning for Trajectory Design in Wireless UAV Networks , 2020, GLOBECOM 2020 - 2020 IEEE Global Communications Conference.

[5]  Guolin Sun,et al.  Transfer Learning for Autonomous Cell Activation Based on Relational Reinforcement Learning With Adaptive Reward , 2021, IEEE Systems Journal.

[6]  Ekram Hossain,et al.  Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial , 2020, IEEE Communications Surveys & Tutorials.

[7]  H. Vincent Poor,et al.  Experienced Deep Reinforcement Learning With Generative Adversarial Networks (GANs) for Model-Free Ultra Reliable Low Latency Communication , 2019, IEEE Transactions on Communications.

[8]  Gang Feng,et al.  Intelligent Resource Scheduling for 5G Radio Access Network Slicing , 2019, IEEE Transactions on Vehicular Technology.

[9]  Yuanyuan Wang,et al.  Integrated Resource Scheduling for User Experience Enhancement: A Heuristically Accelerated DRL , 2019, 2019 11th International Conference on Wireless Communications and Signal Processing (WCSP).

[10]  Meryem Simsek,et al.  Improved decentralized Q-learning algorithm for interference reduction in LTE-femtocells , 2011, 2011 Wireless Advanced.

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  Jian Wang,et al.  Deep Reinforcement Learning for Scheduling in Cellular Networks , 2019, 2019 11th International Conference on Wireless Communications and Signal Processing (WCSP).