Dynamic Network Slice Reconfiguration by Exploiting Deep Reinforcement Learning

It is widely acknowledged that network slicing can tackle the diverse usage scenarios and connectivity services that the 5G-and-beyond systems need to support. To guarantee performance isolation while maximizing network resource utilization under traffic uncertainty, network slice needs to be reconfigured adaptively. However, it is commonly believed that the fine-grained resource reconfiguration problem is intractable due to the extremely high computational complexity caused by the numerous variables. In this paper, we investigate network slice reconfiguration with aim of minimizing long-term resource consumption by exploiting Deep Reinforcement Learning (DRL). To address the curse of dimensionality of the problem, we propose to incorporate the Branching Dueling Q-network (BDQ) into DRL, to avoid some unnecessary calculations of Q-value by separating the Q-network into a shared value branch and a number of distributed advantage branches. Furthermore, the value branch and the advantage branch of each dimension are aggregated to derive the corresponding dimension’s sub-Q-value. Then the best reconfiguration action is composed of the subactions in individual dimensions which are selected by $\epsilon -$greedy policy. Finally, we design an intelligent online network slice reconfiguration policy based on BDQ and extensive simulation experiments are conducted to validate the effectiveness of the proposed slice reconfiguration policy.

[1]  Mahesh K. Marina,et al.  Network Slicing in 5G: Survey and Challenges , 2017, IEEE Communications Magazine.

[2]  Gang Feng,et al.  On Robustness of Network Slicing for Next-Generation Mobile Networks , 2019, IEEE Transactions on Communications.

[3]  Xavier Costa-Perez,et al.  RL-NSB: Reinforcement Learning-Based 5G Network Slice Broker , 2019, IEEE/ACM Transactions on Networking.

[4]  Shunliang Zhang,et al.  An Overview of Network Slicing for 5G , 2019, IEEE Wireless Communications.

[5]  Zhi-Quan Luo,et al.  Network Slicing for Service-Oriented Networks Under Resource Constraints , 2017, IEEE Journal on Selected Areas in Communications.

[6]  Gang Feng,et al.  Reconfiguration in Network Slicing—Optimizing the Profit and Performance , 2019, IEEE Transactions on Network and Service Management.

[7]  Xavier Hesselbach,et al.  Virtual Network Embedding: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[8]  Mostafa H. Ammar,et al.  Dynamic Topology Configuration in Service Overlay Networks: A Study of Reconfiguration Policies , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[9]  Arash Tavakoli,et al.  Action Branching Architectures for Deep Reinforcement Learning , 2017, AAAI.

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11]  Wolfgang Kellerer,et al.  Boost online virtual network embedding: Using neural networks for admission control , 2016, 2016 12th International Conference on Network and Service Management (CNSM).

[12]  Chen Qi,et al.  Deep Reinforcement Learning With Discrete Normalized Advantage Functions for Resource Management in Network Slicing , 2019, IEEE Communications Letters.

[13]  Zhifeng Zhao,et al.  The Learning and Prediction of Application-Level Traffic Data in Cellular Networks , 2016, IEEE Transactions on Wireless Communications.