Network Slice Reconfiguration by Exploiting Deep Reinforcement Learning With Large Action Space

It is widely acknowledged that network slicing can tackle the diverse usage scenarios and connectivity services that the 5G-and-beyond system needs to support. To guarantee performance isolation while maximizing network resource utilization under dynamic traffic load, network slice needs to be reconfigured adaptively. However, it is commonly believed that the fine-grained resource reconfiguration problem is intractable due to the extremely high computational complexity caused by numerous variables. In this article, we investigate the reconfiguration within a core network slice with aim of minimizing long-term resource consumption by exploiting Deep Reinforcement Learning (DRL). This problem is also intractable by using conventional Deep Q Network (DQN), as it has a multi-dimensional discrete action space which is difficult to explore efficiently. To address the curse of dimensionality, we propose to exploit Branching Dueling Q-network which incorporates the action branching architecture into DQN to drastically decrease the number of estimated actions. Based on the discrete BDQ network, we develop an intelligent network slice reconfiguration algorithm (INSRA). Extensive simulation experiments are conducted to evaluate the performance of INSRA and the numerical results reveal that INSRA can minimize the long-term resource consumption and achieve high resource efficiency compared with several benchmark algorithms.

[1]  Roberto Riggio,et al.  Machine Learning-Driven Scaling and Placement of Virtual Network Functions at the Network Edges , 2019, 2019 IEEE Conference on Network Softwarization (NetSoft).

[2]  Zhi-Quan Luo,et al.  Network Slicing for Service-Oriented Networks Under Resource Constraints , 2017, IEEE Journal on Selected Areas in Communications.

[3]  Jiawei Han,et al.  A Distributed Game Methodology for Crowdsensing in Uncertain Wireless Scenario , 2020, IEEE Transactions on Mobile Computing.

[4]  Arie M. C. A. Koster,et al.  Optimisation Models for Robust and Survivable Network Slice Design: A Comparative Analysis , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[5]  Gang Feng,et al.  Intelligent Resource Scheduling for 5G Radio Access Network Slicing , 2019, IEEE Transactions on Vehicular Technology.

[6]  Yoichi Sato,et al.  Environment-Adaptive Sizing and Placement of NFV Service Chains with Accelerated Reinforcement Learning , 2019, 2019 IFIP/IEEE Symposium on Integrated Network and Service Management (IM).

[7]  Meral Shirazipour,et al.  StEERING: A software-defined networking for inline service chaining , 2013, 2013 21st IEEE International Conference on Network Protocols (ICNP).

[8]  Elisa Jimeno,et al.  Design of Virtual Infrastructure Manager with Novel VNF Placement Features for Edge Clouds in 5G , 2017, EANN.

[9]  Nikos A. Vlassis,et al.  Collaborative Multiagent Reinforcement Learning by Payoff Propagation , 2006, J. Mach. Learn. Res..

[10]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[11]  Meng-Hsun Tsai,et al.  Realizing Dynamic Network Slice Resource Management based on SDN networks , 2019, 2019 International Conference on Intelligent Computing and its Emerging Applications (ICEA).

[12]  Mostafa Ammar,et al.  An Approach for Service Function Chain Routing and Virtual Function Network Instance Migration in Network Function Virtualization Architectures , 2017, IEEE/ACM Transactions on Networking.

[13]  Mahesh K. Marina,et al.  Network Slicing in 5G: Survey and Challenges , 2017, IEEE Communications Magazine.

[14]  Jiaxing Zhang,et al.  NFVdeep: Adaptive Online Service Function Chain Deployment with Deep Reinforcement Learning , 2019, 2019 IEEE/ACM 27th International Symposium on Quality of Service (IWQoS).

[15]  Gang Feng,et al.  On Robustness of Network Slicing for Next-Generation Mobile Networks , 2019, IEEE Transactions on Communications.

[16]  Wolfgang Kellerer,et al.  Boost online virtual network embedding: Using neural networks for admission control , 2016, 2016 12th International Conference on Network and Service Management (CNSM).

[17]  Shahid Mumtaz,et al.  When Internet of Things Meets Blockchain: Challenges in Distributed Consensus , 2019, IEEE Network.

[18]  Yves Lemieux,et al.  MAPLE: A Machine Learning Approach for Efficient Placement and Adjustment of Virtual Network Functions , 2019, J. Netw. Comput. Appl..

[19]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[20]  Xianfu Chen,et al.  Deep Reinforcement Learning for Resource Management in Network Slicing , 2018, IEEE Access.

[21]  Wei Cao,et al.  Intelligent Offloading in Multi-Access Edge Computing: A State-of-the-Art Review and Framework , 2019, IEEE Communications Magazine.

[22]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[23]  Lazaros Gkatzikis,et al.  The Algorithmic Aspects of Network Slicing , 2017, IEEE Communications Magazine.

[24]  Thomas Bauschert,et al.  Network slice embedding under traffic uncertainties — A light robust approach , 2017, 2017 13th International Conference on Network and Service Management (CNSM).

[25]  Zhifeng Zhao,et al.  The Learning and Prediction of Application-Level Traffic Data in Cellular Networks , 2016, IEEE Transactions on Wireless Communications.

[26]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[27]  Gang Feng,et al.  Reconfiguration in Network Slicing—Optimizing the Profit and Performance , 2019, IEEE Transactions on Network and Service Management.

[28]  Murali S. Kodialam,et al.  Traffic engineering in software defined networks , 2013, 2013 Proceedings IEEE INFOCOM.

[29]  Xin Chen,et al.  VNF-FG design and VNF placement for 5G mobile networks , 2017, Science China Information Sciences.

[30]  Sangheon Pack,et al.  Joint Optimization of Service Function Placement and Flow Distribution for Service Function Chaining , 2017, IEEE Journal on Selected Areas in Communications.

[31]  Xavier Costa-Perez,et al.  RL-NSB: Reinforcement Learning-Based 5G Network Slice Broker , 2019, IEEE/ACM Transactions on Networking.

[32]  Edoardo Amaldi,et al.  On the computational complexity of the virtual network embedding problem , 2016, Electron. Notes Discret. Math..

[33]  Xavier Hesselbach,et al.  Virtual Network Embedding: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[34]  Abdelkader Outtagarts,et al.  A Deep Reinforcement Learning Approach for VNF Forwarding Graph Embedding , 2019, IEEE Transactions on Network and Service Management.

[35]  Mostafa H. Ammar,et al.  Dynamic Topology Configuration in Service Overlay Networks: A Study of Reconfiguration Policies , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[36]  Lena Wosinska,et al.  Dynamic slicing approach for multi-tenant 5G transport networks [invited] , 2018, IEEE/OSA Journal of Optical Communications and Networking.

[37]  Junxing Zhang,et al.  GANSlicing: A GAN-Based Software Defined Mobile Network Slicing Scheme for IoT Applications , 2019, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[38]  Swades De,et al.  VNF Placement and Resource Allocation for the Support of Vertical Services in 5G Networks , 2018, IEEE/ACM Transactions on Networking.

[39]  Tom Schaul,et al.  Prioritized Experience Replay , 2015, ICLR.

[40]  Fangming Liu,et al.  Adaptive Interference-Aware VNF Placement for Service-Customized 5G Network Slices , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[41]  Juan Felipe Botero,et al.  Resource Allocation in NFV: A Comprehensive Survey , 2016, IEEE Transactions on Network and Service Management.

[42]  Laurie G. Cuthbert,et al.  Effective isolation in dynamic network slicing , 2019, 2019 IEEE Wireless Communications and Networking Conference (WCNC).

[43]  Arash Tavakoli,et al.  Action Branching Architectures for Deep Reinforcement Learning , 2017, AAAI.