Multi-Tenant Cross-Slice Resource Orchestration: A Deep Reinforcement Learning Approach

With the cellular networks becoming increasingly agile, a major challenge lies in how to support diverse services for mobile users (MUs) over a common physical network infrastructure. Network slicing is a promising solution to tailor the network to match such service requests. This paper considers a system with radio access network (RAN)-only slicing, where the physical infrastructure is split into slices providing computation and communication functionalities. A limited number of channels are auctioned across scheduling slots to MUs of multiple service providers (SPs) (i.e., the tenants). Each SP behaves selfishly to maximize the expected long-term payoff from the competition with other SPs for the orchestration of channels, which provides its MUs with the opportunities to access the computation and communication slices. This problem is modelled as a stochastic game, in which the decision makings of a SP depend on the global network dynamics as well as the joint control policy of all SPs. To approximate the Nash equilibrium solutions, we first construct an abstract stochastic game with the local conjectures of channel auction among the SPs. We then linearly decompose the per-SP Markov decision process to simplify the decision makings at a SP and derive an online scheme based on deep reinforcement learning to approach the optimal abstract control policies. Numerical experiments show significant performance gains from our scheme.

[1]  Dinh Thai Hoang,et al.  Optimal Cross Slice Orchestration for 5G Mobile Services , 2018, 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall).

[2]  Vincent W. S. Wong,et al.  Joint Optimal Pricing and Task Scheduling in Mobile Cloud Computing Systems , 2017, IEEE Transactions on Wireless Communications.

[3]  Di Yuan,et al.  Data Offloading in Load Coupled Networks: A Utility Maximization Framework , 2014, IEEE Transactions on Wireless Communications.

[4]  John N. Tsitsiklis,et al.  Feature-based methods for large scale dynamic programming , 2004, Machine Learning.

[5]  Oriol Sallent,et al.  On Radio Access Network Slicing from a Radio Resource Management Perspective , 2017, IEEE Wireless Communications.

[6]  Michel Loève,et al.  Probability Theory I , 1977 .

[7]  Jose Ordonez-Lucena,et al.  Network Slicing for 5G with SDN/NFV: Concepts, Architectures, and Challenges , 2017, IEEE Communications Magazine.

[8]  Zhu Han,et al.  Machine Learning Paradigms for Next-Generation Wireless Networks , 2017, IEEE Wireless Communications.

[9]  Sergey Andreev,et al.  Achieving End-to-End Reliability of Mission-Critical Traffic in Softwarized 5G Networks , 2018, IEEE Journal on Selected Areas in Communications.

[10]  Thomas Frisanco,et al.  Infrastructure Sharing and Shared Operations for Mobile Network Operators: From a Deployment and Operations View , 2008, 2008 IEEE International Conference on Communications.

[11]  Brian D. Noble,et al.  BreadCrumbs: forecasting mobile connectivity , 2008, MobiCom '08.

[12]  Zhu Han,et al.  Wireless Resource Scheduling in Virtualized Radio Access Networks Using Stochastic Learning , 2018, IEEE Transactions on Mobile Computing.

[13]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[14]  Mingyan Liu,et al.  Revenue generation for truthful spectrum auction in dynamic spectrum access , 2009, MobiHoc '09.

[15]  Walid Saad,et al.  Matching theory for future wireless networks: fundamentals and applications , 2014, IEEE Communications Magazine.

[16]  Man Hon Cheung,et al.  DAWN: Delay-Aware Wi-Fi Offloading and Network Selection , 2015, IEEE Journal on Selected Areas in Communications.

[17]  Sachin Katti,et al.  SoftRAN: software defined radio access network , 2013, HotSDN '13.

[18]  陈耕艺 无线网络新兵:Project Fi , 2015 .

[19]  Xianfu Chen,et al.  Software defined mobile networks: concept, survey, and research directions , 2015, IEEE Communications Magazine.

[20]  Tarik Taleb,et al.  Network Slicing and Softwarization: A Survey on Principles, Enabling Technologies, and Solutions , 2018, IEEE Communications Surveys & Tutorials.

[21]  Mehdi Bennis,et al.  Performance Optimization in Mobile-Edge Computing via Deep Reinforcement Learning , 2018, 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall).

[22]  Ejaz Ahmed,et al.  A survey on mobile edge computing , 2016, 2016 10th International Conference on Intelligent Systems and Control (ISCO).

[23]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[24]  Albert Banchs,et al.  Network Slicing Games: Enabling Customization in Multi-Tenant Mobile Networks , 2019, IEEE/ACM Transactions on Networking.

[25]  Zhu Han,et al.  Game Theory in Wireless and Communication Networks: Theory, Models, and Applications , 2011 .

[26]  Yueming Cai,et al.  Dynamic Computation Offloading for Mobile Cloud Computing: A Stochastic Game-Theoretic Approach , 2019, IEEE Transactions on Mobile Computing.

[27]  Ulas C. Kozat,et al.  Stochastic Game for Wireless Network Virtualization , 2013, IEEE/ACM Transactions on Networking.

[28]  Jeffrey G. Andrews,et al.  Femtocells: Past, Present, and Future , 2012, IEEE Journal on Selected Areas in Communications.

[29]  Tommaso Melodia,et al.  Low-Complexity Distributed Radio Access Network Slicing: Algorithms and Experimental Results , 2018, IEEE/ACM Transactions on Networking.

[30]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[31]  Arogyaswami Paulraj,et al.  Information Prediction and Dynamic Programming-Based RAN Slicing for Mobile Edge Computing , 2018, IEEE Wireless Communications Letters.

[32]  Yonggang Wen,et al.  “ A Survey of Software Defined Networking , 2020 .

[33]  Yuan Wu,et al.  Delay-Minimization Nonorthogonal Multiple Access Enabled Multi-User Mobile Edge Computation Offloading , 2019, IEEE Journal of Selected Topics in Signal Processing.

[34]  A. M. Fink,et al.  Equilibrium in a stochastic $n$-person game , 1964 .

[35]  F. Richard Yu,et al.  Wireless Network Virtualization: A Survey, Some Research Issues and Challenges , 2015, IEEE Communications Surveys & Tutorials.

[36]  F. Richard Yu,et al.  Resource Allocation for Information-Centric Virtualized Heterogeneous Networks With In-Network Caching and Mobile Edge Computing , 2017, IEEE Transactions on Vehicular Technology.

[37]  Di Yuan,et al.  Matching Theory for Over-the-Top Service Provision in 5G Networks , 2018, IEEE Transactions on Wireless Communications.

[38]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[39]  Xianfu Chen,et al.  SoftMobile: control evolution for future heterogeneous mobile networks , 2014, IEEE Wireless Communications.

[40]  Khaled Ben Letaief,et al.  Dynamic Computation Offloading for Mobile-Edge Computing With Energy Harvesting Devices , 2016, IEEE Journal on Selected Areas in Communications.

[41]  Di Yuan,et al.  Analysis of Cell Load Coupling for LTE Network Planning and Optimization , 2012, IEEE Transactions on Wireless Communications.

[42]  Long-Ji Lin,et al.  Reinforcement learning for robots using neural networks , 1992 .

[43]  Marwan Krunz,et al.  Dynamic Network Slicing for Scalable Fog Computing Systems With Energy Harvesting , 2018, IEEE Journal on Selected Areas in Communications.

[44]  K. B. Letaief,et al.  A Survey on Mobile Edge Computing: The Communication Perspective , 2017, IEEE Communications Surveys & Tutorials.

[45]  Mihaela van der Schaar,et al.  Learning to Compete for Resources in Wireless Stochastic Games , 2009, IEEE Transactions on Vehicular Technology.

[46]  Gustavo de Veciana,et al.  Network slicing games: Enabling customization in multi-tenant networks , 2016, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[47]  Marwan Krunz,et al.  Distributed Resource Allocation for Network Slicing Over Licensed and Unlicensed Bands , 2018, IEEE Journal on Selected Areas in Communications.

[48]  Vincenzo Sciancalepore,et al.  From network sharing to multi-tenancy: The 5G network slice broker , 2016, IEEE Communications Magazine.

[49]  Tuomas Sandholm,et al.  Imperfect-Recall Abstractions with Bounds in Games , 2014, EC.

[50]  K. J. Liu,et al.  Dynamic Spectrum Sharing : A Game Theoretical Overview , 2022 .

[51]  Thomas D. Burd,et al.  Processor design for portable systems , 1996, J. VLSI Signal Process..

[52]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[53]  Xianfu Chen,et al.  Energy-Efficiency Oriented Traffic Offloading in Wireless Networks: A Brief Survey and a Learning Approach for Heterogeneous Cellular Networks , 2015, IEEE Journal on Selected Areas in Communications.

[54]  C. L. Philip Chen,et al.  Broad Learning System: An Effective and Efficient Incremental Learning System Without the Need for Deep Architecture , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[55]  Yann LeCun,et al.  What is the best multi-stage architecture for object recognition? , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[56]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[57]  Honggang Zhang,et al.  Network slicing as a service: enabling enterprises' own software-defined cellular networks , 2016, IEEE Communications Magazine.

[58]  Yan Zhang,et al.  Mobile Edge Computing: A Survey , 2018, IEEE Internet of Things Journal.

[59]  Mahadev Satyanarayanan,et al.  The Emergence of Edge Computing , 2017, Computer.

[60]  Mugen Peng,et al.  Hierarchical Radio Resource Allocation for Network Slicing in Fog Radio Access Networks , 2019, IEEE Transactions on Vehicular Technology.

[61]  Michael L. Littman,et al.  Near Optimal Behavior via Approximate State Abstraction , 2016, ICML.

[62]  Richeng Jin,et al.  Privacy-Aware Offloading in Mobile-Edge Computing , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.