Standing on the Shoulders of Giants: Cross-Slice Federated Meta Learning for Resource Orchestration to Cold-Start Slice

Network slicing is a key technology in 6G communication systems to support numerous vertical applications for all scenes while providing resources on demand. Due to more time-varying and dynamic traffic flows, it is difficult for traditional methods to manage complex and highly dynamic 6G networks. Therefore, intelligent method such as Deep Reinforcement Learning (DRL) is employed into network management since DRL is a model-free and experience-driven approach. However, it is difficult to leverage one DRL model to provide customized intra-slice orchestration for various applications because of their diverse flow characteristics and service requirements. Moreover, training a DRL model is notoriously time-consuming so that it is not realistic for network operator to individually orchestrate customized network slice for each application with the DRL algorithm. Additionally, the data privacy of each application should be considered into the training process. In this paper, we propose a Federated Meta Reinforcement Learning (FedMRL) approach to tackle the cold-start problem in network slice orchestration, while reserving the data privacy. The Meta Reinforcement Learning (MRL) is leveraged to train a meta policy for rapidly learn a local policy for a specific slice orchestration task by finding a common initialization that allows for a quick adaptation towards each optimal solution. With the help of federated learning setting, the training process of meta policy is not required to collect raw data of applications to the centralized server. Experimental results show that FedMRL outperforms three baselines in terms of overall costs, end-to-end latency and convergence speed.

[1]  Xudong Wang,et al.  LinkSlice: Fine-Grained Network Slice Enforcement Based on Deep Reinforcement Learning , 2022, IEEE Journal on Selected Areas in Communications.

[2]  B. Shihada,et al.  Efficient Wireless Traffic Prediction at the Edge: A Federated Meta-Learning Approach , 2022, IEEE Communications Letters.

[3]  Yunjun Gao,et al.  MetaKG: Meta-Learning on Knowledge Graph for Cold-Start Recommendation , 2022, IEEE Transactions on Knowledge and Data Engineering.

[4]  Weihua Zhuang,et al.  Efficient Federated Meta-Learning Over Multi-Access Wireless Networks , 2021, IEEE Journal on Selected Areas in Communications.

[5]  Qiang Li,et al.  Federated Meta-Learning Enhanced Acoustic Radio Cooperative Framework for Ocean of Things , 2021, IEEE Journal of Selected Topics in Signal Processing.

[6]  Shahrokh Valaee,et al.  An Application-Driven Framework for Intelligent Transportation Systems Using 5G Network Slicing , 2021, IEEE Transactions on Intelligent Transportation Systems.

[7]  Qi Qi,et al.  Generative Adversarial Network-Based Transfer Reinforcement Learning for Routing With Prior Knowledge , 2021, IEEE Transactions on Network and Service Management.

[8]  Madhusanka Liyanage,et al.  Survey on Network Slicing for Internet of Things Realization in 5G Networks , 2021, IEEE Communications Surveys & Tutorials.

[9]  Zhiyuan Xu,et al.  An Actor-Critic-Based Transfer Learning Framework for Experience-Driven Networking , 2021, IEEE/ACM Transactions on Networking.

[10]  Shuguang Cui,et al.  A Comprehensive Survey on 6G Networks: Applications, Core Services, Enabling Technologies, and Future Challenges , 2021, ArXiv.

[11]  Huaming Wu,et al.  DMRO: A Deep Meta Reinforcement Learning-Based Task Offloading Framework for Edge-Cloud Computing , 2020, IEEE Transactions on Network and Service Management.

[12]  Albert Y. Zomaya,et al.  Fast Adaptive Task Offloading in Edge Computing Based on Meta Reinforcement Learning , 2020, IEEE Transactions on Parallel and Distributed Systems.

[13]  Nicolas Chapados,et al.  Meta-learning framework with applications to zero-shot time-series forecasting , 2020, AAAI.

[14]  Xin Liu,et al.  A Constrained Reinforcement Learning Based Approach for Network Slicing , 2020, 2020 IEEE 28th International Conference on Network Protocols (ICNP).

[15]  Zhiguo Ding,et al.  A Meta-Learning Framework for Learning Multi-User Preferences in QoE Optimization of DASH , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Bo Li,et al.  Latency-aware VNF Chain Deployment with Efficient Resource Reuse at Network Edge , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications.

[17]  Lan Yan,et al.  Federated Meta-Learning for Fraudulent Credit Card Detection , 2020, IJCAI.

[18]  Marco Fiore,et al.  Network Slicing Meets Artificial Intelligence: An AI-Based Framework for Slice Management , 2020, IEEE Communications Magazine.

[19]  Tong Li,et al.  Automatic Virtual Network Embedding: A Deep Reinforcement Learning Approach With Graph Convolutional Networks , 2020, IEEE Journal on Selected Areas in Communications.

[20]  Kai Xu,et al.  MetaLight: Value-Based Meta-Reinforcement Learning for Traffic Signal Control , 2020, AAAI.

[21]  Choong Seon Hong,et al.  Edge Computing Resources Reservation in Vehicular Networks: A Meta-Learning Approach , 2020, IEEE Transactions on Vehicular Technology.

[22]  Albert Y. Zomaya,et al.  Intelligent VNF Orchestration and Flow Scheduling via Model-Assisted Deep Reinforcement Learning , 2020, IEEE Journal on Selected Areas in Communications.

[23]  Xu Li,et al.  Dynamic Flow Migration for Embedded Services in SDN/NFV-Enabled 5G Core Networks , 2020, IEEE Transactions on Communications.

[24]  Junshan Zhang,et al.  A Collaborative Learning Framework via Federated Meta-Learning , 2020, 2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS).

[25]  Joonhyuk Kang,et al.  From Learning to Meta-Learning: Reduced Training Overhead and Complexity for Communication Systems , 2020, 2020 2nd 6G Wireless Summit (6G SUMMIT).

[26]  Anit Kumar Sahu,et al.  Federated Learning: Challenges, Methods, and Future Directions , 2019, IEEE Signal Processing Magazine.

[27]  Aryan Mokhtari,et al.  Personalized Federated Learning with Theoretical Guarantees: A Model-Agnostic Meta-Learning Approach , 2020, NeurIPS.

[28]  Abdelkader Outtagarts,et al.  A Deep Reinforcement Learning Approach for VNF Forwarding Graph Embedding , 2019, IEEE Transactions on Network and Service Management.

[29]  Jakub Konecný,et al.  Improving Federated Learning Personalization via Model Agnostic Meta Learning , 2019, ArXiv.

[30]  Meng-Hsun Tsai,et al.  Realizing Dynamic Network Slice Resource Management based on SDN networks , 2019, 2019 International Conference on Intelligent Computing and its Emerging Applications (ICEA).

[31]  P. Fan,et al.  6G Wireless Networks: Vision, Requirements, Architecture, and Key Technologies , 2019, IEEE Vehicular Technology Magazine.

[32]  Zongpeng Li,et al.  Scaling Geo-Distributed Network Function Chains: A Prediction and Learning Framework , 2019, IEEE Journal on Selected Areas in Communications.

[33]  Jane X. Wang,et al.  Reinforcement Learning, Fast and Slow , 2019, Trends in Cognitive Sciences.

[34]  Laurie G. Cuthbert,et al.  Effective isolation in dynamic network slicing , 2019, 2019 IEEE Wireless Communications and Networking Conference (WCNC).

[35]  Fangming Liu,et al.  Adaptive Interference-Aware VNF Placement for Service-Customized 5G Network Slices , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[36]  Gang Feng,et al.  Reconfiguration in Network Slicing—Optimizing the Profit and Performance , 2019, IEEE Transactions on Network and Service Management.

[37]  Ying-Chang Liang,et al.  Applications of Deep Reinforcement Learning in Communications and Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[38]  Shih-Chun Lin,et al.  End-to-End Network Slicing for 5G&B Wireless Software-Defined Systems , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[39]  Xiaodong Lin,et al.  Querying in Internet of Things with Privacy Preserving: Challenges, Solutions and Opportunities , 2018, IEEE Network.

[40]  Joshua Achiam,et al.  On First-Order Meta-Learning Algorithms , 2018, ArXiv.

[41]  Zhenguo Li,et al.  Federated Meta-Learning with Fast Convergence and Efficient Communication , 2018, 1802.07876.

[42]  Chi Harold Liu,et al.  Experience-driven Networking: A Deep Reinforcement Learning based Approach , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[43]  Stéphane Pérennes,et al.  Provably Efficient Algorithms for Placement of Service Function Chains with Ordering Constraints , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[44]  Lena Wosinska,et al.  Dynamic slicing approach for multi-tenant 5G transport networks [invited] , 2018, IEEE/OSA Journal of Optical Communications and Networking.

[45]  Vincenzo Sciancalepore,et al.  Slice as a Service (SlaaS) Optimal IoT Slice Resources Orchestration , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[46]  Sangheon Pack,et al.  Joint Optimization of Service Function Placement and Flow Distribution for Service Function Chaining , 2017, IEEE Journal on Selected Areas in Communications.

[47]  Radu Soricut,et al.  Cold-Start Reinforcement Learning with Softmax Policy Gradient , 2017, NIPS.

[48]  Zhi-Quan Luo,et al.  Network Slicing for Service-Oriented Networks Under Resource Constraints , 2017, IEEE Journal on Selected Areas in Communications.

[49]  Lazaros Gkatzikis,et al.  The Algorithmic Aspects of Network Slicing , 2017, IEEE Communications Magazine.

[50]  Hang Li,et al.  Meta-SGD: Learning to Learn Quickly for Few Shot Learning , 2017, ArXiv.

[51]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[52]  Mahesh K. Marina,et al.  Network Slicing in 5G: Survey and Challenges , 2017, IEEE Communications Magazine.

[53]  Jose Ordonez-Lucena,et al.  Network Slicing for 5G with SDN/NFV: Concepts, Architectures, and Challenges , 2017, IEEE Communications Magazine.

[54]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[55]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[56]  Ali A. Ghorbani,et al.  Characterization of Tor Traffic using Time based Features , 2017, ICISSP.

[57]  Maxim Sviridenko,et al.  Inapproximability of the Multilevel Uncapacitated Facility Location Problem , 2016, ACM Trans. Algorithms.

[58]  Juan Felipe Botero,et al.  Resource Allocation in NFV: A Comprehensive Survey , 2016, IEEE Transactions on Network and Service Management.

[59]  Honggang Zhang,et al.  Network slicing as a service: enabling enterprises' own software-defined cellular networks , 2016, IEEE Communications Magazine.

[60]  Edoardo Amaldi,et al.  On the computational complexity of the virtual network embedding problem , 2016, Electron. Notes Discret. Math..

[61]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[62]  Sergey Levine,et al.  High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.

[63]  Richard Evans,et al.  Deep Reinforcement Learning in Large Discrete Action Spaces , 2015, 1512.07679.

[64]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[65]  Xavier Hesselbach,et al.  Virtual Network Embedding: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[66]  Matthew Roughan,et al.  The Internet Topology Zoo , 2011, IEEE Journal on Selected Areas in Communications.

[67]  Lihong Li,et al.  PAC model-free reinforcement learning , 2006, ICML.

[68]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.