A Q-Learning-Based Approach for Deploying Dynamic Service Function Chains

As the size and service requirements of today’s networks gradually increase, large numbers of proprietary devices are deployed, which leads to network complexity, information security crises and makes network service and service provider management increasingly difficult. Network function virtualization (NFV) technology is one solution to this problem. NFV separates network functions from hardware and deploys them as software on a common server. NFV can be used to improve service flexibility and isolate the services provided for each user, thus guaranteeing the security of user data. Therefore, the use of NFV technology includes many problems worth studying. For example, when there is a free choice of network path, one problem is how to choose a service function chain (SFC) that both meets the requirements and offers the service provider maximum profit. Most existing solutions are heuristic algorithms with high time efficiency, or integer linear programming (ILP) algorithms with high accuracy. It’s necessary to design an algorithm that symmetrically considers both time efficiency and accuracy. In this paper, we propose the Q-learning Framework Hybrid Module algorithm (QLFHM), which includes reinforcement learning to solve this SFC deployment problem in dynamic networks. The reinforcement learning module in QLFHM is responsible for the output of alternative paths, while the load balancing module in QLFHM is responsible for picking the optimal solution from them. The results of a comparison simulation experiment on a dynamic network topology show that the proposed algorithm can output the approximate optimal solution in a relatively short time while also considering the network load balance. Thus, it achieves the goal of maximizing the benefit to the service provider.

[1]  Marc Peter Deisenroth,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[2]  Jiayi Cao,et al.  Reinforcement learning-based real-time power management for hybrid energy storage system in the plug-in hybrid electric vehicle , 2018 .

[3]  Giang Son Tran,et al.  Two levels autonomic resource management in virtualized IaaS , 2013, Future Gener. Comput. Syst..

[4]  K. Seeliger,et al.  Generative adversarial networks for reconstructing natural images from brain activity , 2018 .

[5]  P. Anandan,et al.  Cooperativity in Networks of Pattern Recognizing Stochastic Learning Automata , 1986 .

[6]  Junjie Liu,et al.  On Dynamic Service Function Chain Deployment and Readjustment , 2017, IEEE Transactions on Network and Service Management.

[7]  Chih-Hung Chen,et al.  Operation, administration and maintenance (OA&M) architecture design for internet of things , 2017 .

[8]  Gang Sun,et al.  A new technique for efficient live migration of multiple virtual machines , 2016, Future Gener. Comput. Syst..

[9]  Juan Felipe Botero,et al.  Resource Allocation in NFV: A Comprehensive Survey , 2016, IEEE Transactions on Network and Service Management.

[10]  Victor I. Chang,et al.  The efficient framework and algorithm for provisioning evolving VDC in federated data centers , 2017, Future Gener. Comput. Syst..

[11]  Symeon Papavassiliou,et al.  On the Problem of Optimal Cell Selection and Uplink Power Control in Open Access Multi-service Two-Tier Femtocell Networks , 2014, ADHOC-NOW.

[12]  Luciana S. Buriol,et al.  Piecing together the NFV provisioning puzzle: Efficient placement and chaining of virtual network functions , 2015, 2015 IFIP/IEEE International Symposium on Integrated Network Management (IM).

[13]  Victor I. Chang,et al.  Low-latency orchestration for workflow-oriented service function chain in edge computing , 2018, Future Gener. Comput. Syst..

[14]  Seo-Young Noh,et al.  AmoebaNet: An SDN-enabled network service for big data science , 2018, J. Netw. Comput. Appl..

[15]  Wenjian Fang,et al.  Joint defragmentation of optical spectrum and IT resources in elastic optical datacenter interconnections , 2015, IEEE/OSA Journal of Optical Communications and Networking.

[16]  J. R. Chen,et al.  Learning Algorithms: Theory and Applications in Signal Processing, Control and Communications , 2017 .

[17]  Zuqing Zhu,et al.  Distributed Online Scheduling and Routing of Multicast-Oriented Tasks for Profit-Driven Cloud Computing , 2016, IEEE Communications Letters.

[18]  Noel De Palma,et al.  Software consolidation as an efficient energy and cost saving solution , 2016, Future Gener. Comput. Syst..

[19]  Filip De Turck,et al.  Network Function Virtualization: State-of-the-Art and Research Challenges , 2015, IEEE Communications Surveys & Tutorials.

[20]  Radu-Emil Precup,et al.  Data-driven model-free slip control of anti-lock braking systems using reinforcement Q-learning , 2018, Neurocomputing.

[21]  Daniel Hagimont,et al.  Enforcing CPU allocation in a heterogeneous IaaS , 2015, Future Gener. Comput. Syst..

[22]  Anoop Ghanwani,et al.  Service Function Chaining (SFC) Operations, Administration, and Maintenance (OAM) Framework , 2020, RFC.

[23]  Otto Carlos Muniz Bandeira Duarte,et al.  Orchestrating Virtualized Network Functions , 2015, IEEE Transactions on Network and Service Management.

[24]  Xiaoning Zhang,et al.  Power-Efficient Provisioning for Online Virtual Network Requests in Cloud-Based Data Centers , 2015, IEEE Systems Journal.

[25]  Muthu Ramachandran,et al.  Big Data and Internet of Things - Fusion for different services and its impacts , 2018, Future Gener. Comput. Syst..

[26]  Éric Rutten,et al.  Coordinating self-sizing and self-repair managers for multi-tier systems , 2014, Future Gener. Comput. Syst..

[27]  Symeon Papavassiliou,et al.  Demand Response Management in Smart Grid Networks: a Two-Stage Game-Theoretic Learning-Based Approach , 2018, Mobile Networks and Applications.

[28]  Athanasios V. Vasilakos,et al.  Energy-efficient and traffic-aware service function chaining orchestration in multi-domain networks , 2019, Future Gener. Comput. Syst..

[29]  Mehdi Khazaei Occupancy Overload Control by Q-learning , 2019 .

[30]  Sang Il Kim,et al.  A research on dynamic service function chaining based on reinforcement learning using resource usage , 2017, 2017 Ninth International Conference on Ubiquitous and Future Networks (ICUFN).

[31]  Gang Sun,et al.  Live Migration for Multiple Correlated Virtual Machines in Cloud-Based Data Centers , 2018, IEEE Transactions on Services Computing.

[32]  Filip De Turck,et al.  Customizable Function Chains: Managing Service Chain Variability in Hybrid NFV Networks , 2016, IEEE Transactions on Network and Service Management.

[33]  Biswanath Mukherjee,et al.  Joint Virtual Network Function Placement and Routing of Traffic in Operator Networks , 2015 .

[34]  Victor Chang,et al.  Service Function Chain Orchestration Across Multiple Domains: A Full Mesh Aggregation Approach , 2018, IEEE Transactions on Network and Service Management.

[35]  Wei Lu,et al.  Joint Spectrum and IT Resource Allocation for Efficient VNF Service Chaining in Inter-Datacenter Elastic Optical Networks , 2016, IEEE Communications Letters.

[36]  Weihua Zhuang,et al.  UAV Relay in VANETs Against Smart Jamming With Reinforcement Learning , 2018, IEEE Transactions on Vehicular Technology.

[37]  Victor I. Chang,et al.  The cost-efficient deployment of replica servers in virtual content distribution networks for data fusion , 2017, Inf. Sci..

[38]  Victor I. Chang,et al.  Towards provisioning hybrid virtual networks in federated cloud data centers , 2017, Future Gener. Comput. Syst..

[39]  Bo Yi,et al.  A comprehensive survey of Network Function Virtualization , 2018, Comput. Networks.

[40]  Peng Wang,et al.  Joint service function chain deploying and path selection for bandwidth saving and VNF reuse , 2018, Int. J. Commun. Syst..

[41]  Cem Unsal,et al.  Multiple Stochastic Learning Automata for Vehicle Path Control in an Automated Highway System , 1999 .