QMORA: A Q-Learning based Multi-objective Resource Allocation Scheme for NFV Orchestration

To satisfy the various quality-of-service (QoS) re-quirements with minimum network costs, network functions virtualization (NFV) is proposed as an emerging wireless architecture that migrates network functions from dedicated hardware appliances to software instances running in virtual computing platforms. One crucial issue in NFV is to solve the orchestration of virtualized network functions (VNFs) to reduce costs and to improve the management flexibility of telecommunications service providers (TSPs). Particular, multiple objectives are required to be considered for orchestrating VNFs in order to achieve overall system performance. This can be optimally solved in small scale using integer linear programming (ILP) algorithms with high accuracy but low time efficiency. On the other hand, heuristic algorithms can be applied for solving part of the objectives in NFV resource allocation with high time efficiency but low accuracy. To tackle the above challenges, QMORA, a ${Q}$-learning based multi-objective resource allocation approach, is proposed to solve multi-objective optimization in NFV orchestration (NFVO) efficiently and accurately. Particularly, the approach includes reinforcement learning module and VNFs placement module. Reinforcement learning module is responsible for generating the “best” candidate paths. VNFs placement module is responsible for selecting optimal nodes on the generated candidate paths to host VNFs required for flows. The simulation results in the real ISP topology show that the proposed QMORA can balance the multi-objective including maximizing number of flows admitted to the network, minimizing path stretch, balancing the load among VNF instances and minimizing link occupation rate compared with other heuristic approaches.

[1]  Ratul Mahajan,et al.  Measuring ISP topologies with Rocketfuel , 2004, IEEE/ACM Transactions on Networking.

[2]  Mostafa Ammar,et al.  An Approach for Service Function Chain Routing and Virtual Function Network Instance Migration in Network Function Virtualization Architectures , 2017, IEEE/ACM Transactions on Networking.

[3]  Otto Carlos Muniz Bandeira Duarte,et al.  Orchestrating Virtualized Network Functions , 2015, IEEE Transactions on Network and Service Management.

[4]  Leandros Tassiulas,et al.  Q-Placement: Reinforcement-Learning-Based Service Placement in Software-Defined Networks , 2018, 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS).

[5]  Michael Kearns,et al.  Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms , 1998, NIPS.

[6]  Juan Felipe Botero,et al.  Resource Allocation in NFV: A Comprehensive Survey , 2016, IEEE Transactions on Network and Service Management.

[7]  Osvaldo Simeone,et al.  Coded Network Function Virtualization: Fault Tolerance via In-Network Coding , 2016, IEEE Wireless Communications Letters.

[8]  Jian Guo,et al.  Joint Optimization of Chain Placement and Request Scheduling for Network Function Virtualization , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[9]  Jie Wu,et al.  NFV Middlebox Placement with Balanced Set-up Cost and Bandwidth Consumption , 2018, ICPP.

[10]  Ying Zhang,et al.  Improve Service Chaining Performance with Optimized Middlebox Placement , 2017, IEEE Transactions on Services Computing.

[11]  Jing Chen,et al.  MOClusVNF: Leveraging Multi-Objective for Scalable NFV Orchestration , 2019, CCIOT 2019.

[12]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[13]  Ved P. Kafle,et al.  A Mathematical Model and Dynamic Programming Based Scheme for Service Function Chain Placement in NFV , 2019, IEICE Trans. Inf. Syst..

[14]  W. Marsden I and J , 2012 .

[15]  Peilin Hong,et al.  Resource Aware Routing for Service Function Chains in SDN and NFV-Enabled Network , 2021, IEEE Transactions on Services Computing.