RABA: Resource-Aware Backup Allocation For A Chain of Virtual Network Functions

Network Function Virtualization (NFV) turns a sequence of network functions on hardwares into a service chain of virtual network functions (VNFs) provisioned on virtual machines or containers. However, the chain of VNFs may suffer from interruption as long as one VNF fails due to software faults or hardware malfunctions. A common approach to ensuring high availability is to provide backup nodes for primary VNFs. However, existing work on allocating backup nodes have not considered the heterogeneous resource demands of different VNFs. In this paper, we formalize the resource-aware backup allocation problem, which aims to minimize the backup resource consumption while meeting the overall availability demand. To this end, we prove the NP-hardness of this problem and propose the RABA-CDDE algorithm based on differential evolution to solve it. Besides, to reduce the computation overhead of RABA-CDDE, a greedy algorithm is proposed. Our extensive evaluation shows that the proposed algorithms can reduce the resource consumption by about 15% and 35% respectively compared to the state-of-art solutions in dedicated and shared protection scenarios.

[1]  Hani Jamjoom,et al.  Pico replication: a high availability framework for middleboxes , 2013, SoCC.

[2]  Chunming Qiao,et al.  Carrier-grade availability-aware mapping of Service Function Chains with on-site backups , 2017, 2017 IEEE/ACM 25th International Symposium on Quality of Service (IWQoS).

[3]  Ori Rottenstreich,et al.  Optimizing virtual backup allocation for middleboxes , 2016, 2016 IEEE 24th International Conference on Network Protocols (ICNP).

[4]  Diego Lopez,et al.  Service Function Chaining Use Cases in Mobile Networks , 2019 .

[5]  H. Kellerer,et al.  Introduction to NP-Completeness of Knapsack Problems , 2004 .

[6]  Ilya Gertsbakh,et al.  Models of Network Reliability: Analysis, Combinatorics, and Monte Carlo , 2009 .

[7]  Dutch T. Meyer,et al.  Remus: High Availability via Asynchronous Virtual Machine Replication. (Best Paper) , 2008, NSDI.

[8]  Jorge E. Hurtado,et al.  Neural-network-based reliability analysis: a comparative study , 2001 .

[9]  Alice E. Smith,et al.  A General Neural Network Model for Estimating Telecommunications Network Reliability , 2009, IEEE Transactions on Reliability.

[10]  Shunsuke Homma,et al.  Service Function Chaining Use Cases In Data Centers , 2017 .

[11]  Michael M. Skolnick,et al.  Using Genetic Algorithms in Engineering Design Optimization with Non-Linear Constraints , 1993, ICGA.

[12]  Benjamín Barán,et al.  A Virtual Machine Placement Taxonomy , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[13]  Juan Felipe Botero,et al.  Resilient allocation of service Function chains , 2016, 2016 IEEE Conference on Network Function Virtualization and Software Defined Networks (NFV-SDN).

[14]  Chunming Qiao,et al.  Availability-aware mapping of service function chains , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[15]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[16]  Chunming Qiao,et al.  GREP: Guaranteeing Reliability with Enhanced Protection in NFV , 2015, HotMiddlebox@SIGCOMM.

[17]  Biswanath Mukherjee,et al.  A Survey on Resiliency Techniques in Cloud Computing Infrastructures and Applications , 2016, IEEE Communications Surveys & Tutorials.

[18]  Chadi Assi,et al.  A Reliability-Aware Network Service Chain Provisioning With Delay Guarantees in NFV-Enabled Enterprise Datacenter Networks , 2017, IEEE Transactions on Network and Service Management.

[19]  W. H I T E P A P,et al.  Protecting Mission-Critical Workloads with VMware Fault Tolerance , 2009 .

[20]  Godfrey C. Onwubolu,et al.  Scheduling flow shops using differential evolution algorithm , 2006, Eur. J. Oper. Res..