NFVactor: A Resilient NFV System Using the Distributed Actor Model

Resilience functionality, including failure resilience and flow migration, is of pivotal importance in practical network function virtualization (NFV) systems. However, existing failure recovery procedures incur high packet processing delay due to heavyweight process checkpointing, while flow migration has poor performance due to centralized control. This paper proposes <italic>NFVactor</italic>, a novel NFV system that aims to provide lightweight failure resilience and high-performance flow migration. <italic>NFVactor</italic> enables these by using actor model to provide a per-flow execution environment, so that each flow can replicate and migrate itself with improved parallelism, while the efficiency of the actor model is guaranteed by a carefully designed runtime system. Moreover, <italic>NFVactor</italic> achieves transparent resilience: once a new network function (NF) is implemented for <italic>NFVactor</italic>, the NF automatically acquires resilience support. Our evaluation result shows that <italic>NFVactor</italic> achieves 10-Gbps packet processing, flow migration completion time that is 144 times faster than the existing system, and packet processing delay stabilized at around 20 <inline-formula> <tex-math notation="LaTeX">$\mu \text{s}$ </tex-math></inline-formula> during replication.

[1]  Eddie Kohler,et al.  The Click modular router , 1999, SOSP.

[2]  Christos Gkantsidis,et al.  Enabling End-Host Network Functions , 2015, Comput. Commun. Rev..

[3]  Angelos D. Keromytis,et al.  Implementing a distributed firewall , 2000, CCS.

[4]  Tarik Taleb,et al.  Towards Edge Slicing: VNF Placement Algorithms for a Dynamic & Realistic Edge Cloud Environment , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[5]  Anat Bremler-Barr,et al.  OpenBox: A Software-Defined Framework for Developing, Deploying, and Managing Network Functions , 2016, SIGCOMM.

[6]  Andrew Prout,et al.  Big Data Analysis using Distributed Actors Framework , 2013 .

[7]  Andrew Warfield,et al.  Split/Merge: System Support for Elastic Execution in Virtual Middleboxes , 2013, NSDI.

[8]  K. K. Ramakrishnan,et al.  OpenNetVM: A Platform for High Performance Network Service Chains , 2016, HotMiddlebox@SIGCOMM.

[9]  Vyas Sekar,et al.  A High Performance Packet Core for Next Generation Cellular Networks , 2017, SIGCOMM.

[10]  Aditya Akella,et al.  Stratos: Virtual Middleboxes as First-Class Entities , 2012 .

[11]  Tarik Taleb,et al.  Efficient virtual evolved packet core deployment across multiple cloud domains , 2018, 2018 IEEE Wireless Communications and Networking Conference (WCNC).

[12]  Tarik Taleb,et al.  On Using SDN in 5G: The Controller Placement Problem , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[13]  Vyas Sekar,et al.  Design and Implementation of a Consolidated Middlebox Architecture , 2012, NSDI.

[14]  K. K. Ramakrishnan,et al.  Flurries: Countless Fine-Grained NFs for Flexible Per-Flow Customization , 2016, CoNEXT.

[15]  K. K. Ramakrishnan,et al.  NetVM: High Performance and Flexible Networking Using Virtualization on Commodity Platforms , 2014, IEEE Transactions on Network and Service Management.

[16]  Gul A. Agha,et al.  ACTORS - a model of concurrent computation in distributed systems , 1985, MIT Press series in artificial intelligence.

[17]  Hani Jamjoom,et al.  Pico replication: a high availability framework for middleboxes , 2013, SoCC.

[18]  Aditya Gopalan,et al.  Optimizing distributed actor systems for dynamic interactive services , 2016, EuroSys.

[19]  Roberto Bifulco,et al.  ClickOS and the Art of Network Function Virtualization , 2014, NSDI.

[20]  Franck Le,et al.  Stateless Network Functions: Breaking the Tight Coupling of State and Processing , 2017, NSDI.

[21]  Tarik Taleb,et al.  Towards Modeling Cross-Domain Network Slices for 5G , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[22]  Tarik Taleb,et al.  Service-aware network function placement for efficient traffic handling in carrier cloud , 2014, 2014 IEEE Wireless Communications and Networking Conference (WCNC).

[23]  Tarik Taleb,et al.  Ensuring End-to-End QoS Based on Multi-Paths Routing Using SDN Technology , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[24]  Minlan Yu,et al.  SIMPLE-fying middlebox policy enforcement using SDN , 2013, SIGCOMM.

[25]  Thomas C. Schmidt,et al.  Revisiting actor programming in C++ , 2015, Comput. Lang. Syst. Struct..

[26]  Aditya Akella,et al.  Paving the Way for NFV: Simplifying Middlebox Modifications Using StateAlyzr , 2016, NSDI.

[27]  Scott Shenker,et al.  E2: a framework for NFV applications , 2015, SOSP.

[28]  Peng Wang,et al.  U-HAUL: Efficient State Migration in NFV , 2016, APSys.

[29]  Aditya Akella,et al.  OpenNF , 2014, SIGCOMM.

[30]  Vyas Sekar,et al.  KLEIN: A Minimally Disruptive Design for an Elastic Cellular Core , 2016, SOSR.

[31]  Scott Shenker,et al.  NetBricks: Taking the V out of NFV , 2016, OSDI.

[32]  Scott Shenker,et al.  Rollback-Recovery for Middleboxes , 2015, Comput. Commun. Rev..

[33]  Amin Vahdat,et al.  xOMB: Extensible Open MiddleBoxes with commodity servers , 2012, 2012 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS).

[34]  Sylvia Ratnasamy,et al.  SoftNIC: A Software NIC to Augment Hardware , 2015 .

[35]  Eunyoung Jeong,et al.  mTCP: a Highly Scalable User-level TCP Stack for Multicore Systems , 2014, NSDI.

[36]  Franck Le,et al.  Dynamic Scaling of Virtualized, Distributed Service Chains: A Case Study of IMS , 2017, IEEE Journal on Selected Areas in Communications.

[37]  Rasmus Pagh,et al.  Cuckoo Hashing , 2001, Encyclopedia of Algorithms.

[38]  Mahadev Konar,et al.  ZooKeeper: Wait-free Coordination for Internet-scale Systems , 2010, USENIX ATC.

[39]  Tarik Taleb,et al.  User mobility-aware Virtual Network Function placement for Virtual 5G Network Infrastructure , 2015, 2015 IEEE International Conference on Communications (ICC).

[40]  Alfred V. Aho,et al.  Efficient string matching , 1975, Commun. ACM.

[41]  Mendel Rosenblum,et al.  Fast crash recovery in RAMCloud , 2011, SOSP.