Virtual Network Allocation for Fault Tolerance with Bandwidth Efficiency in a Multi-tenant Data Center

In a multitenant data center, nodes and links of tenants' virtual networks (VNs) share a single component of the physical substrate network (SN). A failure of the single SN component can thereby cause simultaneous failures of multiple nodes and links in a VN, this complex of failures must significantly disrupt the services offered on the VN. In the present paper, we clarify how the fault tolerance of a VN is affected by a SN failure, especially from the perspective of VN allocation in the SN. We propose a VN allocation model for multitenant data centers and formulate a problem that deals with the bandwidth loss in the VN due the SN failure. We conduct numerical simulations with the setting that has 1.7 × 108 bit/s bandwidth demand on each VN. The results show that the bandwidth loss can be reduced to 5.3 × 102 bit/s per VN, but the required bandwidth between physical servers in the SN increases to 1.0 × 109 bit/s per VN when each node in the VN is mapped to an individual physical server. The balance between the bandwidth loss and the required bandwidth between physical servers can be optimized by assigning every four nodes of the VN to each physical server, meaning that we minimize the bandwidth loss without providing too sufficient bandwidth in the core area of the SN.

[1]  Mohamed Faten Zhani,et al.  On Achieving High Survivability in Virtualized Data Centers , 2014, IEICE Trans. Commun..

[2]  Kishor S. Trivedi,et al.  Availability analysis of blade server systems , 2008, IBM Syst. J..

[3]  Xavier Hesselbach,et al.  Virtual Network Embedding: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[4]  Jian Tang,et al.  Survivable Virtual Infrastructure Mapping in Virtualized Data Centers , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[5]  Lawrence Kreeger,et al.  Virtual eXtensible Local Area Network (VXLAN): A Framework for Overlaying Virtualized Layer 2 Networks over Layer 3 Networks , 2014, RFC.

[6]  Thomas H. Cormen,et al.  Introduction to algorithms [2nd ed.] , 2001 .

[7]  Kaisa Miettinen,et al.  Nonlinear multiobjective optimization , 1998, International series in operations research and management science.

[8]  Lisandro Zambenedetti Granville,et al.  Data Center Network Virtualization: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[9]  Ray Jain,et al.  The art of computer systems performance analysis - techniques for experimental design, measurement, simulation, and modeling , 1991, Wiley professional computing.

[10]  virtualization.info 日本語 論文:VMware vSphere High Availability Deployment Best Practices(20110825-2) , 2011 .

[11]  Navendu Jain,et al.  Understanding network failures in data centers: measurement, analysis, and implications , 2011, SIGCOMM.

[12]  Jin B. Hong,et al.  Availability Modeling and Analysis of a Virtualized System , 2009, 2009 15th IEEE Pacific Rim International Symposium on Dependable Computing.

[13]  M. Tornatore,et al.  Design of Disaster-Resilient Optical Datacenter Networks , 2012, Journal of Lightwave Technology.

[14]  Rahul Potharaju,et al.  When the network crumbles: an empirical study of cloud network failures and their impact on services , 2013, SoCC.

[15]  Paramvir Bahl,et al.  Towards highly reliable enterprise network services via inference of multi-level dependencies , 2007, SIGCOMM.

[16]  Robert M. Hinden,et al.  Virtual Router Redundancy Protocol (VRRP) , 2004, RFC.

[17]  David G. Andersen,et al.  Theoretical Approaches to Node Assignment , 2002 .

[18]  W. H I T E P A P,et al.  Protecting Mission-Critical Workloads with VMware Fault Tolerance , 2009 .

[19]  David A. Maltz,et al.  Surviving failures in bandwidth-constrained datacenters , 2012, CCRV.

[20]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[21]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[22]  Xin Wu,et al.  NetPilot: automating datacenter network failure mitigation , 2012, SIGCOMM '12.

[23]  Masayuki Murata,et al.  Proactive recovery from multiple failures utilizing overlay networking technique , 2013, Telecommun. Syst..

[24]  Biswanath Mukherjee,et al.  Disaster-resilient virtual-network mapping and adaptation in optical networks , 2013, 2013 17th International Conference on Optical Networking Design and Modeling (ONDM).

[25]  Raouf Boutaba,et al.  SVNE: Survivable Virtual Network Embedding Algorithms for Network Virtualization , 2013, IEEE Transactions on Network and Service Management.

[26]  Mohamed Faten Zhani,et al.  Venice: Reliable virtual data center embedding in clouds , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[27]  Acee Lindem,et al.  Virtual Router Redundancy Protocol , 1998, RFC.

[28]  Chunming Qiao,et al.  Cost Efficient Design of Survivable Virtual Infrastructure to Recover from Facility Node Failures , 2011, 2011 IEEE International Conference on Communications (ICC).