zFabric: How to virtualize lossless ethernet?

Converged Enhanced Ethernet (CEE) is a crucial step in embracing storage, cluster, and high-performance computing fabrics under a common network. However, the adoption of lossless CEE in virtualized clusters is hindered by the lack of network hypervisor software that addresses the major issues of losslessness, i.e., head-of-line blocking and saturation trees. Our objective is to design a hypervisor that prevents miscon-figured or malicious virtual machines from filling the lossless network with stalled packets, thus compromising tenant isolation. Furthermore, we observe that current hypervisors perform compulsory isolation, management, and mobility functions, but introduce new bottlenecks on the data-path. By taking advantage of the lossless fabric, we deconstruct the existing virtualized networking stack into its core functions and consolidate them into zFabric, an efficient hypervisor that meets our aforementioned goals. To demonstrate zFabric's benefits, we evaluate a prototype implementation on a datacenter testbed. Besides resolving head-of-line blocking, zFabric improves throughputs for long flows by up to 56%, lowers CPU utilization by up to 63%, and shortens completion times by up to 7x for partition-aggregate queries when compared with current virtualized TCP stacks.

[1]  Srinivasan Seshan,et al.  Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems , 2008, FAST.

[2]  Manolis Katevenis,et al.  Scheduling in Non-Blocking Buffered Three-Stage Switching Fabrics , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[3]  Robert Birke,et al.  Got loss? Get zOVN! , 2013, SIGCOMM.

[4]  Jeffrey C. Mogul,et al.  NetLord: a scalable multi-tenant network architecture for virtualized datacenters , 2011, SIGCOMM.

[5]  Winston Khoon Guan Seah,et al.  Avoiding congestion collapse on the Internet using TCP tunnels , 2002, Comput. Networks.

[6]  Cho-Li Wang,et al.  PVTCP: Towards practical and effective congestion control in virtualized datacenters , 2013, 2013 21st IEEE International Conference on Network Protocols (ICNP).

[7]  Dinan Gunawardena,et al.  Chatty Tenants and the Cloud Network Sharing Problem , 2013, NSDI.

[8]  Helen J. Wang,et al.  SecondNet: a data center network virtualization architecture with bandwidth guarantees , 2010, CoNEXT.

[9]  I. Stoica,et al.  FairCloud: sharing the network in cloud computing , 2011, CCRV.

[10]  Benny Rochwerger,et al.  An intent-based approach for network virtualization , 2013, 2013 IFIP/IEEE International Symposium on Integrated Network Management (IM 2013).

[11]  A. L. Narasimha Reddy,et al.  Performance of Quantized Congestion Notification in TCP Incast Scenarios of Data Centers , 2010, 2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[12]  Lawrence Kreeger,et al.  Virtual eXtensible Local Area Network (VXLAN): A Framework for Overlaying Virtualized Layer 2 Networks over Layer 3 Networks , 2014, RFC.

[13]  Albert G. Greenberg,et al.  Sharing the Data Center Network , 2011, NSDI.

[14]  Albert G. Greenberg,et al.  EyeQ: Practical Network Performance Isolation at the Edge , 2013, NSDI.

[15]  Antony I. T. Rowstron,et al.  Better never than late: meeting deadlines in datacenter networks , 2011, SIGCOMM.

[16]  George Varghese,et al.  Netshare and stochastic netshare: predictable bandwidth allocation for data centers , 2012, CCRV.

[17]  Junda Liu,et al.  Multi-enterprise networking , 2000 .

[18]  A. Rowstron,et al.  Towards predictable datacenter networks , 2011, SIGCOMM.

[19]  Pankaj Garg,et al.  NVGRE: Network Virtualization Using Generic Routing Encapsulation , 2015, RFC.

[20]  Prashanth Pappu,et al.  Distributed queueing in scalable high performance routers , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[21]  David A. Maltz,et al.  DCTCP: Efficient Packet Transport for the Commoditized Data Center , 2010 .

[22]  Injong Rhee,et al.  CUBIC: a new TCP-friendly high-speed TCP variant , 2008, OPSR.