Detection and Mitigation of Congestion in SDN Enabled Data Center Networks: A Survey

Large-scale data centers play a vital role in supporting the ever growing demand for computation and storage. These data centers often employ commodity hardware switches with shallow buffers arranged in a multi-rooted tree topology. Many-to-one traffic communication pattern is often observed in a data center whereby many servers simultaneously send requested data to an aggregation server. This may result in excessive congestion on the bottleneck switch, overwhelming the shallow switch buffer and resulting in large number of packet drops. Legacy transmission control protocol (TCP) treats a packet drop as an indication of severe congestion in the path and forces the sender to drastically reduce its sending rate. This leads to the phenomena of TCP incast resulting in severe throughput collapse deteriorating the performance of the data center. Many solutions have been proposed in literature that addresses the TCP incast issue by either tweaking TCP parameters or proposing a modified/custom version of TCP to be used in data centers. Recently, software-defined networking (SDN) has gained the attention of researchers due to the ease of centralized control and programmability of the network. The centralized nature of SDN presents an opportunity to address the TCP incast issue in data center networks in an efficient manner. This paper provides a snapshot of research efforts that leverage the capabilities of SDN to mitigate the effect of congestion in data centers. We provide a taxonomy of the existing solutions, discuss complementary concepts and highlight their relative strengths along with their shortcomings.

[1]  Prathima Agrawal,et al.  A Probabilistic Approach to Address TCP Incast in Data Center Networks , 2011, 2011 31st International Conference on Distributed Computing Systems Workshops.

[2]  Yifei Lu,et al.  SED: An SDN-Based Explicit-Deadline-Aware TCP for Cloud Data Center Networks , 2016 .

[3]  Brahim Bensaou,et al.  Reconciling mice and elephants in data center networks , 2015, 2015 IEEE 4th International Conference on Cloud Networking (CloudNet).

[4]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[5]  Monia Ghobadi,et al.  Rethinking end-to-end congestion control in software-defined networks , 2012, HotNets-XI.

[6]  Kyuho Son,et al.  A congestion avoidance algorithm in SDN environment , 2016, 2016 International Conference on Information Networking (ICOIN).

[7]  Christian E. Hopps,et al.  Analysis of an Equal-Cost Multi-Path Algorithm , 2000, RFC.

[8]  Ming Zhang,et al.  Understanding data center traffic characteristics , 2010, CCRV.

[9]  Albert G. Greenberg,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM '10.

[10]  Hyun-Wook Jin,et al.  Scalable Congestion Control Protocol Based on SDN in Data Center Networks , 2014, 2015 IEEE Global Communications Conference (GLOBECOM).

[11]  Sujata Banerjee,et al.  DevoFlow: scaling flow management for high-performance networks , 2011, SIGCOMM.

[12]  秋好 一平,et al.  Open Networking Foundation(ONF)の標準化動向 , 2014 .

[13]  Brahim Bensaou,et al.  SDN-based Incast Congestion Control Framework for Data Centers: Implementation and Evaluation , 2016 .

[14]  Khin Mi Mi Aung,et al.  SDN Controlled Local Re-routing to Reduce Congestion in Cloud Data Center , 2015, 2015 International Conference on Cloud Computing Research and Innovation (ICCCRI).

[15]  David A. Maltz,et al.  Network traffic characteristics of data centers in the wild , 2010, IMC '10.

[16]  Glenn Judd,et al.  Attaining the Promise and Avoiding the Pitfalls of TCP in the Datacenter , 2015, NSDI.

[17]  Daniel S. Marcon,et al.  Achieving minimum bandwidth guarantees and work-conservation in large-scale, SDN-based datacenter networks , 2017, Comput. Networks.

[18]  Srinivasan Seshan,et al.  Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems , 2008, FAST.

[19]  Amin Vahdat,et al.  Hedera: Dynamic Flow Scheduling for Data Center Networks , 2010, NSDI.

[20]  Jun Li,et al.  A survey on TCP Incast in data center networks , 2014, Int. J. Commun. Syst..

[21]  Albert G. Greenberg,et al.  VL2: a scalable and flexible data center network , 2009, SIGCOMM '09.

[22]  Amar Phanishayee,et al.  Safe and effective fine-grained TCP retransmissions for datacenter communication , 2009, SIGCOMM '09.

[23]  Junda Liu,et al.  Multi-enterprise networking , 2000 .

[24]  Nick McKeown,et al.  OpenFlow: enabling innovation in campus networks , 2008, CCRV.

[25]  Prasanthi Sreekumari,et al.  Transport protocols for data center networks: a survey of issues, solutions and challenges , 2015, Photonic Network Communications.

[26]  Michael Karl,et al.  Network supported congestion avoidance in software-defined networks , 2013, 2013 19th IEEE International Conference on Networks (ICON).

[27]  Yukai Yang,et al.  Staggered flows: An application layer's way to avoid incast problem , 2012, 2012 IEEE Asia Pacific Cloud Computing Congress (APCloudCC).

[28]  Rong Pan,et al.  Data center transport mechanisms: Congestion control theory and IEEE standardization , 2008, 2008 46th Annual Allerton Conference on Communication, Control, and Computing.

[29]  Behzad Akbari,et al.  Congestion control in software defined data center networks through flow rerouting , 2015, 2015 23rd Iranian Conference on Electrical Engineering.

[30]  Ling Tang,et al.  SDTCP: Towards Datacenter TCP Congestion Control with SDN for IoT Applications , 2017, Sensors.

[31]  Ramana Rao Kompella,et al.  The TCP Outcast Problem: Exposing Unfairness in Data Center Networks , 2012, NSDI.

[32]  Ihsan Ayyub Qazi,et al.  eSDN: Rethinking Datacenter Transports Using End-Host SDN Controllers , 2015, SIGCOMM.

[33]  David Malone,et al.  A lossless switch for data acquisition networks , 2015, 2015 IEEE 40th Conference on Local Computer Networks (LCN).

[34]  Gunjan Tank,et al.  Software-Defined Networking-The New Norm for Networks , 2012 .

[35]  Albert G. Greenberg,et al.  The nature of data center traffic: measurements & analysis , 2009, IMC '09.

[36]  Colin Perkins,et al.  OTCP: SDN-managed congestion control for data center networks , 2016, NOMS 2016 - 2016 IEEE/IFIP Network Operations and Management Symposium.

[37]  Tao Yang,et al.  The Panasas ActiveScale Storage Cluster - Delivering Scalable High Bandwidth Storage , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[38]  Hong Liu,et al.  Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network , 2015, Comput. Commun. Rev..

[39]  Haitao Wu,et al.  ICTCP: Incast Congestion Control for TCP in Data-Center Networks , 2010, IEEE/ACM Transactions on Networking.

[40]  Deng Pan,et al.  OpenFlow based Load Balancing for Fat-Tree Networks with Multipath Support , 2013 .

[41]  Jean C. Walrand,et al.  Knowledge-Defined Networking: Modelització de la xarxa a través de l’aprenentatge automàtic i la inferència , 2016 .