A Congestion Control Algorithm for Data Center Area Communications

Ethernet is extending its applications to data-center area communications, i.e. very short-distance links such as CPU-CPU and CPU-I/O interconnections, where short end-to-end (MAC-to-MAC) data transfer delay is the key performance factor. In this paper, we propose an extended Ethernet MAC mechanism that provides end-to-end reliable, congestion-controlled packet transfer while minimizing data transfer delay. The proposed MAC mechanism employs packet retransmission and a delay-based congestion control algorithm with inline initial bandwidth probing. Since inline bandwidth probing with a small number of data packets tends to overestimate the initial bandwidth, we introduce two extensions: two-stage probing and RTT prediction. The RTT prediction mechanism detects congestion several RTTs earlier than detection based on the measured RTT, and thus minimizes queuing delay. The two-stage probing mechanism adaptively starts up data transfers without overflowing the link. Simulation results show that the proposed mechanism achieves roughly 10 times shorter transfer delay than TCP and 2-3 times shorter delay than existing delay-based congestion controls.
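To illustrate why reacting to a predicted RTT can signal congestion earlier than reacting to the measured RTT, the following is a minimal sketch and not the algorithm proposed in the paper. The abstract does not specify the predictor, so the linear extrapolation in `predict_rtt`, the `horizon` and `threshold` parameters, the halving of the window, and the names `DelayBasedSender` and `first_backoff` are all assumptions introduced here for illustration.

```python
from dataclasses import dataclass, field
from typing import List


def predict_rtt(samples: List[float], horizon: int = 3) -> float:
    """Assumed predictor: extrapolate the last two RTT samples `horizon` steps ahead."""
    if len(samples) < 2:
        return samples[-1]
    slope = samples[-1] - samples[-2]
    return samples[-1] + horizon * slope


@dataclass
class DelayBasedSender:
    """Hypothetical delay-based AIMD sender, driven by measured or predicted RTT."""
    base_rtt: float               # smallest RTT seen so far (no-queue estimate)
    threshold: float              # tolerated queuing delay, same unit as the RTT
    cwnd: float = 10.0            # congestion window in packets
    use_prediction: bool = True
    horizon: int = 3
    samples: List[float] = field(default_factory=list)

    def on_rtt_sample(self, rtt: float) -> bool:
        """Update the window for one RTT sample; return True if congestion was signalled."""
        self.samples.append(rtt)
        self.base_rtt = min(self.base_rtt, rtt)
        if self.use_prediction:
            signal = predict_rtt(self.samples, self.horizon)
        else:
            signal = rtt
        congested = (signal - self.base_rtt) > self.threshold
        if congested:
            self.cwnd = max(1.0, self.cwnd / 2)   # multiplicative decrease
        else:
            self.cwnd += 1.0                      # additive increase
        return congested


def first_backoff(sender: DelayBasedSender, rtts: List[float]) -> int:
    """Index of the first sample at which the sender backs off, or -1 if it never does."""
    for i, rtt in enumerate(rtts):
        if sender.on_rtt_sample(rtt):
            return i
    return -1


if __name__ == "__main__":
    # Synthetic RTT trace in microseconds: a queue starts building at sample 2.
    rtts = [100.0, 100.0, 102.0, 104.0, 106.0, 108.0, 110.0, 112.0]
    predicted = DelayBasedSender(base_rtt=100.0, threshold=10.0)
    measured = DelayBasedSender(base_rtt=100.0, threshold=10.0, use_prediction=False)
    print("predicted RTT backs off at sample", first_backoff(predicted, rtts))
    print("measured RTT backs off at sample", first_backoff(measured, rtts))
```

On this synthetic trace the prediction-driven sender backs off at sample 4, while the sender using the measured RTT waits until sample 7, by which point more queue has built up. The trace and the size of the gap are illustrative only and say nothing about the paper's simulation results.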
