Deadline Is Not Enough: Importance-Aware Transmission Control Protocol for Server-Centric Data Centers

An important challenge for modern data centers is to provide users with low-latency, high-quality, interactive services. To improve effective application throughput, much prior work has taken application deadlines into account when designing the network flow schedule for transmitting queried data. This raises a critical question: does higher application-level throughput necessarily translate into better quality of service for users? We note that query responses typically comprise a set of semantically related documents (within a flow), and that while some documents are highly related to the query, others are not. This motivates us to consider the degree of a document’s semantic relatedness in the design of an efficient data transmission scheme for data centers. To understand the performance upper bound, we derive a mixed integer linear programming (MILP) model for this problem, which can be regarded as a network subflow scheduling problem and is hence difficult to solve directly. We therefore propose the importance-aware transmission control protocol (ITCP), a distributed, event-driven, rate-based delivery control protocol for server-centric data center networks. ITCP jointly considers subflow importance, size, and deadline, as well as the characteristics of the underlying infrastructure, to maximize the goodput of the most relevant responses to a query. Simulations on both real and synthetic data show that ITCP outperforms recent works in terms of the mean average precision of the top-$k$ data and the sum of application-level importance.
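The abstract reports results as mean average precision over the top $k$ data but does not define the metric here. As a rough illustration only, the sketch below uses the common information-retrieval definition, assuming each query yields a ranked list of boolean relevance flags and that average precision is normalised by the number of relevant items found in the top $k$. The function names and the choice of normalisation are assumptions for illustration, not the paper's evaluation code.

```python
from typing import Sequence


def average_precision_at_k(relevant: Sequence[bool], k: int) -> float:
    """Average precision over the top-k entries of one ranked result list.

    relevant[i] is True when the item at rank i+1 counts as relevant
    (hypothetical flag; how relevance is decided is outside this sketch).
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_relevant in enumerate(relevant[:k], start=1):
        if is_relevant:
            hits += 1
            # Precision at this rank = relevant items seen so far / rank.
            precision_sum += hits / rank
    # Assumed convention: normalise by the number of hits within the top k.
    return precision_sum / hits if hits else 0.0


def mean_average_precision_at_k(queries: Sequence[Sequence[bool]], k: int) -> float:
    """Mean of the per-query average precision values; 0.0 for no queries."""
    if not queries:
        return 0.0
    return sum(average_precision_at_k(q, k) for q in queries) / len(queries)


if __name__ == "__main__":
    # Two toy queries with made-up relevance flags.
    ranked = [[True, False, True], [False, True]]
    print(mean_average_precision_at_k(ranked, k=3))
```

Under this convention a list that places relevant items earlier scores higher, which is why the metric rewards delivering the most important documents first.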
