I Sent It: Where Does Slow Data Go to Wait?

Emerging applications like virtual reality (VR), augmented reality (AR), and 360-degree video aim to exploit the unprecedentedly low latencies promised by technologies like the tactile Internet and mobile 5G networks. Yet these promises are still unrealized. In order to fulfill them, it is crucial to understand where packet delays happen, which impacts protocol performance such as throughput and latency. In this work, we empirically find that sender-side protocol stack delays can cause high end-to-end latencies, though existing solutions primarily address network delays. Unfortunately, however, current latency diagnosis tools cannot even distinguish between delays on network links and delays in the end hosts. To close this gap, we present ELEMENT, a latency diagnosis framework that decomposes end-to-end TCP latency into endhost and network delays, without requiring admin privileges at the sender or receiver. We validate that ELEMENT achieves more than 90% accuracy in delay estimation compared to the ground truth in different production networks. To demonstrate ELEMENT's potential impact on real-world applications, we implement a relatively simple user-level library that uses ELEMENT to minimize delays. For evaluation, we integrate ELEMENT with legacy TCP applications and show that it can reduce latency by up to 10 times while maintaining throughput and fairness. We finally demonstrate that ELEMENT can significantly reduce the latency of a virtual reality application that needs extremely low latencies and high throughput.

[1]  Markku Kojo,et al.  Evaluating CoDel, PIE, and HRED AQM techniques with load transients , 2014, 39th Annual IEEE Conference on Local Computer Networks.

[2]  Jennifer Rexford,et al.  LatLong: Diagnosing Wide-Area Latency Changes for CDNs , 2012, IEEE Transactions on Network and Service Management.

[3]  Michael Welzl,et al.  The new AQM kids on the block: An experimental evaluation of CoDel and PIE , 2014, 2014 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[4]  Minlan Yu,et al.  Identifying performance bottlenecks in CDNs through TCP-level monitoring , 2011, W-MUST '11.

[5]  Steven K. Feiner,et al.  Combating VR sickness through subtle dynamic field-of-view modification , 2016, 2016 IEEE Symposium on 3D User Interfaces (3DUI).

[6]  Thomas F. Wenisch,et al.  The Mystery Machine: End-to-end Performance Analysis of Large-scale Internet Services , 2014, OSDI.

[7]  Klara Nahrstedt,et al.  An XML-based Quality of Service Enabling Language for the Web , 2002, J. Vis. Lang. Comput..

[8]  Israel Cidon,et al.  One-way delay estimation using network-wide measurements , 2006, IEEE Transactions on Information Theory.

[9]  Behnaz Arzani,et al.  Taking the Blame Game out of Data Centers Operations with NetPoirot , 2016, SIGCOMM.

[10]  H. S. Varsha,et al.  The tactile Internet , 2017, 2017 International Conference on Innovative Mechanisms for Industry Applications (ICIMIA).

[11]  Stephen D. Strowes Passively measuring TCP round-trip times , 2013, Commun. ACM.

[12]  Miguel A. Labrador,et al.  On the applicability of available bandwidth estimation techniques and tools , 2010, Comput. Commun..

[13]  Marius A. Eriksen,et al.  Trickle: A Userland Bandwidth Shaper for UNIX-like Systems , 2005, USENIX Annual Technical Conference, FREENIX Track.

[14]  M. Frans Kaashoek,et al.  A measurement study of available bandwidth estimation tools , 2003, IMC '03.

[15]  Matt Mathis,et al.  TCP Extended Statistics MIB , 2007, RFC.

[16]  Vijay Subramanian,et al.  PIE: A lightweight control scheme to address the bufferbloat problem , 2013, 2013 IEEE 14th International Conference on High Performance Switching and Routing (HPSR).

[17]  Godred Fairhurst,et al.  Unicast UDP Usage Guidelines for Application Designers , 2008, RFC.

[18]  Ashvin Goel,et al.  Low-latency adaptive streaming over tcp , 2008, TOMCCAP.

[19]  Van Jacobson,et al.  BBR: Congestion-Based Congestion Control , 2016, ACM Queue.

[20]  Vivek S. Pai,et al.  ModNet: A Modular Approach to Network Stack Extension , 2015, NSDI.

[21]  Hari Balakrishnan,et al.  Stochastic Forecasts Achieve High Throughput and Low Delay over Cellular Networks , 2013, NSDI.

[22]  Toke Høiland-Jørgensen,et al.  The Flow Queue CoDel Packet Scheduler and Active Queue Management Algorithm , 2018, RFC.

[23]  Emmanuel Lochin,et al.  Revisiting old friends: is CoDel really achieving what RED cannot? , 2014, CSWS@SIGCOMM.

[24]  kc claffy,et al.  Bandwidth estimation: metrics, measurement techniques, and tools , 2003, IEEE Netw..

[25]  Cheng Jin,et al.  FAST TCP: Motivation, Architecture, Algorithms, Performance , 2006, IEEE/ACM Transactions on Networking.

[26]  Shunzheng Yu,et al.  One-way queuing delay measurement and its application on detecting DDoS attack , 2009, J. Netw. Comput. Appl..

[27]  Mung Chiang,et al.  Fog Networking: An Overview on Research Opportunities , 2016, ArXiv.

[28]  Van Jacobson,et al.  Controlling Queue Delay , 2012, ACM Queue.

[29]  Partha Kanuparthy,et al.  YTrace: End-to-end Performance Diagnosis in Large Content Providers , 2016, ArXiv.

[30]  Teruo Higashino,et al.  Edge-centric Computing: Vision and Challenges , 2015, CCRV.

[31]  S Matilda.,et al.  Bufferbloat Mitigation for Real-time Video Streaming using Adaptive Controlled Delay Mechanism , 2013 .

[32]  Guido Appenzeller,et al.  Optimization of Video Streaming through Bufferbloat Mitigation using Hybrid FQ-CoDel Algorithm , 2016 .

[33]  Timothy Roscoe,et al.  Script-driven Packet Marking for Quality of Service Support in Legacy Applications , 1999 .

[34]  Feng Qian,et al.  Understanding On-device Bufferbloat for Cellular Upload , 2016, Internet Measurement Conference.

[35]  Jean-Charles Grégoire,et al.  Accurate One-Way Delay Estimation: Limitations and Improvements , 2012, IEEE Transactions on Instrumentation and Measurement.

[36]  Larry L. Peterson,et al.  TCP Vegas: End to End Congestion Avoidance on a Global Internet , 1995, IEEE J. Sel. Areas Commun..

[37]  Injong Rhee,et al.  DRWA: A Receiver-Centric Solution to Bufferbloat in Cellular Networks , 2016, IEEE Transactions on Mobile Computing.

[38]  B. Annappa,et al.  On the Effectiveness of CoDel for Active Queue Management , 2013, 2013 Third International Conference on Advanced Computing and Communication Technologies (ACCT).

[39]  Paramvir Bahl,et al.  The Case for VM-Based Cloudlets in Mobile Computing , 2009, IEEE Pervasive Computing.

[40]  Injong Rhee,et al.  CUBIC: a new TCP-friendly high-speed TCP variant , 2008, OPSR.

[41]  Paul Rad,et al.  Chameleon: A Scalable Production Testbed for Computer Science Research , 2019, Contemporary High Performance Computing.

[42]  Lakshminarayanan Subramanian,et al.  Adaptive Congestion Control for Unpredictable Cellular Networks , 2015, Comput. Commun. Rev..