Server-assisted Latency Management for Wide-area Distributed Systems

Recently many Internet services employ wide-area platforms to improve the end-user experience in the WAN. To maintain close control over their remote nodes, the wide-area systems require low-latency dissemination of new updates for system configurations, customer requirements, and task lists at runtime. However, we observe that existing data transfer systems focus on resource efficiency for open client populations, rather than focusing on completion latency for a known set of nodes. In examining this problem, we find that optimizing for latency produces strategies radically different from existing systems, and can dramatically reduce latency across a wide range of scenarios. This paper presents a latency-sensitive file transfer system, Lsync that can be used as synchronization building block for wide-area systems where latency matters. Lsync performs novel node selection, scheduling, and adaptive policy switching that dynamically chooses the best strategy using information available at runtime. Our evaluation results from a PlanetLab deployment show that Lsync outperforms a wide variety of data transfer systems and achieves significantly higher synchronization ratio even under frequent file updates.

[1]  Marius A. Eriksen,et al.  Trickle: A Userland Bandwidth Shaper for UNIX-like Systems , 2005, USENIX Annual Technical Conference, FREENIX Track.

[2]  David Mazières,et al.  Democratizing Content Publication with Coral , 2004, NSDI.

[3]  William E. Weihl,et al.  Edgecomputing: extending enterprise applications to the edge of the internet , 2004, WWW Alt. '04.

[4]  Joel Wein,et al.  ACMS: the Akamai configuration management system , 2005, NSDI.

[5]  Miguel Castro,et al.  SplitStream: high-bandwidth multicast in cooperative environments , 2003, SOSP '03.

[6]  Nalini Venkatasubramanian,et al.  CREW: A Gossip-based Flash-Dissemination System , 2006, 26th IEEE International Conference on Distributed Computing Systems (ICDCS'06).

[7]  Himabindu Pucha,et al.  Adaptive File Transfers for Diverse Environments , 2008, USENIX Annual Technical Conference.

[8]  Ramesh K. Sitaraman,et al.  The Akamai network: a platform for high-performance internet applications , 2010, OPSR.

[9]  Anne-Marie Kermarrec,et al.  Lightweight probabilistic broadcast , 2003, TOCS.

[10]  KyoungSoo Park,et al.  Scale and Performance in the CoBlitz Large-File Distribution Service , 2006, NSDI.

[11]  Amin Vahdat,et al.  Bullet: high bandwidth data dissemination using an overlay mesh , 2003, SOSP '03.

[12]  Albert G. Greenberg,et al.  Reining in the Outliers in Map-Reduce Clusters using Mantri , 2010, OSDI.

[13]  Nicolas Bonichon,et al.  Scheduling divisibleworkloads on heterogeneous platforms under bounded multi-port model , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[14]  Arun Venkataramani,et al.  Do incentives build robustness in bit torrent , 2007 .

[15]  Mor Harchol-Balter,et al.  Analysis of SRPT scheduling: investigating unfairness , 2001, SIGMETRICS '01.

[16]  Andrew Tridgell,et al.  Efficient Algorithms for Sorting and Synchronization , 1999 .

[17]  Emin Gün Sirer,et al.  A content propagation metric for efficient content distribution , 2011, SIGCOMM.

[18]  Amin Vahdat,et al.  Loose Synchronization for Large-Scale Networked Systems , 2006, USENIX Annual Technical Conference, General Track.

[19]  Kenneth P. Birman,et al.  Bimodal multicast , 1999, TOCS.