Planet scale software updates

Fast and effective distribution of software updates (a.k.a. patches) to millions of Internet users has evolved into a critical task over the last years. In this paper, we characterize "Windows Update", one of the largest update services in the world, with the aim to draw general guidelines on how to best design and architect a fast and effective planet-scale patch dissemination system. To this end, we analyze an extensive set of data traces collected over the period of a year, consisting of billions of queries from over 300 million computers. Based on empirical observations and analytical results, we identify interesting properties of today's update traffic and user behavior.Building on this analysis, we consider alternative patch delivery strategies such as caching and peer-to-peer and evaluate their performance. We identify key factors that determine the effectiveness of these schemes in reducing the server workload and the network traffic, and in speeding-up the patch delivery. Most of our findings are invariant properties induced by either user behavior or architectural characteristics of today's Internet, and thus apply to the general problem of Internet-wide dissemination of software updates.

[1]  Christopher Hemmerich Automating Request-based Software Distribution , 2000, LISA.

[2]  Saikat Guha,et al.  Characterization and measurement of TCP traversal through NATs and firewalls , 2005, IMC '05.

[3]  Venkata N. Padmanabhan,et al.  The content and access dynamics of a busy web site: findings and implicatins , 2000, SIGCOMM.

[4]  David Ressman,et al.  Use of Cfengine for Automated, Multi-Platform Software and Patch Distribution , 2000, LISA.

[5]  David Mazières,et al.  A low-bandwidth network file system , 2001, SOSP.

[6]  Petr Tuma,et al.  SOFAnet: middleware for software distribution over Internet , 2005, The 2005 Symposium on Applications and the Internet.

[7]  Noga Alon,et al.  The Probabilistic Method , 2015, Fundamentals of Ramsey Theory.

[8]  Christos Gkantsidis,et al.  Anatomy of a P2P Content Distribution system with Network Coding , 2006, IPTPS.

[9]  Helen J. Wang,et al.  Automatic Misconfiguration Troubleshooting with PeerPressure , 2004, OSDI.

[10]  Shan Lu,et al.  Analyzing persistent state interactions to improve state management , 2006, SIGMETRICS '06/Performance '06.

[11]  Marvin Theimer,et al.  Feasibility of a serverless distributed file system deployed on an existing set of desktop PCs , 2000, SIGMETRICS '00.

[12]  Pablo Rodriguez,et al.  Should internet service providers fear peer-assisted content distribution? , 2005, IMC '05.

[13]  Bruce M. Maggs,et al.  The feasibility of supporting large-scale live streaming applications with dynamic application end-points , 2004, SIGCOMM 2004.

[14]  Michael E. Shaddock,et al.  How to Upgrade 1500 Workstations on Saturday, and Still Have Time to Mow the Yard on Sunday , 1995, LISA.

[15]  John Dunagan,et al.  Towards a self-managing software patching process using black-box persistent-state manifests , 2004 .

[16]  Shamkant B. Navathe,et al.  Grouping techniques for update propagation in intermittently connected databases , 1998, Proceedings 14th International Conference on Data Engineering.

[17]  Peter W. Osel,et al.  OpenDist: Incremental Software Distribution , 1995, LISA.