Dynamic file bundling for large-scale content distribution

One highly scalable approach to content delivery is to harness the upload bandwidth of the clients. Peer-assisted content delivery systems have been shown to offload servers effectively for popular files, because the request rates of popular content enable the formation of self-sustaining torrents, in which the entire content of the file is available among the peers themselves. For less popular files, however, these systems do less to offload servers. When a long tail of mildly popular content accounts for a high aggregate demand, a large fraction of the file requests must still be handled by servers. In this paper, we present the design, implementation, and evaluation of a dynamic file bundling system, in which peers are asked to download content that they would not otherwise download in order to “inflate” the popularity of less popular files. Our system introduces the idea of a super bundle, which consists of a large catalogue of files. From this catalogue, smaller bundles, each consisting of a small set of files, can be dynamically assigned to individual users. The system can dynamically adjust the number of downloaders of each file, which allows the popularity inflation to be optimized according to current file popularities and the desired tradeoff between download times and server resource usage. The system is evaluated on PlanetLab.
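
The bundle assignment described above can be illustrated with a minimal sketch. This is not the paper's algorithm: the function name `assign_bundle`, the parameters `extra` and `target`, and the deficit-weighted sampling rule are all assumptions made for illustration. The sketch gives each arriving peer the file it requested plus a few extra files from the catalogue, favouring files whose current number of downloaders is furthest below a target swarm size.

```python
import random
from collections import Counter


def assign_bundle(requested, catalogue, downloaders, extra=2, target=5, rng=random):
    """Return the requested file plus up to `extra` inflation files.

    Hypothetical policy (an assumption, not taken from the paper): extra
    files are sampled without replacement, weighted by how far each file's
    current downloader count falls below `target`.
    """
    # Deficit of every other file in the catalogue relative to the target.
    deficits = {
        f: max(target - downloaders[f], 0)
        for f in catalogue
        if f != requested
    }
    # Only files that are below the target are candidates for inflation.
    candidates = [f for f, d in deficits.items() if d > 0]

    bundle = [requested]
    for _ in range(min(extra, len(candidates))):
        weights = [deficits[f] for f in candidates]
        pick = rng.choices(candidates, weights=weights, k=1)[0]
        candidates.remove(pick)
        bundle.append(pick)

    # Record the new downloader for every file in the assigned bundle.
    for f in bundle:
        downloaders[f] += 1
    return bundle
```

Here `downloaders` is expected to be a `collections.Counter` mapping file names to current downloader counts, so files nobody has requested yet count as zero. Raising `extra` or `target` in this sketch inflates unpopular files more aggressively, at the cost of asking each peer to download more; that mirrors, in a simplified form, the tradeoff between download times and server resource usage mentioned in the abstract.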
