An Optimized Video-on-Demand System: Theory, Design and Implementation

Internet on-demand video traffic has seen explosive growth in recent years. This brings a number of challenges in deploying video-on-demand services at large scale. The first challenge has to do with the enormity of the video catalog size. Traditionally, content providers replicate the entire video library in different locations, but this is wasteful and non-scalable as the video catalog size expands. The second challenge comes with the increase in video quality, which calls for efficient utilization of the scarce network bandwidth resources that continue to be economically expensive to expand. The emergence of different device modalities, including smart-phones, high-definition and 3D TV, tablets and etc poses another challenge in designing efficient systems and algorithms that cater to all device characteristics and user needs.In this dissertation, we aim to present a general approach to designing, optimizing and architecting a video-on-demand system. Our approach considers the practical constraints of disk space, network link bandwidth, and node connection degree bound. In general, the joint optimization problem is combinatorially difficult. To tackle this, we first design a simple fractional storage architecture, which uses a class of regeneration codes that fluidifies the content, thereby enabling a distributed content placement and link rate allocation algorithm. We show that by storing only a fractional of the entire catalog everywhere, the system is able to fully support user demand at large scale. Second, we develop a Markov approximation technique to solve the problem of topology selection under node degree bound using a simple distributed algorithm. We prove that our algorithm achieves close-to-optimal solution, which we verify using extensive realworld trace simulations.On the system side, we show extensive results to test the algorithm's scalability and robustness to changes in user dynamics and demand patterns. We show that our solution achieves high utilization of cache nodes storage and bandwidth resources, and automatically learns and caches the video according to the demand patterns. We observe that there exists a complex interplay between disk space, network bandwidth and node degree bound. We also present guidelines to important practical design choices including caching update intervals, demand prediction and provisioning. We also demonstrate the feasibility and efficiency of our design choice by building and experimenting a prototype system at Berkeley.

[1]  Nikolaos Laoutaris,et al.  Responsible Editor: J. Misic , 2004 .

[2]  Ion Stoica,et al.  Robust incentive techniques for peer-to-peer networks , 2004, EC '04.

[3]  Biplab Sikdar,et al.  Modeling seed scheduling strategies in BitTorrent , 2007 .

[4]  Cheng-Zhong Xu,et al.  Efficient algorithms of video replication and placement on a cluster of streaming servers , 2007, J. Netw. Comput. Appl..

[5]  Rudolf Ahlswede,et al.  Network information flow , 2000, IEEE Trans. Inf. Theory.

[6]  P. Diaconis,et al.  Geometric Bounds for Eigenvalues of Markov Chains , 1991 .

[7]  Minghua Chen,et al.  Reverse-engineering BitTorrent: A Markov approximation perspective , 2012, 2012 Proceedings IEEE INFOCOM.

[8]  Bobby Bhattacharjee,et al.  Bittorrent is an auction: analyzing and improving bittorrent's incentives , 2008, SIGCOMM '08.

[9]  Cheng-Zhong Xu,et al.  Optimal video replication and placement on a cluster of video-on-demand servers , 2002, Proceedings International Conference on Parallel Processing.

[10]  Kannan Ramchandran,et al.  Fractional repetition codes for repair in distributed storage systems , 2010, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[11]  Pablo Rodriguez,et al.  I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system , 2007, IMC '07.

[12]  Minghua Chen,et al.  Optimal Distributed P2P Streaming Under Node Degree Bounds , 2010, IEEE/ACM Transactions on Networking.

[13]  Laurent Massoulié,et al.  Greening the internet with nano data centers , 2009, CoNEXT '09.

[14]  Baochun Li,et al.  Keep Cache Replacement Simple in Peer-Assisted VoD Systems , 2009, IEEE INFOCOM 2009.

[15]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[16]  Mikel Izal,et al.  Dissecting BitTorrent: Five Months in a Torrent's Lifetime , 2004, PAM.

[17]  Rayadurgam Srikant,et al.  The Mathematics of Internet Congestion Control , 2003 .

[18]  Laurent Massoulié,et al.  Brief announcement: adaptive content placement for peer-to-peer video-on-demand systems , 2010, PODC '10.

[19]  S. Kulkarni,et al.  Bandwidth efficient video-on-demand algorithm (BEVA) , 2003, 10th International Conference on Telecommunications, 2003. ICT 2003..

[20]  B. Levine,et al.  Exploring the Use of BitTorrent as the Basis for a Large Trace Repository , 2004 .

[21]  V. Aggarwal,et al.  Improving user and ISP experience through ISP-aided P2P locality , 2008, IEEE INFOCOM Workshops 2008.

[22]  Bin Fan,et al.  The Delicate Tradeoffs in BitTorrent-like File Sharing Protocol Design , 2006, Proceedings of the 2006 IEEE International Conference on Network Protocols.

[23]  Michael Luby,et al.  A digital fountain approach to reliable distribution of bulk data , 1998, SIGCOMM '98.

[24]  Kwok-Tung Lo,et al.  Video-on-Demand Systems With Cooperative Clients in Multicast Environment , 2009, IEEE Trans. Circuits Syst. Video Technol..

[25]  Rayadurgam Srikant,et al.  Modeling and performance analysis of BitTorrent-like peer-to-peer networks , 2004, SIGCOMM 2004.

[26]  Kannan Ramchandran,et al.  DRESS codes for the storage cloud: Simple randomized constructions , 2011, 2011 IEEE International Symposium on Information Theory Proceedings.

[27]  Nazareno Andrade,et al.  Influences on cooperation in BitTorrent communities , 2005, P2PECON '05.

[28]  Minghua Chen,et al.  Markov Approximation for Combinatorial Network Optimization , 2010, IEEE Transactions on Information Theory.

[29]  Keith W. Ross,et al.  A Measurement Study of a Large-Scale P2P IPTV System , 2007, IEEE Transactions on Multimedia.

[30]  Eddie Kohler,et al.  Clustering and sharing incentives in BitTorrent systems , 2006, SIGMETRICS '07.

[31]  Stephen J. Wright,et al.  Minimizing delivery cost in scalable streaming content distribution systems , 2004, IEEE Transactions on Multimedia.

[32]  Minghong Lin,et al.  Stochastic analysis of file-swarming systems , 2007, Perform. Evaluation.

[33]  Diego Perino,et al.  Achievable catalog size in peer-to-peer video-on-demand systems , 2008, IPTPS.

[34]  Nikolaos Laoutaris,et al.  Uplink allocation beyond choke/unchoke: or how to divide and conquer best , 2008, CoNEXT '08.

[35]  J. P. Lasalle Some Extensions of Liapunov's Second Method , 1960 .

[36]  Ness B. Shroff,et al.  Utility maximization for communication networks with multipath routing , 2006, IEEE Transactions on Automatic Control.

[37]  Siddhartha Annapureddy,et al.  Providing Video-on-Demand using Peer-to-Peer Networks , 2006 .

[38]  Li Xiao,et al.  Location-aware topology matching in P2P systems , 2004, IEEE INFOCOM 2004.

[39]  Cheng Huang,et al.  Challenges, design and analysis of a large-scale p2p-vod system , 2008, SIGCOMM '08.

[40]  Alexandros G. Dimakis,et al.  Network Coding for Distributed Storage Systems , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[41]  Sem C. Borst,et al.  Distributed Caching Algorithms for Content Distribution Networks , 2010, 2010 Proceedings IEEE INFOCOM.

[42]  Chuan Wu,et al.  UUSee: Large-Scale Operational On-Demand Streaming with Random Network Coding , 2010, 2010 Proceedings IEEE INFOCOM.

[43]  Mustaque Ahamad,et al.  Incentives in BitTorrent induce free riding , 2005, P2PECON '05.

[44]  Johan A. Pouwelse,et al.  The Bittorrent P2P File-Sharing System: Measurements and Analysis , 2005, IPTPS.

[45]  Cheng Huang,et al.  On ISP-friendly rate allocation for peer-assisted VoD , 2008, ACM Multimedia.

[46]  Tracey Ho,et al.  A Random Linear Network Coding Approach to Multicast , 2006, IEEE Transactions on Information Theory.

[47]  Nihar B. Shah,et al.  Enabling node repair in any erasure code for distributed storage , 2010, 2011 IEEE International Symposium on Information Theory Proceedings.

[48]  Soung Chang Liew,et al.  Back-of-the-Envelope Computation of Throughput Distributions in CSMA Wireless Networks , 2007, 2009 IEEE International Conference on Communications.

[49]  Guillaume Urvoy-Keller,et al.  Rarest first and choke algorithms are enough , 2006, IMC '06.

[50]  Cheng Huang,et al.  Can internet video-on-demand be profitable? , 2007, SIGCOMM '07.

[51]  Arun Venkataramani,et al.  Do Incentives Build Robustness in BitTorrent? (Awarded Best Student Paper) , 2007, NSDI.

[52]  Laurent Massoulié,et al.  Coupon replication systems , 2008, TNET.

[53]  Donald F. Towsley,et al.  A Network Formation Game Approach to Study BitTorrent Tit-for-Tat , 2007, NET-COOP.

[54]  Peter Sanders,et al.  Polynomial time algorithms for multicast network code construction , 2005, IEEE Transactions on Information Theory.

[55]  B. A. Sevast'yanov An Ergodic Theorem for Markov Processes and Its Application to Telephone Systems with Refusals , 1957 .