Serving content with unknown demand: the high-dimensional regime

In this paper we look at content placement in the high-dimensional regime: there are n servers, and O(n) distinct types of content. Each server can store and serve O(1) types at any given time. Demands for these content types arrive, and have to be served in an online fashion; over time, there are a total of O(n) of these demands. We consider the algorithmic task of content placement: determining which types of content should be on which server at any given time, in the setting where the demand statistics (i.e. the relative popularity of each type of content) are not known a-priori, but have to be inferred from the very demands we are trying to satisfy. This is the high-dimensional regime because this scaling (everything being O(n)) prevents consistent estimation of demand statistics; it models many modern settings where large numbers of users, servers and videos/webpages interact in this way. We characterize the performance of any scheme that separates learning and placement (i.e. which use a portion of the demands to gain some estimate of the demand statistics, and then uses the same for the remaining demands), showing it is order-wise strictly suboptimal. We then study a simple adaptive scheme - which myopically attempts to store the most recently requested content on idle servers - and show it outperforms schemes that separate learning and placement. Our results also generalize to the setting where the demand statistics change with time. Overall, our results demonstrate that separating the estimation of demand, and the subsequent use of the same, is strictly suboptimal.

[1]  Edith Cohen,et al.  Search and replication in unstructured peer-to-peer networks , 2002 .

[2]  Sharayu Moharir,et al.  Serving content with unknown demand: the high-dimensional regime , 2015, Queueing Syst. Theory Appl..

[3]  Laurent Massoulié,et al.  Bipartite graph structures for efficient balancing of heterogeneous loads , 2012, SIGMETRICS '12.

[4]  Laurent Massoulié,et al.  Adaptive Replication in Distributed Content Delivery Networks , 2014, ArXiv.

[5]  Edith Cohen,et al.  Replication strategies in unstructured peer-to-peer networks , 2002, SIGCOMM.

[6]  Edith Cohen,et al.  Search and replication in unstructured peer-to-peer networks , 2002, SIGMETRICS '02.

[7]  Zongpeng Li,et al.  Youtube traffic characterization: a view from the edge , 2007, IMC '07.

[8]  Ward Whitt,et al.  A Staffing Algorithm for Call Centers with Skill-Based Routing , 2005, Manuf. Serv. Oper. Manag..

[9]  David A. McAllester,et al.  On the Convergence Rate of Good-Turing Estimators , 2000, COLT.

[10]  Edward A. Fox,et al.  Removal policies in network caches for World-Wide Web documents , 1996, SIGCOMM '96.

[11]  Virgílio A. F. Almeida,et al.  A hierarchical characterization of a live streaming media workload , 2006, TNET.

[12]  Edward A. Fox,et al.  Caching Proxies: Limitations and Potentials , 1995, WWW.

[13]  Michele Garetto,et al.  Stochastic analysis of self-sustainability in peer-assisted VoD systems , 2012, 2012 Proceedings IEEE INFOCOM.

[14]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[15]  John C. S. Lui,et al.  Exploring the Optimal Replication Strategy in P2P-VoD Systems: Characterization and Evaluation , 2012, IEEE Transactions on Parallel and Distributed Systems.

[16]  Jia Wang,et al.  A survey of web caching schemes for the Internet , 1999, CCRV.

[17]  Cheng-Zhong Xu,et al.  Optimal video replication and placement on a cluster of video-on-demand servers , 2002, Proceedings International Conference on Parallel Processing.

[18]  Jussi Kangasharju,et al.  Optimizing File Availability in Peer-to-Peer Content Distribution , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[19]  Ben Y. Zhao,et al.  Understanding user behavior in large-scale video-on-demand systems , 2006, EuroSys.

[20]  Yipeng Zhou,et al.  On Replication Algorithm in P2P VoD , 2013, IEEE/ACM Transactions on Networking.

[21]  I. Good THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .

[22]  R. Srikant,et al.  Fast Algorithms and Performance Bounds for Sum Rate Maximization in Wireless Networks , 2013, IEEE/ACM Transactions on Networking.

[23]  Philippe Robert,et al.  Impact of traffic mix on caching performance in a content-centric network , 2012, 2012 Proceedings IEEE INFOCOM Workshops.

[24]  John N. Tsitsiklis,et al.  Queueing system topologies with limited flexibility , 2013, SIGMETRICS '13.

[25]  WhittWard,et al.  A Staffing Algorithm for Call Centers with Skill-Based Routing , 2005 .

[26]  Jussi Kangasharju,et al.  Object replication strategies in content distribution networks , 2002, Comput. Commun..

[27]  Gregory Valiant,et al.  Estimating the unseen: an n/log(n)-sample estimator for entropy and support size, shown optimal via new CLTs , 2011, STOC '11.

[28]  Paolo Giaccone,et al.  Temporal locality in today's content caching: why it matters and how to model it , 2013, CCRV.

[29]  Edward A. Fox,et al.  Removal Policies in Network Caches for World-Wide Web Documents , 1996, SIGCOMM.

[30]  Ian T. Foster,et al.  Small-world file-sharing communities , 2003, IEEE INFOCOM 2004.