Accurate Learning or Fast Mixing? Dynamic Adaptability of Caching Algorithms

Typical analysis of content caching algorithms using the metric of steady state hit probability under a stationary request process does not account for performance loss under a variable request arrival process. In this paper, we instead conceptualize caching algorithms as complexity-limited online distribution learning algorithms and use this vantage point to study their adaptability from two perspectives: 1) the accuracy of learning a fixed popularity distribution and 2) the speed of learning items’ popularity. In order to attain this goal, we compute the distance between the stationary distributions of several popular algorithms with that of a genie-aided algorithm that has the knowledge of the true popularity ranking, which we use as a measure of learning accuracy. We then characterize the mixing time of each algorithm, i.e., the time needed to attain the stationary distribution, which we use as a measure of learning efficiency. We merge both the abovementioned measures to obtain the “learning error” representing both how quickly and how accurately an algorithm learns the optimal caching distribution and use this to determine the trade-off between these two objectives of many popular caching algorithms. Informed by the results of our analysis, we propose a novel hybrid algorithm, adaptive-least recently used, that learns both faster and better the changes in the popularity. We show numerically that it also outperforms all other candidate algorithms when confronted with either a dynamically changing synthetic request process or using real world traces.

[1]  Ronald Fagin,et al.  Asymptotic Miss Ratios over Independent References , 1977, J. Comput. Syst. Sci..

[2]  Pablo Rodriguez,et al.  I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system , 2007, IMC '07.

[3]  C. Villani Optimal Transport: Old and New , 2008 .

[4]  Heriot-Watt University AN OVERVIEW OF SOME STOCHASTIC STABILITY METHODS , 2004 .

[5]  J. Cheeger A lower bound for the smallest eigenvalue of the Laplacian , 1969 .

[6]  Daniel S. Hirschberg,et al.  Self-organizing linear search , 1985, CSUR.

[7]  Alistair Sinclair,et al.  Improved Bounds for Mixing Rates of Markov Chains and Multicommodity Flow , 1992, Combinatorics, Probability and Computing.

[8]  Erol Gelenbe,et al.  A Unified Approach to the Evaluation of a Class of Replacement Algorithms , 1973, IEEE Transactions on Computers.

[9]  J. A. Fill Eigenvalue bounds on convergence to stationarity for nonreversible markov chains , 1991 .

[10]  Ravi Montenegro,et al.  Mathematical Aspects of Mixing Times in Markov Chains , 2006, Found. Trends Theor. Comput. Sci..

[11]  R. Phatarfod,et al.  On the transition probabilities of the move-to-front scheme , 1994, Journal of Applied Probability.

[12]  V. Climenhaga Markov chains and mixing times , 2013 .

[13]  Milena Mihail,et al.  Conductance and convergence of Markov chains-a combinatorial treatment of expanders , 1989, 30th Annual Symposium on Foundations of Computer Science.

[14]  Yu Gu,et al.  Watch global, cache local: YouTube network traffic at a campus network: measurements and implications , 2008, Electronic Imaging.

[15]  Florin Ciucu,et al.  Exact analysis of TTL cache networks , 2014, Perform. Evaluation.

[16]  W. J. Hendricks An Account of Self-Organizing Systems , 1976, SIAM J. Comput..

[17]  Giuseppe Bianchi,et al.  Check before storing: what is the performance price of content integrity verification in LRU caching? , 2013, CCRV.

[18]  Javad Ghaderi,et al.  Adaptive TTL-Based Caching for Content Delivery , 2017, SIGMETRICS.

[19]  Yong-Yeol Ahn,et al.  Analyzing the Video Popularity Characteristics of Large-Scale User Generated Content Systems , 2009, IEEE/ACM Transactions on Networking.

[20]  Hao Che,et al.  Hierarchical Web caching systems: modeling, design and experimental results , 2002, IEEE J. Sel. Areas Commun..

[21]  Donald F. Towsley,et al.  Approximate Models for General Cache Networks , 2010, 2010 Proceedings IEEE INFOCOM.

[22]  Benny Van Houdt,et al.  Asymptotically Exact TTL-Approximations of the Cache Replacement Algorithms LRU(m) and h-LRU , 2016, 2016 28th International Teletraffic Congress (ITC 28).

[23]  Vijay G. Subramanian,et al.  Eigenvalues of LRU via a linear algebraic approach , 2018, Oper. Res. Lett..

[24]  Stratis Ioannidis,et al.  Orchestrating massively distributed CDNs , 2012, CoNEXT '12.

[25]  Asit Dan,et al.  An approximate analysis of the LRU and FIFO buffer replacement schemes , 1990, SIGMETRICS '90.

[26]  Ronald Fagin,et al.  Comparing top k lists , 2003, SODA '03.

[27]  Jie Li,et al.  Performance analysis of local caching replacement policies for internet video streaming services , 2014, 2014 22nd International Conference on Software, Telecommunications and Computer Networks (SoftCOM).

[28]  Laszlo A. Belady,et al.  A Study of Replacement Algorithms for Virtual-Storage Computer , 1966, IBM Syst. J..

[29]  Nimrod Megiddo,et al.  ARC: A Self-Tuning, Low Overhead Replacement Cache , 2003, FAST.

[30]  Philippe Robert,et al.  Impact of traffic mix on caching performance in a content-centric network , 2012, 2012 Proceedings IEEE INFOCOM Workshops.

[31]  Donald F. Towsley,et al.  Analysis of TTL-based cache networks , 2012, 6th International ICST Conference on Performance Evaluation Methodologies and Tools.

[32]  David Tse,et al.  Probabilistic methods for web caching , 2001, Perform. Evaluation.

[33]  F. Chung Laplacians and the Cheeger Inequality for Directed Graphs , 2005 .

[34]  Sergei Vassilvitskii,et al.  Generalized distances between rankings , 2010, WWW '10.

[35]  Peter J. Denning,et al.  Operating Systems Theory , 1973 .

[36]  James R. Bitner,et al.  Heuristics That Dynamically Organize Data Structures , 1979, SIAM J. Comput..

[37]  R. Phatarfod On the matrix occurring in a linear search problem , 1991, Journal of Applied Probability.

[38]  S. Foss,et al.  AN OVERVIEW OF SOME STOCHASTIC STABILITY METHODS( Network Design, Control and Optimization) , 2004 .

[39]  E. G. Coffman,et al.  Stochastic Analysis of Computer Storage , 1987 .

[40]  Michele Garetto,et al.  A unified approach to the performance analysis of caching systems , 2013, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[41]  James Allen Fill,et al.  An exact formula for the move-to-front rule for self-organizing lists , 1996 .

[42]  Anthony Unwin,et al.  Reversibility and Stochastic Networks , 1980 .