Asymptotic insensitivity of least-recently-used caching to statistical dependency

We investigate a widely popular least-recently-used (LRU) cache replacement algorithm with semiMarkov modulated requests. SemiMarkov processes provide the flexibility for modeling strong statistical correlation, including the broadly reported long-range dependence in the World Wide Web page request patterns. When the frequency of requesting a page n is equal to the generalized Zipf's law c/n/sup /spl alpha//, /spl alpha/ > 1, our main result shows that the cache fault probability is asymptotically, for large cache sizes, the same as in the corresponding LRU system with i.i.d. requests. This appears to be the first explicit average case analysis of LRU caching with statistically dependent request sequences. The surprising insensitivity of LRU caching performance demonstrates its robustness to changes in document popularity. Furthermore, we show that the derived asymptotic result and simulation experiments are in excellent agreement, even for relatively small cache sizes. The potential of using our results in predicting the behavior of Web caches is tested using actual, strongly correlated, proxy server access traces.

[1]  James Allen Fill,et al.  On the distribution of search cost for the move-to-front rule , 1996 .

[2]  P. Jelenkovic Asymptotic approximation of the move-to-front search cost distribution and least-recently used caching fault probabilities , 1999 .

[3]  James Allen Fill,et al.  An exact formula for the move-to-front rule for self-organizing lists , 1996 .

[4]  Sandy Irani,et al.  Strongly competitive algorithms for paging with locality of reference , 1992, SODA '92.

[5]  James Allen Fill,et al.  On the distribution of search cost for the move-to-front rule , 1996, Random Struct. Algorithms.

[6]  Allan Borodin,et al.  Competitive paging with locality of reference , 1991, STOC '91.

[7]  Robert E. Tarjan,et al.  Amortized efficiency of list update and paging rules , 1985, CACM.

[8]  A. Lazar,et al.  Subexponential asymptotics of a Markov-modulated random walk with queueing applications , 1998, Journal of Applied Probability.

[9]  David Thomas,et al.  The Art in Computer Programming , 2001 .

[10]  Philippe Flajolet,et al.  Birthday Paradox, Coupon Collectors, Caching Algorithms and Self-Organizing Search , 1992, Discret. Appl. Math..

[11]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[12]  Virgílio A. F. Almeida,et al.  Characterizing reference locality in the WWW , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[13]  Predrag R. Jelenkovic,et al.  Least-recently-used caching with dependent requests , 2004, Theor. Comput. Sci..

[14]  L BentleyJon,et al.  Amortized analyses of self-organizing sequential search heuristics , 1985 .